3 ways to spot a bad statistic Mona Chalabi

I’m going to be talking
about statistics today.

If that makes you immediately feel
a little bit wary, that’s OK,

that doesn’t make you some
kind of crazy conspiracy theorist,

it makes you skeptical.

And when it comes to numbers,
especially now, you should be skeptical.

But you should also be able to tell
which numbers are reliable

and which ones aren’t.

So today I want to try to give you
some tools to be able to do that.

But before I do,

I just want to clarify which numbers
I’m talking about here.

I’m not talking about claims like,

“9 out of 10 women recommend
this anti-aging cream.”

I think a lot of us always
roll our eyes at numbers like that.

What’s different now is people
are questioning statistics like,

“The US unemployment
rate is five percent.”

What makes this claim different is
it doesn’t come from a private company,

it comes from the government.

About 4 out of 10 Americans
distrust the economic data

that gets reported by government.

Among supporters of President Trump
it’s even higher;

it’s about 7 out of 10.

I don’t need to tell anyone here

that there are a lot of dividing lines
in our society right now,

and a lot of them start to make sense,

once you understand people’s relationships
with these government numbers.

On the one hand, there are those who say
these statistics are crucial,

that we need them to make sense
of society as a whole

in order to move beyond
emotional anecdotes

and measure progress
in an [objective] way.

And then there are the others,

who say that these statistics are elitist,

maybe even rigged;

they don’t make sense
and they don’t really reflect

what’s happening
in people’s everyday lives.

It kind of feels like that second group
is winning the argument right now.

We’re living in a world
of alternative facts,

where people don’t find statistics
this kind of common ground,

this starting point for debate.

This is a problem.

There are actually
moves in the US right now

to get rid of some government
statistics altogether.

Right now there’s a bill in congress
about measuring racial inequality.

The draft law says that government
money should not be used

to collect data on racial segregation.

This is a total disaster.

If we don’t have this data,

how can we observe discrimination,

let alone fix it?

In other words:

How can a government create fair policies

if they can’t measure
current levels of unfairness?

This isn’t just about discrimination,

it’s everything – think about it.

How can we legislate on health care

if we don’t have good data
on health or poverty?

How can we have public debate
about immigration

if we can’t at least agree

on how many people are entering
and leaving the country?

Statistics come from the state;
that’s where they got their name.

The point was to better
measure the population

in order to better serve it.

So we need these government numbers,

but we also have to move
beyond either blindly accepting

or blindly rejecting them.

We need to learn the skills
to be able to spot bad statistics.

I started to learn some of these

when I was working
in a statistical department

that’s part of the United Nations.

Our job was to find out how many Iraqis
had been forced from their homes

as a result of the war,

and what they needed.

It was really important work,
but it was also incredibly difficult.

Every single day, we were making decisions

that affected the accuracy
of our numbers –

decisions like which parts
of the country we should go to,

who we should speak to,

which questions we should ask.

And I started to feel
really disillusioned with our work,

because we thought we were doing
a really good job,

but the one group of people
who could really tell us were the Iraqis,

and they rarely got the chance to find
our analysis, let alone question it.

So I started to feel really determined

that the one way to make
numbers more accurate

is to have as many people as possible
be able to question them.

So I became a data journalist.

My job is finding these data sets
and sharing them with the public.

Anyone can do this,
you don’t have to be a geek or a nerd.

You can ignore those words;
they’re used by people

trying to say they’re smart
while pretending they’re humble.

Absolutely anyone can do this.

I want to give you guys three questions

that will help you be able to spot
some bad statistics.

So, question number one
is: Can you see uncertainty?

One of things that’s really changed
people’s relationship with numbers,

and even their trust in the media,

has been the use of political polls.

I personally have a lot of issues
with political polls

because I think the role of journalists
is actually to report the facts

and not attempt to predict them,

especially when those predictions
can actually damage democracy

by signaling to people:
don’t bother to vote for that guy,

he doesn’t have a chance.

Let’s set that aside for now and talk
about the accuracy of this endeavor.

Based on national elections
in the UK, Italy, Israel

and of course, the most recent
US presidential election,

using polls to predict electoral outcomes

is about as accurate as using the moon
to predict hospital admissions.

No, seriously, I used actual data
from an academic study to draw this.

There are a lot of reasons why
polling has become so inaccurate.

Our societies have become really diverse,

which makes it difficult for pollsters
to get a really nice representative sample

of the population for their polls.

People are really reluctant to answer
their phones to pollsters,

and also, shockingly enough,
people might lie.

But you wouldn’t necessarily
know that to look at the media.

For one thing, the probability
of a Hillary Clinton win

was communicated with decimal places.

We don’t use decimal places
to describe the temperature.

How on earth can predicting the behavior
of 230 million voters in this country

be that precise?

And then there were those sleek charts.

See, a lot of data visualizations
will overstate certainty, and it works –

these charts can numb
our brains to criticism.

When you hear a statistic,
you might feel skeptical.

As soon as it’s buried in a chart,

it feels like some kind
of objective science,

and it’s not.

So I was trying to find ways
to better communicate this to people,

to show people the uncertainty
in our numbers.

What I did was I started taking
real data sets,

and turning them into
hand-drawn visualizations,

so that people can see
how imprecise the data is;

so people can see that a human did this,

a human found the data and visualized it.

For example, instead
of finding out the probability

of getting the flu in any given month,

you can see the rough
distribution of flu season.

This is –

(Laughter)

a bad shot to show in February.

But it’s also more responsible
data visualization,

because if you were to show
the exact probabilities,

maybe that would encourage
people to get their flu jabs

at the wrong time.

The point of these shaky lines

is so that people remember
these imprecisions,

but also so they don’t necessarily
walk away with a specific number,

but they can remember important facts.

Facts like injustice and inequality
leave a huge mark on our lives.

Facts like Black Americans and Native
Americans have shorter life expectancies

than those of other races,

and that isn’t changing anytime soon.

Facts like prisoners in the US
can be kept in solitary confinement cells

that are smaller than the size
of an average parking space.

The point of these visualizations
is also to remind people

of some really important
statistical concepts,

concepts like averages.

So let’s say you hear a claim like,

“The average swimming pool in the US
contains 6.23 fecal accidents.”

That doesn’t mean every single
swimming pool in the country

contains exactly 6.23 turds.

So in order to show that,

I went back to the original data,
which comes from the CDC,

who surveyed 47 swimming facilities.

And I just spent one evening
redistributing poop.

So you can kind of see
how misleading averages can be.

(Laughter)

OK, so the second question
that you guys should be asking yourselves

to spot bad numbers is:

Can I see myself in the data?

This question is also
about averages in a way,

because part of the reason
why people are so frustrated

with these national statistics,

is they don’t really tell the story
of who’s winning and who’s losing

from national policy.

It’s easy to understand why people
are frustrated with global averages

when they don’t match up
with their personal experiences.

I wanted to show people the way
data relates to their everyday lives.

I started this advice column
called “Dear Mona,”

where people would write to me
with questions and concerns

and I’d try to answer them with data.

People asked me anything.

questions like, “Is it normal to sleep
in a separate bed to my wife?”

“Do people regret their tattoos?”

“What does it mean to die
of natural causes?”

All of these questions are great,
because they make you think

about ways to find
and communicate these numbers.

If someone asks you,
“How much pee is a lot of pee?”

which is a question that I got asked,

you really want to make sure
that the visualization makes sense

to as many people as possible.

These numbers aren’t unavailable.

Sometimes they’re just buried
in the appendix of an academic study.

And they’re certainly not inscrutable;

if you really wanted to test
these numbers on urination volume,

you could grab a bottle
and try it for yourself.

(Laughter)

The point of this isn’t necessarily

that every single data set
has to relate specifically to you.

I’m interested in how many women
were issued fines in France

for wearing the face veil, or the niqab,

even if I don’t live in France
or wear the face veil.

The point of asking where you fit in
is to get as much context as possible.

So it’s about zooming out
from one data point,

like the unemployment rate
is five percent,

and seeing how it changes over time,

or seeing how it changes
by educational status –

this is why your parents always
wanted you to go to college –

or seeing how it varies by gender.

Nowadays, male unemployment rate is higher

than the female unemployment rate.

Up until the early ’80s,
it was the other way around.

This is a story of one
of the biggest changes

that’s happened in American society,

and it’s all there in that chart,
once you look beyond the averages.

The axes are everything;

once you change the scale,
you can change the story.

OK, so the third and final question
that I want you guys to think about

when you’re looking at statistics is:

How was the data collected?

So far, I’ve only talked about the way
data is communicated,

but the way it’s collected
matters just as much.

I know this is tough,

because methodologies can be opaque
and actually kind of boring,

but there are some simple steps
you can take to check this.

I’ll use one last example here.

One poll found that 41 percent of Muslims
in this country support jihad,

which is obviously pretty scary,

and it was reported everywhere in 2015.

When I want to check a number like that,

I’ll start off by finding
the original questionnaire.

It turns out that journalists
who reported on that statistic

ignored a question
lower down on the survey

that asked respondents
how they defined “jihad.”

And most of them defined it as,

“Muslims' personal, peaceful struggle
to be more religious.”

Only 16 percent defined it as,
“violent holy war against unbelievers.”

This is the really important point:

based on those numbers,
it’s totally possible

that no one in the survey
who defined it as violent holy war

also said they support it.

Those two groups might not overlap at all.

It’s also worth asking
how the survey was carried out.

This was something called an opt-in poll,

which means anyone could have found it
on the internet and completed it.

There’s no way of knowing
if those people even identified as Muslim.

And finally, there were 600
respondents in that poll.

There are roughly three million
Muslims in this country,

according to Pew Research Center.

That means the poll spoke to roughly
one in every 5,000 Muslims

in this country.

This is one of the reasons

why government statistics
are often better than private statistics.

A poll might speak to a couple
hundred people, maybe a thousand,

or if you’re L’Oreal, trying to sell
skin care products in 2005,

then you spoke to 48 women
to claim that they work.

(Laughter)

Private companies don’t have a huge
interest in getting the numbers right,

they just need the right numbers.

Government statisticians aren’t like that.

In theory, at least,
they’re totally impartial,

not least because most of them do
their jobs regardless of who’s in power.

They’re civil servants.

And to do their jobs properly,

they don’t just speak
to a couple hundred people.

Those unemployment numbers
I keep on referencing

come from the Bureau of Labor Statistics,

and to make their estimates,

they speak to over 140,000
businesses in this country.

I get it, it’s frustrating.

If you want to test a statistic
that comes from a private company,

you can buy the face cream for you
and a bunch of friends, test it out,

if it doesn’t work,
you can say the numbers were wrong.

But how do you question
government statistics?

You just keep checking everything.

Find out how they collected the numbers.

Find out if you’re seeing everything
on the chart you need to see.

But don’t give up on the numbers
altogether, because if you do,

we’ll be making public policy
decisions in the dark,

using nothing but private
interests to guide us.

Thank you.

(Applause)

今天我要讲的是
统计学。

如果这让你立即感到
有点警惕,那没关系,

这不会让你成为
某种疯狂的阴谋论者,

它会让你怀疑。

当谈到数字时,
尤其是现在,你应该持怀疑态度。

但是你也应该能够分辨出
哪些数字是可靠的

,哪些不是。

所以今天我想尝试给你
一些工具来做到这一点。

但在此之前,

我只想澄清
我在这里谈论的数字。

我不是在谈论诸如

“十分之九的女性推荐
这种抗衰老霜”之类的说法。

我想我们很多人总是
对这样的数字视而不见。

现在不同的是,
人们质疑

“美国
失业率为 5%”之类的统计数据。

这种说法的不同之处在于
它不是来自私人公司,

而是来自政府。

大约十分之四的美国人

不信任政府报告的经济数据。

在特朗普总统的支持者中,
这一比例更高;

大约是十分之七。

我不需要在这里告诉任何人


现在我们的社会有很多分界线

一旦你了解了人们
与这些政府数字的关系,其中很多就开始有意义了。

一方面,有人说
这些统计数据至关重要

,我们需要它们来
理解整个社会

,以便超越
情感轶事


以[客观]的方式衡量进步。

然后还有其他人,

他们说这些统计数据是精英主义的,

甚至可能是被操纵的;

它们没有意义
,也没有真正反映

人们日常生活中正在发生的事情

感觉就像第二组
现在正在赢得争论。

我们生活在一个充满
替代事实的世界中

,人们找不到统计数据
这种共同点,也找不到

辩论的起点。

这是个问题。

实际上
,美国现在正在采取行动完全

摆脱一些政府
统计数据。

现在国会有一项
关于衡量种族不平等的法案。

法律草案规定,
不应使用政府资金

来收集有关种族隔离的数据。

这是一场彻头彻尾的灾难。

如果我们没有这些数据,

我们怎么能观察到歧视,

更不用说修复它了?

换句话说:

如果政府无法衡量
当前的不公平程度,他们如何制定公平政策?

这不仅仅是关于歧视,

而是一切——想想吧。

如果我们没有
关于健康或贫困的良好数据,我们如何就医疗保健立法?

如果我们至少不能

就进入
和离开这个国家的人数达成一致,我们怎么能就移民问题进行公开辩论?

统计来自国家;
这就是他们得名的地方。

关键是要更好地
衡量人口

,以便更好地为它服务。

所以我们需要这些政府数字,

但我们也必须
超越盲目接受

或拒绝它们。

我们需要学习
能够发现不良统计数据的技能。

当我在联合国
的一个统计

部门工作时,我开始学习其中的一些内容。

我们的工作是找出有多少伊拉克人
因战争而被迫离开家园

以及他们需要什么。

这是一项非常重要的工作,
但也非常困难。

每一天,我们都在

做出影响
我们数字准确性的

决定——比如
我们应该去这个国家的哪些地区,我们应该和

谁交谈,

我们应该问哪些问题。

我开始
对我们的工作感到非常失望,

因为我们认为我们
做得非常好,


真正能告诉我们的一群人是伊拉克人

,他们很少有机会找到
我们的分析,更不用说 质疑它。

所以我开始真正下定

决心,让数字更准确的一种方法

是让尽可能多的人
能够质疑它们。

所以我成为了一名数据记者。

我的工作是找到这些数据集
并与公众分享。

任何人都可以做到这一点,
你不必是一个极客或书呆子。

你可以忽略那些话;
人们使用它们

试图说自己很聪明,
同时假装自己很谦虚。

绝对任何人都可以做到这一点。

我想给你们三个问题

,帮助你们发现
一些不好的统计数据。

所以,第一个问题
是:你能看到不确定性吗?

真正改变
人们与数字的关系,

甚至改变他们对媒体的信任的一件事

就是使用政治民意调查。

我个人
对政治民意调查有很多问题,

因为我认为记者的作用
实际上是报道事实

而不是试图预测它们,

尤其是当这些
预测实际上会

通过向人们发出信号来破坏民主时:
不要费心去投票 那家伙,

他没有机会。

让我们暂时把它放在一边,
谈谈这项工作的准确性。

根据
英国、意大利、以色列的全国大选

,当然还有最近的
美国总统大选,

使用民意调查预测选举结果

与使用月球预测住院人数一样准确

不,说真的,我使用
了一项学术研究的实际数据来得出这个结论。

投票变得如此不准确的原因有很多。

我们的社会已经变得非常多样化,

这使得民意调查人员很难为他们的民意调查
获得一个非常好的代表性

人口样本。

人们真的不愿意
给民意调查者接电话,

而且,令人震惊的是,
人们可能会撒谎。

但你不一定会
知道这一点,看看媒体。

一方面,
希拉里·克林顿获胜的概率

是用小数位表示的。

我们不使用小数位
来描述温度。

这个国家2.3亿选民的行为怎么能预测

得那么准确呢?

然后是那些圆滑的图表。

看,很多数据可视化
会夸大确定性,但它确实有效——

这些图表会使
我们的大脑麻木,无法接受批评。

当您听到统计数据时,
您可能会感到怀疑。

一旦它被隐藏在图表中,

就感觉像是
某种客观科学

,但事实并非如此。

所以我试图
找到更好地与人们交流这一点的方法,

向人们展示
我们数字中的不确定性。

我所做的是我开始获取
真实的数据集,

并将它们变成
手绘的可视化,

这样人们就可以看到
数据是多么的不精确;

所以人们可以看到这是一个人做的,

一个人找到了数据并将其可视化。

例如,您可以查看流感季节的粗略分布,而
不是找出

在任何给定月份感染流感的概率

这是——

(笑声)

二月份的一个糟糕的表现。

但它也是更负责任的
数据可视化,

因为如果你要
显示确切的概率,

也许这会鼓励
人们

在错误的时间接种流感疫苗。

这些摇摇晃晃的线条的目的

是让人们记住
这些不精确性,

但也让他们不必
带着特定的数字走开,

但他们可以记住重要的事实。

不公正和不平等等事实
在我们的生活中留下了巨大的印记。

像黑人和美洲原住民这样的事实

比其他种族的预期寿命更短,

而且这种情况不会很快改变。

像美国的囚犯这样的事实
可以被关在

比普通停车位还小的单独监禁牢房中

这些可视化的
目的还在于提醒

人们一些非常重要的
统计概念,

例如平均值等概念。

因此,假设您听到这样的说法,

“美国的平均游泳池
包含 6.23 起粪便事故。”

这并不意味着
该国的每个游泳池都

包含 6.23 个粪便。

因此,为了证明这一点,

我回到了来自 CDC 的原始数据,该数据

对 47 个游泳设施进行了调查。

我只花了一个晚上
重新分配便便。

因此,您
可以看到平均值的误导性。

(笑声)

好的,
所以你们应该问

自己发现错误数字的第二个问题是:

我能在数据中看到自己吗?

这个问题
在某种程度上也是关于平均值的,

因为
人们对

这些国家统计数据如此沮丧的部分原因

是,他们并没有真正讲述

国家政策中谁赢谁输的故事。

很容易理解为什么
人们在与个人经历不相符时会对全球平均水平感到沮丧

我想向人们展示
数据与他们日常生活的关系。

我开始了这个
名为“亲爱的莫娜”的建议专栏

,人们会写信给我
提出问题和疑虑

,我会尝试用数据来回答他们。

人们问我什么。

诸如“和
我妻子睡在一张单独的床上是正常的吗?”之类的问题。

“人们会后悔自己的纹身吗?”

“自然死亡是什么意思
?”

所有这些问题都很好,
因为它们让您

思考查找
和交流这些数字的方法。

如果有人问你,
“多少尿是很多尿?”

这是我被问到的一个问题,

你真的想
确保可视化

对尽可能多的人有意义。

这些数字不可用。

有时它们只是埋
在学术研究的附录中。

它们当然不是高深莫测的。

如果你真的想测试
这些数字的排尿量,

你可以拿
一瓶自己试试。

(笑声

) 这里的重点不一定

是每个数据集
都必须与你特别相关。

我感兴趣的
是法国有多少女性

因戴面纱或面纱而被罚款,

即使我不住在法国
也不戴面纱。

询问您适合的位置
是为了获得尽可能多的背景信息。

所以这是关于
从一个数据点缩小,

比如失业率
是5%

,看看它是如何随着时间变化的,

或者看看它是如何随着
教育状况而变化的——

这就是为什么你的父母总是
想让你上大学——

或查看它如何因性别而异。

如今,男性失业率

高于女性失业率。

直到 80 年代初,
情况正好相反。

这是一个关于

美国社会发生的最大变化之一的故事

,一旦你超越平均水平,它就在这张图表中

轴就是一切;

一旦你改变了规模,
你就可以改变故事。

好的,
所以我希望

你们在查看统计数据时考虑的第三个也是最后一个问题是:数据是

如何收集的?

到目前为止,我只讨论了
数据的通信方式,

但收集数据的方式
同样重要。

我知道这很难,

因为方法可能不透明
,实际上有点无聊,

但是您可以采取一些简单的步骤
来检查这一点。

我将在这里使用最后一个示例。

一项民意调查发现,这个国家有 41% 的
穆斯林支持圣战,

这显然是相当可怕的

,2015 年到处都有报道。

当我想检查这样的数字时,

我会从
寻找原始问卷开始。

事实证明
,报道该统计数据的记者

忽略了调查中较低的一个问题,该问题

询问受访者
如何定义“圣战”。

他们中的大多数人将其定义为

“穆斯林
为更加虔诚而进行的个人和平斗争”。

只有 16% 的人将其定义为
“针对非信徒的暴力圣战”。

这是非常重要的一点:

根据这些数字,

调查
中将其定义为暴力圣战的人完全有可能

也表示支持。

这两组可能根本不重叠。

同样值得一问的
是调查是如何进行的。

这就是所谓的选择加入民意调查,

这意味着任何人都可以
在互联网上找到并完成它。

没有办法
知道这些人是否甚至被认定为穆斯林。

最后,
该民意调查有 600 名受访者。

根据皮尤研究中心的数据,这个国家大约有 300 万穆斯林。

这意味着该民意调查涉及该国
大约每 5,000 名穆斯林

中的一名。

这是

政府统计
数据往往优于私人统计数据的原因之一。

一项民意调查可能会与
几百人交谈,也许是一千人,

或者如果你是欧莱雅,
在 2005 年试图销售护肤品,

那么你采访了 48 位女性
,声称她们在工作。

(笑声)

私营公司对正确的数字没有太大的
兴趣,

他们只需要正确的数字。

政府统计学家不是这样的。

至少在理论上,
他们是完全不偏不倚的

,尤其是因为他们中的大多数人都在做
自己的工作,而不管谁掌权。

他们是公务员。

为了做好自己的工作,

他们不只是
与几百人交谈。

我一直参考的那些失业数据

来自劳工统计局

,为了做出估计,

他们与这个国家超过 140,000 家
企业交谈。

我明白了,这很令人沮丧。

如果您想测试
来自私人公司的统计数据,

您可以为您
和一群朋友购买面霜,测试一下,

如果不起作用,
您可以说数字是错误的。

但是你如何质疑
政府的统计数据?

你只是不断检查一切。

了解他们如何收集这些数字。

确定您是否
在图表上看到了您需要查看的所有内容。

但是不要完全放弃数字
,因为如果你这样做,

我们将
在黑暗中做出公共政策决定,

只使用私人
利益来指导我们。

谢谢你。

(掌声)