Why you should love statistics Alan Smith

Back in 2003,

the UK government carried out a survey.

And it was a survey that measured
levels of numeracy

in the population.

And they were shocked to find out

that for every 100 working age
adults in the country,

47 of them lacked Level 1 numeracy skills.

Now, Level 1 numeracy skills –
that’s low-end GCSE score.

It’s the ability to deal with fractions,
percentages and decimals.

So this figure prompted
a lot of hand-wringing in Whitehall.

Policies were changed,

investments were made,

and then they ran
the survey again in 2011.

So can you guess
what happened to this number?

It went up to 49.

(Laughter)

And in fact, when I reported
this figure in the FT,

one of our readers joked and said,

“This figure is only shocking
to 51 percent of the population.”

(Laughter)

But I preferred, actually,
the reaction of a schoolchild

when I presented
at a school this information,

who raised their hand and said,

“How do we know that the person
who made that number

isn’t one of the 49 percent either?”

(Laughter)

So clearly, there’s a numeracy issue,

because these are
important skills for life,

and a lot of the changes
that we want to introduce in this century

involve us becoming
more comfortable with numbers.

Now, it’s not just an English problem.

OECD this year released some figures
looking at numeracy in young people,

and leading the way, the USA –

nearly 40 percent of young people
in the US have low numeracy.

Now, England is there too,

but there are seven OECD countries
with figures above 20 percent.

That is a problem,
because it doesn’t have to be that way.

If you look at the far end of this graph,

you can see the Netherlands and Korea
are in single figures.

So there’s definitely a numeracy
problem that we want to address.

Now, as useful as studies like these are,

I think we risk herding people
inadvertently into one of two categories;

that there are two kinds of people:

those people that are comfortable
with numbers, that can do numbers,

and the people who can’t.

And what I’m trying
to talk about here today

is to say that I believe
that is a false dichotomy.

It’s not an immutable pairing.

I think you don’t have to have
tremendously high levels of numeracy

to be inspired by numbers,

and that should be the starting point
to the journey ahead.

And one of the ways in which
we can begin that journey, for me,

is looking at statistics.

Now, I am the first to acknowledge
that statistics has got somewhat

of an image problem.

(Laughter)

It’s the part of mathematics

that even mathematicians
don’t particularly like,

because whereas the rest of maths
is all about precision and certainty,

statistics is almost the reverse of that.

But actually, I was a late convert
to the world of statistics myself.

If you’d asked my undergraduate professors

what two subjects would I be least likely
to excel in after university,

they’d have told you statistics
and computer programming,

and yet here I am, about to show you
some statistical graphics

that I programmed.

So what inspired that change in me?

What made me think that statistics
was actually an interesting thing?

It’s really because
statistics are about us.

If you look at the etymology
of the word statistics,

it’s the science of dealing with data

about the state or the community
that we live in.

So statistics are about us as a group,

not us as individuals.

And I think as social animals,

we share this fascination about how
we as individuals relate to our groups,

to our peers.

And statistics in this way
are at their most powerful

when they surprise us.

And there’s been some really wonderful
surveys carried out recently

by Ipsos MORI in the last few years.

They did a survey of over
1,000 adults in the UK,

and said, for every 100 people
in England and Wales,

how many of them are Muslim?

Now the average answer from this survey,

which was supposed to be representative
of the total population, was 24.

That’s what people thought.

British people think 24 out of every 100
people in the country are Muslim.

Now, official figures reveal
that figure to be about five.

So there’s this big variation
between what we think, our perception,

and the reality as given by statistics.

And I think that’s interesting.

What could possibly be causing
that misperception?

And I was so thrilled with this study,

I started to take questions out
in presentations. I was referring to it.

Now, I did a presentation

at St. Paul’s School for Girls
in Hammersmith,

and I had an audience rather like this,

except it was comprised entirely
of sixth-form girls.

And I said, “Girls,

how many teenage girls do you think
the British public think

get pregnant every year?”

And the girls were apoplectic when I said

the British public think that 15
out of every 100 teenage girls

get pregnant in the year.

And they had every right to be angry,

because in fact, I’d have to have
closer to 200 dots

before I could color one in,

in terms of what
the official figures tell us.

And rather like numeracy,
this is not just an English problem.

Ipsos MORI expanded the survey
in recent years to go across the world.

And so, they asked Saudi Arabians,

for every 100 adults in your country,

how many of them are overweight or obese?

And the average answer from the Saudis
was just over a quarter.

That’s what they thought.

Just over a quarter of adults
are overweight or obese.

The official figures show, actually,
it’s nearer to three-quarters.

(Laughter)

So again, a big variation.

And I love this one: they asked in Japan,
they asked the Japanese,

for every 100 Japanese people,

how many of them live in rural areas?

The average was about a 50-50 split,
just over halfway.

They thought 56 out of every 100
Japanese people lived in rural areas.

The official figure is seven.

So extraordinary variations,
and surprising to some,

but not surprising to people
who have read the work

of Daniel Kahneman, for example,
the Nobel-winning economist.

He and his colleague, Amos Tversky,
spent years researching this disjoint

between what people perceive
and the reality,

the fact that people are actually
pretty poor intuitive statisticians.

And there are many reasons for this.

Individual experiences, certainly,
can influence our perceptions,

but so, too, can things like the media
reporting things by exception,

rather than what’s normal.

Kahneman had a nice way
of referring to that.

He said, “We can be blind
to the obvious” –

so we’ve got the numbers wrong –

“but we can be blind
to our blindness about it.”

And that has enormous
repercussions for decision making.

So at the statistics office
while this was all going on,

I thought this was really interesting.

I said, this is clearly a global problem,

but maybe geography is the issue here.

These were questions that were all about,
how well do you know your country?

So in this case, it’s how well
do you know 64 million people?

Not very well, it turns out.
I can’t do that.

So I had an idea,

which was to think about
this same sort of approach

but to think about it
in a very local sense.

Is this a local?

If we reframe the questions and say,

how well do you know your local area,

would your answers be any more accurate?

So I devised a quiz:

How well do you know your area?

It’s a simple Web app.

You put in a post code

and then it will ask you questions
based on census data

for your local area.

And I was very conscious
in designing this.

I wanted to make it open
to the widest possible range of people,

not just the 49 percent
who can get the numbers.

I wanted everyone to engage with it.

So for the design of the quiz,

I was inspired by the isotypes

of Otto Neurath from the 1920s and ’30s.

Now, these are methods
for representing numbers

using repeating icons.

And the numbers are there,
but they sit in the background.

So it’s a great way
of representing quantity

without resorting to using terms
like “percentage,”

“fractions” and “ratios.”

So here’s the quiz.

The layout of the quiz is,

you have your repeating icons
on the left-hand side there,

and a map showing you the area
we’re asking you questions about

on the right-hand side.

There are seven questions.

Each question, there’s a possible answer
between zero and a hundred,

and at the end of the quiz,

you get an overall score
between zero and a hundred.

And so because this is TEDxExeter,

I thought we would have
a quick look at the quiz

for the first few questions of Exeter.

And so the first question is:

For every 100 people,
how many are aged under 16?

Now, I don’t know Exeter very well
at all, so I had a guess at this,

but it gives you an idea
of how this quiz works.

You drag the slider
to highlight your icons,

and then just click “Submit” to answer,

and we animate away the difference
between your answer and reality.

And it turns out, I was a pretty
terrible guess: five.

How about the next question?

This is asking about
what the average age is,

so the age at which half
the population are younger

and half the population are older.

And I thought 35 – that sounds
middle-aged to me.

(Laughter)

Actually, in Exeter,
it’s incredibly young,

and I had underestimated the impact
of the university in this area.

The questions get harder
as you go through.

So this one’s now asking
about homeownership:

For every 100 households, how many
are owned with a mortgage or loan?

And I hedged my bets here,

because I didn’t want to be
more than 50 out on the answer.

(Laughter)

And actually, these get harder,
these questions,

because when you’re in an area,
when you’re in a community,

things like age – there are clues
to whether a population is old or young.

Just by looking around
the area, you can see it.

Something like homeownership
is much more difficult to see,

so we revert to our own heuristics,

our own biases about how many people
we think own their own homes.

Now the truth is,
when we published this quiz,

the census data that it’s based on
was already a few years old.

We’ve had online applications
that allow you to put in a post code

and get statistics back for years.

So in some senses,

this was all a little bit old
and not necessarily new.

But I was interested to see
what reaction we might get

by gamifying the data
in the way that we have,

by using animation

and playing on the fact
that people have their own preconceptions.

It turns out, the reaction was, um …

was more than I could have hoped for.

It was a long-held ambition of mine
to bring down a statistics website

due to public demand.

(Laughter)

This URL contains the words
“statistics,” “gov” and “UK,”

which are three of people’s least
favorite words in a URL.

And the amazing thing about this
was that the website came down

at quarter to 10 at night,

because people were actually
engaging with this data

of their own free will,

using their own personal time.

I was very interested to see

that we got something like
a quarter of a million people

playing the quiz within the space
of 48 hours of launching it.

And it sparked an enormous discussion
online, on social media,

which was largely dominated

by people having fun
with their misconceptions,

which is something that
I couldn’t have hoped for any better,

in some respects.

I also liked the fact that people started
sending it to politicians.

How well do you know the area
you claim to represent?

(Laughter)

And then just to finish,

going back to the two kinds of people,

I thought it would be
really interesting to see

how people who are good with numbers
would do on this quiz.

The national statistician
of England and Wales, John Pullinger,

you would expect he would be pretty good.

He got 44 for his own area.

(Laughter)

Jeremy Paxman – admittedly,
after a glass of wine – 36.

Even worse.

It just shows you that the numbers
can inspire us all.

They can surprise us all.

So very often, we talk about statistics

as being the science of uncertainty.

My parting thought for today is:

actually, statistics is the science of us.

And that’s why we should
be fascinated by numbers.

Thank you very much.

(Applause)

早在2003年

,英国政府就进行了一项调查。

这是一项衡量人口
计算能力水平的调查

他们震惊地发现,

该国每 100 名工作年龄的
成年人中

,就有 47 人缺乏 1 级计算能力。

现在,1 级算术技能——
这是低端的 GCSE 分数。

它是处理分数、
百分比和小数的能力。

所以这个数字
在白厅引起了很多麻烦。

改变了政策,

进行了投资,

然后他们
在 2011 年再次进行了调查。

那么你能
猜出这个数字发生了什么吗?

上升到 49 个。

(笑声

)事实上,当我
在英国《金融时报》上报道这个数字时

,我们的一位读者开玩笑说:

“这个数字只
让 51% 的人口感到震惊。”

(笑声)

但实际上,

当我
在学校展示这些信息时,我更喜欢一个小学生的反应,

他们举起手说:

“我们怎么知道
做出这个数字

的人不是 49% 的人之一 任何一个?”

(笑声)

很明显,有一个算术问题,

因为这些
是生活中重要的技能

,我们想要在本世纪引入的很多变化

都让我们
对数字变得更加自在。

现在,这不仅仅是一个英语问题。

经合组织今年发布了一些
关于年轻人计算能力的数据

,其中美国处于领先地位——美国

近 40% 的
年轻人计算能力较低。

现在,英国也在那里,

但有 7 个经合组织国家
的数字超过 20%。

这是一个问题,
因为它不必是那样的。

如果您查看该图的远端,

您会看到荷兰和韩国
是单个数字。

所以肯定有一个
我们想要解决的算术问题。

现在,尽管这些研究很有用,但

我认为我们冒着将人们
无意中归为两类之一的风险;

有两种人:

那些对数字感到满意的人
,会做数字

的人,以及不会做的人。


今天要在这里谈论的

是,我认为
这是一种错误的二分法。

这不是一成不变的配对。

我认为你不必具有
极高的计算

能力才能受到数字的启发

,这应该是
未来旅程的起点。

对我来说,我们可以开始这一旅程的一种方式

是查看统计数据。

现在,我是第一个
承认统计数据存在

某种形象问题的人。

(笑声)

这是数学的一部分

,即使是数学家
也不特别喜欢,

因为其余的数学
都是关于精确性和确定性的,而

统计学几乎是相反的。

但实际上,我自己是一个
迟到的人。

如果你问我的本科教授

,大学毕业后我最不可能擅长哪两个科目,

他们会告诉你统计
和计算机编程

,但我在这里,将向你

展示我编程的一些统计图形。

那么是什么激发了我的这种变化呢?

是什么让我觉得
统计实际上是一件有趣的事情?

这真的是因为
统计数据是关于我们的。

如果你看一下
统计这个词的词源,

它是处理

有关
我们所居住的州或社区的数据的科学。

所以统计是关于我们作为一个群体,

而不是我们作为个人。

我认为,作为社会动物,

我们同样着迷于
我们作为个人如何与我们的群体、

与我们的同龄人建立联系。

以这种方式进行的统计数据在

让我们感到惊讶时最为强大。

Ipsos MORI 最近几年进行了一些非常棒的
调查

他们对英国 1000 多名成年人进行了一项调查,结果显示,

英格兰和威尔士每 100 人中,有

多少人是穆斯林?

现在,本应代表总人口的这项调查的平均答案

是 24。

这就是人们的想法。

英国人认为
该国每 100 人中有 24 人是穆斯林。

现在,官方数据显示
这个数字约为五。

因此,
我们的想法、我们的感知

和统计数据给出的现实之间存在很大差异。

我认为这很有趣。

什么可能导致
这种误解?

我对这项研究感到非常兴奋,

我开始
在演讲中提出问题。 我指的是它。

现在,我在哈默史密斯

的圣保罗女子学校做了一个演讲

,我的听众很像这样,

除了它完全
由六年级的女孩组成。

我说,“

姑娘们,你们
认为英国公众认为

每年有多少少女怀孕?”

当我

说英国公众认为
每年每 100 名少女中有 15 名

怀孕时,女孩们都中风了。

他们完全有权生气,

因为事实上,根据官方数据告诉我们的,我必须有
接近 200 个点

才能给一个点上色

就像算术一样,
这不仅仅是一个英语问题。

Ipsos MORI 近年来将调查范围扩大
到全球。

因此,他们问沙特阿拉伯人,

在你们国家每 100 名成年人中,有

多少人超重或肥胖?

沙特人的平均回答
只有四分之一多一点。

他们就是这么想的。

超过四分之一的
成年人超重或肥胖。

官方数据显示,实际上,
它接近四分之三。

(笑声

) 又是一个很大的变化。

我喜欢这个:他们在日本问,
他们问日本人

,每 100 个日本人,有

多少人生活在农村地区?

平均水平大约是 50-50,
刚刚过半。

他们认为每 100 个日本人中就有 56 个
生活在农村地区。

官方数字是七。

如此不同寻常的变化,
让一些

人感到惊讶,但对于
那些

读过丹尼尔·卡尼曼(
例如诺贝尔经济学奖得主)著作的人来说并不奇怪。

他和他的同事阿莫斯·特沃斯基(Amos Tversky)
花了数年时间研究

人们感知
与现实之间

的这种脱节,即人们实际上是
非常缺乏直觉的统计学家。

这有很多原因。

个人经历当然
会影响我们的看法,

但媒体也可以
例外地报道事情,

而不是正常报道。

卡尼曼有一个很好
的提及方式。

他说,“我们可以
对显而易见的事情视而不见”——

所以我们弄错了数字——

“但我们可以视而不见我们对此的
视而不见。”


对决策产生了巨大的影响。

因此,在统计局
进行这一切的同时,

我认为这真的很有趣。

我说,这显然是一个全球性的问题,

但也许地理是这里的问题。

这些都是关于,
你对你的国家了解多少?

那么在这种情况下,你对 6400 万人的了解程度如何

不是很好,事实证明。
我不能那样做。

所以我有一个想法,

那就是
考虑同样的方法,

但要
在非常本地的意义上考虑它。

这是本地人吗?

如果我们重新构建问题并说,

您对您所在地区的了解程度如何

,您的答案会更准确吗?

所以我设计了一个测验:

你对你所在地区的了解程度如何?

这是一个简单的网络应用程序。

您输入邮政编码

,然后它会
根据

您所在地区的人口普查数据向您提问。

我在设计这个时非常有意识

我想让它
向尽可能广泛的人开放,

而不仅仅是
能够获得数字的 49%。

我希望每个人都参与其中。

因此,对于测验的设计,

受到了 1920 年代和 30 年代 Otto Neurath 同种型的启发。

现在,这些是

使用重复图标表示数字的方法。

数字就在那里,
但它们在后台。

因此,这是一种
表示数量的好方法,

无需
使用“百分比”、

“分数”和“比率”等术语。

所以这里是测验。

测验的布局是,

您在左侧有重复的图标
,在右侧

有一张地图,显示
我们正在向您提问的区域

有七个问题。

每个问题都有一个
介于 0 到 100 之间的可能答案

,在测验结束时,

您的总分
在 0 到 100 之间。

所以因为这是 TEDxExeter,

我想我们可以
快速浏览一下

Exeter 前几个问题的测验。

所以第一个问题是

:每 100 人中,有
多少人年龄在 16 岁以下?

现在,我一点也不了解
埃克塞特,所以我对此进行了猜测,

但它让您
了解了这个测验是如何运作的。

您拖动滑块
以突出显示您的图标,

然后只需单击“提交”即可回答

,我们会通过动画消除
您的回答与现实之间的差异。

事实证明,我是一个非常
糟糕的猜测:五个。

下一个问题呢?

这是在
询问平均年龄是多少

,即
一半人口年轻

而一半人口老龄化的年龄。

我想 35
岁——对我来说听起来像是中年人。

(笑声)

事实上,在埃克塞特,
它非常年轻

,我低估了
这所大学在这方面的影响。

随着你的进行,问题变得越来越难

所以这个人现在问
的是房屋所有权

:每 100 个家庭,有多少
人拥有抵押贷款或贷款?

我在这里对冲我的赌注,

因为我不想
在答案上超过 50。

(笑声

) 实际上,这些问题变得更难了,

因为当你在一个地区,
当你在一个社区里

,比如年龄——有一些线索
可以判断人口是老是少。

只要环顾
四周,您就可以看到它。

像房屋所有权这样的东西
更难看到,

所以我们回归到我们自己的启发式方法,

我们自己对
我们认为有多少人拥有自己的房屋的偏见。

现在的事实是,
当我们发布这个测验

时,它所依据的人口普查数据
已经有几年的历史了。

多年来,我们有在线应用
程序允许您输入邮政编码

并获取统计数据。

所以从某种意义上说,

这一切都有点老了
,不一定是新的。

但我很想看看

通过
以我们现有的方式将数据游戏化,

通过使用动画并利用

人们有自己的先入之见这一事实,我们可能会得到什么反应。

事实证明,反应是,嗯

……超出了我的预期。

由于公众的需求,我的一个长期抱负是
关闭一个统计网站

(笑声)

这个 URL 包含
“statistics”、“gov”和“UK”

这三个
词,这是 URL 中人们最不喜欢的三个词。

令人惊奇的
是,该网站

在晚上 10 点半就关闭了,

因为人们实际上

是在

利用他们自己的个人时间自愿处理这些数据。

我很感兴趣地看到

,在启动它的 48 小时内,我们
有大约 100 万的人

在玩这个测验


在社交媒体上引发了在线上的巨大讨论,

主要

是人们
对自己的误解感到开心,在某些方面,

这是
我无法期待的更好的事情

我也喜欢人们开始
将它发送给政客的事实。

你对你声称代表的地区了解
多少?

(笑声

) 最后,

回到这两种人,

我想

看看擅长数字
的人在这个测验中的表现会非常有趣。

英格兰和威尔士的国家统计学家约翰·普林格,

你会认为他会很不错。

他为自己的地区获得了 44 分。

(笑声)

Jeremy Paxman——不可否认,
在喝了一杯酒之后—— 36。

更糟。

它只是向您表明,这些数字
可以激励我们所有人。

他们可以让我们所有人感到惊讶。

所以很多时候,我们把统计学说

成是不确定性的科学。

我今天的临别想法是:

实际上,统计是我们的科学。

这就是为什么我们
应该对数字着迷。

非常感谢你。

(掌声)