Becoming a Better Informed Citizen

[Applause]

okay

i want you to imagine something that’s

not quite that nice which is you go to

the doctor for regular checkup and it

turns out there’s something wrong

something serious potentially fatal and

you need treatment

luckily the hospital has two doctors on

staff with two different treatments to

offer you

and you’re allowed to choose which one

you want

and as you can see on board you see the

survival rate of those two treatments

and you’re allowed to choose obviously

most of you now think why would i ever

choose to put dr b

although with dr a thank you now what do

you normally do

when you have a serious medical problem

you find a cycling opinion

so you go to a different hospital that

also has two different treatments

and they split up the data a little bit

differently because they split it up

into severe cases in normal cases

now you might think well don’t i need to

know which case i am to make a good

choice and the answer is actually no

because if you look

you will see that dr b’s treatment has a

higher percentage in both categories

right meaning that in this hospital you

would prefer dr b

so to repeat in the first hospital you’d

prefer

dr a and the second hospital

you’d prefer to be the problem being

they’re the same doctors

this is dr a in the first hospital and

in the second hospital

and you can see i can either split his

two cases the normal sphere what’s up

or i can just add them up and get a 80

rate

and the second doctor the same thing the

problem being that although he’s better

both categories independently

overall his percentage is worse

now i hope that those of you who heard

there’s talks about statistics and

started dozing off that this grabbed

your attention enough to listen to me

for another 10 minutes

and what you’ve just experienced is

called simpsons paradox

which means that depending how i split

up data presented to you i can get

two opposite decisions from you

now becoming better informed citizen is

the name of my talk

and i want to introduce you to the

concept of statistical literacy

i teach mathematics which means that the

question that i hear most

often is when will we ever use this in

real life

admittedly it’s been a while since i had

to differentiate something outside of

work

however i’ve come to believe that the

single most important

topic in mathematics that i teach school

is actually

statistics because you will encounter it

no matter what you do

it will be in the news it will be in

medical

issues maybe you can go into research

and you have to deal with it

i mean even if you do psychology you

actually have to do courses in

statistics to

analyze all the data that’s given to you

so there’s no way around it

but as we’ve just seen there are quite

some problems you might encounter

now i’ll go to another example of the

simpsons paradox

which is a topic that’s often in the

news

the pay gap between men and women here i

present to you three companies

and show you the average income between

men and women in those three companies

and the question is which company is the

fairest in regards to pay

so i’ll give you a minute to look at

that but most people will probably think

well

technically none of them are fair

because fair would be if it’s the same

for both but in the first one women are

too then the second one’s a little bit

better but it’s still below and the

third one well compensates the first two

but it’s also not fair that they

actually get more

the problem is let’s have a closer look

so i’ve

split up the work of workers in that

company

into managing positions office workers

and support staff

and here you can see how many of those

are male how many of those are female

and how much they get paid each

the problem here is well it’s not

actually a problem everybody gets paid

the same on their level

but because there are fewer female

managers and office staff

the overall average goes down something

that the one number of average doesn’t

actually tell you

let’s have a look at company b company b

also has fewer women in the positions

but their pay is actually higher than

their male counterparts

but average is still lower bit weird

and now company c where you have

more women in the categories which is

good but they actually get paid less

but the average is higher meaning it

actually looks good

if you present that let’s say in the

news so

the problem is if you just compare the

averages

you don’t see any of that nuance and you

might completely have the wrong

discussion because the discussion could

be in a company

why there are fewer women in leadership

positions or why do people actually get

paid differently for the same

job but you have the right conversation

you actually need to analyze the data

appropriately and we have this tendency

to go for one number one number only

and that’s not always very productive

now i’m pretty sure there are a couple

of people in the audience who think well

this guy’s teaching mathematics and she

just says average that’s not very

good and you’re right because we’re

getting to the idea

that there are actually different kinds

of averages the problem is when we speak

normally in english

every day we don’t actually say we don’t

talk about the average we talk about the

mean

thing that most people are familiar with

you add everything up and then you

divide it evenly that’s the mean

prop the problem being mathematically

there are more

averages now on the board you have three

classes and their grades it’s the ib

system meaning that seven is the best

grade and

one is the worst grade now which class

did best

on average the problem is

if i take the mean i get one result

another average i can use is so called

median which means that you bring the

numbers in order and then you say the

middle one

and the last one called the modes which

simply says which number actually occurs

most often

the first class has these averages

the second one these and third one these

as you might notice each class did best

in one category

meaning that if this was either in the

newspaper or you actually discussed this

or the class is discussed with each

other every class has a right to say

on average we were better so

whenever you heard hear somebody saying

on average

the negativity of response would be well

which one

talking a little bit more about averages

because it’s really important because

our brains

really don’t like our brains don’t work

well with averages

because i’m pretty sure each one of you

has often or sometimes at least heard

the

expression the most likely outcome

whether that’s in science

an experiment maybe even politically to

say this will be the most likely outcome

the problem is there’s more to it than

you might think

so i feel the point two times that

average

should be that i actually get one head

and one tail right

what’s the actual probability of that

happening

fifty percent our brain likes that

what if i use 10 coins and flip it you

will still stay well on average it

should be five heads and five tails

right

the problem being that this actually

happening

has only a probability of 25

meaning that in 75 percent something

else than the average happens

and this actually gets worse as i

increase the numbers if i flip it 100

times

you would say it should still be 50 50

but not actually happening

only happens in eight percent of the

cases you might say well that’s a neat

thing but why should i care now

for the experiment you have a country

with ten major cities each city has ten

thousand cabinets

and there’s a disease which occurs in

roughly 1 in 10 000 people

so on average every single city should

have one of these cases right

now assuming that this this disease is

actually random

and there are no factors about it what’s

the chance that every city

has exactly one sick person come up with

a number in your head

and the actual answer is

probably slightly lower than you thought

now why is that important

because it means it’s not one in each

city that means at least one city has

two

and you can be sure that the mayor of

that city won’t like the newspaper

headlines of saying that this city has

twice the average the national average

rate of that disease

maybe even three times if there are

three cases in the city

maybe the citizens demand that time and

resources will be used

to fight off this problem and actually

find out why their city

has this problem in the first place the

problem is

in statistics when you just have random

events

clusters are actually to be expected to

visualize this

i will take a coin and because i want to

avoid embarrassing myself and not ever

being able to flip it here i now have it

here

i want you in your head to decide if

it’s going to be heads or tails

so just think if this head details

it’s tails please stand up if you were

correct

now because every second person on

average should be right our brain

kind of thinks it should be standing

sitting standing sitting standing

sitting

but as you can see that’s far from the

truth and the problem is

should i actually go in one of the

corners maybe the two people standing

next to each other who are both standing

up now

and analyze whether they have the

ability to predict the future or

not there’s another group there in the

middle so looking at these clusters

after the effect and trying to analyze

why

there is a cluster cluster is not

necessarily productive

if you don’t understand that cluster is

actually to be expected

and you need to think about how big of a

cluster will actually be a problem

otherwise limited resources and time and

energy might be diverted in the wrong

place please sit down

now i’ve been talking about percentages

a little bit

and i’ll stick with that for a moment

i have in my kitchen three eggs and two

sausages

which means that i have sixty percent

eggs because it’s three out of five

i feel like a snack and i go to the

kitchen and eat a sausage

now i have 75 percent eggs because it’s

three out of four

what’s a valid conclusion that an

outside observer might have if he sees

mr barakat

enters the kitchen and the potential

eggs goes up from 60 to 75 percent

if the person doesn’t know anything else

that might be a conclusion

they get to and the problem is this is a

silly example but that actually happens

in the real world imagine you have a

city with different

police precincts and one present

initiates a new program which is

actually successful

in reducing overall crime in their area

of the city

what does that mean about the crime rate

in the other precincts

it will go up although the actual number

of crimes

stays the same maybe

your company your department does better

and suddenly everybody else

looks worse although they’re still as

productive as before

of course you could say well relatively

they didn’t improve it you did but

to be honest that’s not what people

think when they see a percentage go up

or down

that’s why i’m actually having this

talking

another problem about percentages is

when percentage or percentages

change because if i say a percentage

goes down by 10 percent

i mean something like this that it goes

from 15 percent to five so it’s reduced

by 10 what do i actually mean

it goes from 50 to 13.5 you might notice

that by 13.5

well 10 percent of 15 is 1.5

so 15 went down by 1.5 that’s a

reduction of 10 percent

both of these are actually valid points

meaning that if somebody actually talks

about the change in a percentage

it’s very important to know which case

they’re talking about i’ll come back to

a medical example

the mortality rate for cancer in some

cases halves if you do chemo

now the expression the cure is sometimes

worse than the disease

has its place and when making the

decision you might be interested in

is this a drop from twenty to ten

percent

or is it a drop from two to one percent

both are half but if you’re only told it

halves

and you have to make a decision you

don’t have necessary all

information to make an informed decision

sometimes depending how the number is

presented to you

it will produce a completely different

emotional response from you

and i’ll have an example here

let’s say an increase from 300 to 450

parts per million and this is not

supposed to be a climate change debate

it’s just like this number goes up

from 300 to 450. the problem is

if i don’t want to express it in parts

per million about 0.03 to 0.045

no matter what you know or what you

believe or what you feel about this

your brain will automatically have a

different response to the above and the

below numbers

because our brains don’t really work

that well when

dealing with numbers that are bigger

than let’s say how many fingers we have

so keep that in mind when a number is

presented to you

either the unit you don’t know or maybe

in a context you don’t know

maybe somebody comes up to you and says

hey i can deadlift 100 kilos

and you think okay i don’t know what the

deadlift is but 100 kilos i know that’s

kind of heavy that sounds pretty good

and then you have the people actually

know what the deadlift is and know that

100 kilo deadlift is

not very impressive at all so when

you’re given

an information about something where you

don’t know much about it you probably

need to look it up

even for me if i hear something that a

power generator produces a certain

amount of

energy or power i don’t know

exactly without looking up how to put

that into context with other sources of

energy

now other problems are again as i said

if you’re not given any context

so for example if i just put this number

on the board four million infants died

in 2018 according to unicef

obviously everybody knows that’s bad and

that number is really big so you think

well that’s horrible

we should really really do something

about that the problem is

do you have any other numbers to compare

this to

because one year previously the number

was slightly higher

when you previously it was still higher

in 1990 it was even more than twice as

high

and the most important thing about this

is these are absolute numbers so despite

population growth between 1990

to 2020 the number of infants so below

one year

dying has more than half worldwide

the problem is you don’t know this

unless you look at the numbers in

context so if i just show you 4 million

your brain says 4 million babies that’s

obviously bad so we should do something

about it and it’s horrible

but the matter as a matter of fact is

it’s getting

much better and it’s the best it’s ever

been

now when preparing this talk i asked my

brother for advice what i should include

his response was everything you’re going

to tell them is absolutely

obvious and it’s completely irrelevant

that you have this talk because

everybody knows this already

he has a phd in mathematics so he has a

very particular point

of view for this so i asked my sister

and she was a little bit more helpful

saying that i should at the end of my

time give you concrete advice what to do

in the future otherwise this was a

pointless exercise

so here are three pieces of advice

first when reading or being told about

an average the responses you should have

are what average are you talking about

and

keeping in mind that our brain really

likes to think the average

should happen and anything else there’s

something wrong

but i hope i showed you that it’s

actually very unlikely

that the average is going to happen in

most cases

and so you should expect something else

to happen

secondly but where percentages as i said

a percentage change which one did it go

down by 10

or 10 percent of the percentage i

already have it’s most of the time very

helpful if you’re given a potential

something to also ask for the absolute

values

to make a better informed decision

and finally you need context for numbers

if you’re just told a single number

this is the gdp of a country nothing

about the previous years

what am i supposed to do with that so

ask for numbers in comparison and

although i said three pieces of advice

the most important piece of advice is

a single number never almost never tells

the whole story

unless that number is 42. thank you

[掌声]

好吧,

我想让你想象一些

不太好的事情,那就是你

去看医生做定期检查,

结果发现有问题,

可能会致命,

你需要治疗,

幸运的是,医院有两名

医生和两名 为您提供不同的治疗

,您可以选择

您想要的一种

,正如您在船上看到的那样,您会看到

这两种治疗的存活率,

并且您可以选择显然

大多数人现在认为我为什么会

选择 把 b

医生说一下,虽然现在有了 a 医生,谢谢你,当你有严重的医疗问题时,你通常会做什么

你找到一个骑自行车的意见,

所以你去另一家

也有两种不同治疗方法的医院

,他们把数据分开了一点

不同,因为他们

在正常情况下将其分为严重情况,

现在您可能会认为我不需要

知道我要做出一个好的

选择,而答案实际上是否定的,

因为如果 你看

你会看到 b 博士的治疗

在这两个类别中都有更高的百分比,这

意味着在这家医院你

更喜欢 b 医生,

所以在你喜欢的第一家医院和你

更喜欢

的第二家医院

重复 问题是

他们是同一位医生,

这是第一家医院

和第二家医院的医生

,你可以看到我可以将他的

两个病例分成正常范围,

或者我可以将它们加起来得到 80 分

和第二位医生同样的

问题是,尽管他在

两个类别中都独立地

表现

更好,但他的百分比

现在更差

了 再过 10 分钟

,您刚刚经历的

称为辛普森悖论

,这意味着根据我如何

拆分呈现给您的数据,我可以

从您那里得到两个相反的决定,

现在变得更好 形成公民

是我演讲的名称

,我想向您介绍

统计素养的概念,

我教数学,这

意味着我最常听到的问题

是我们什么时候才能在

现实生活中

使用它

必须区分工作之外的东西

但是我开始相信我在

学校教的数学中最重要的一个主题

实际上是

统计学,因为

无论你做什么你都会遇到它

它会出现在新闻中它会出现在

医学中

问题也许您可以进行研究

并且您必须处理它

我的意思是即使您从事心理学,您

实际上也必须学习

统计学课程来

分析提供给您的所有数据,

所以没有办法解决它,

但正如我们刚刚 看到

你可能会遇到很多问题,

现在我将转到另一个

辛普森悖论的例子,

这是一个经常出现在新闻中的话题,

我在这里向你展示男女之间的薪酬

差距 三个公司

,向你展示

这三个公司的男女平均收入

,问题是哪家公司

在薪酬方面最公平,

所以我给你一分钟时间看看

,但大多数人可能会从技术上考虑

得很好

没有一个是公平的,

因为如果两者都相同,那么公平是公平的,

但是在第一个中,女性也是

如此,然后第二个稍微

好一点,但仍然低于,

第三个很好地补偿了前两个,

但这也不公平 他们

实际上得到了

更多问题是让我们仔细看看,

所以我

把那家公司的工人的工作

分成管理职位办公室工作人员

和支持人员

,在这里你可以看到其中

有多少是男性,有多少是女性

以及他们每个人的报酬是多少

这里的问题很好 这

实际上并不是每个人

在他们的水平上得到相同的报酬

但是因为女性

经理和办公室工作人员较少,

所以总体平均水平是 下降

一个平均数实际上并没有

告诉你的东西

让我们看看 b 公司 b

公司的女性职位也较少,

但她们的薪水实际上

高于男性同行,

但平均水平仍然较低有点奇怪

,现在公司 c

在这些类别中有更多的女性,这

很好,但她们实际上得到的报酬更少,

但平均水平更高,这意味着

如果你在新闻中展示它实际上看起来不错,

所以问题是如果你只是比较

你没有的平均值' 看不到任何细微差别,您

可能会进行完全错误的

讨论,因为讨论可能

是在公司中,

为什么担任领导职位的女性更少,

或者为什么人们实际上

在同一份工作中获得不同的报酬,

但您实际上进行了正确的对话

需要适当地分析数据

,我们倾向于

只追求一个第一

,这并不总是很有成效,

现在我很确定有一个

听众中的几个人认为

这个人在教数学,她

只是说平均值不是

很好,你是对的,因为我们开始

认为实际上存在不同类型

的平均值问题是当我们

正常说话时 每天在英语中,

我们实际上并没有说我们不

谈论平均水平,我们谈论

的是大多数人熟悉的平均水平,

你把所有的东西加起来,然后你

平均分配,这就是平均

水平问题在数学

上是有的

现在在黑板上有更多的平均值 你有三个

班级和他们的成绩 这是 ib

系统,这意味着七个是最好的

成绩,

一个是最差的成绩 现在哪个班级

的平均成绩最好 问题是

如果我取平均值,我会得到另一个结果

我可以使用的平均值是所谓的

中位数,这意味着您将

数字按顺序排列,然后您说

中间一个

,最后一个称为众

数,它只是说明实际出现的数字 s

大多数情况

下,第一堂课

有这些平均值 第二堂课和第三

堂课,因为你可能会注意到每个班级

在一个类别中表现最好,这

意味着如果这是在

报纸上,或者你实际上讨论过这个,

或者班级彼此讨论过

每个班级都有权

说平均而言我们更好,所以

每当你听到有人

说平均而言

,反应的消极性会很好

,哪一个人

谈论平均水平,

因为这真的很重要,因为

我们的大脑

真的不喜欢我们的 大脑不能

很好地处理平均值,

因为我很确定你们每个人

都经常或有时至少听到过

这个

表达最可能的结果,

无论这是否在科学中,

一个实验甚至可能在政治上

说这将是最有可能的

结果 问题是它比你想象的要多,

所以我觉得

平均水平的两倍

应该是我实际上得到了一个头

和一个尾是

什么 发生这种情况的实际概率

50% 我们的大脑喜欢

这样,如果我使用 10 个硬币并翻转它,你

仍然会保持良好平均

应该是五个正面和五个

反面,问题是这种情况实际

发生

的概率只有 25

这意味着在 75%

的情况下会发生与平均值不同的事情

如果我将其翻转 100

,这实际上

会变得更糟 说得好,这是

一件好事,但我为什么现在要关心

这个实验,你有一个

有十个主要城市的国家,每个城市都有一

万个柜子,

并且大约每 10 000 人中就有一种疾病发生,

所以平均每个城市都应该

有 现在其中一个案例

假设这种疾病

实际上是随机的

并且没有任何因素关于它

每个

城市恰好有一个病人的机会是多少 e

在你的脑海中想出一个数字

,实际的答案

可能比你现在想象的略低。

为什么这很重要,

因为这意味着不是每个城市都有一个,

这意味着至少一个城市有

两个

,你可以确定市长

那个城市不会喜欢报纸

头条说这个城市

的平均

发病率是全国平均发病率的

两倍,如果

这个城市有三个病例,甚至可能是三倍,

也许市民要求时间和

资源

用于抗击 解决这个问题,实际上首先

找出他们的城市

有这个问题的原因

问题

在于统计,当你只有随机

事件时

,实际上应该期望集群来

可视化这个

我会拿一枚硬币,因为我想

避免让自己尴尬 并且

永远无法在这里翻转它我现在有它

我希望你在你的脑海中决定

它是正面还是反面

所以只要想想如果这个头部细节

它是尾巴请站起来 如果你现在是对的,那就向上吧,

因为平均每个第二个人

都应该是对的,我们的大脑

有点认为它应该是站着

坐着站着站着

坐着,

但正如你所看到的,这与事实相去甚远

,问题是

我是否真的应该进入其中一个

角落可能是两个并排

站着的人,他们现在都站

起来

,分析他们是否有

能力预测未来

,中间有另一个群体,所以在

效果后观察这些集群并试图分析

原因

如果您不了解集群

实际上是预期的,

并且您需要考虑实际上有多大的

集群会成为问题,那么集群不一定是高效的,

否则有限的资源、时间和

精力可能会被转移到 错误的

地方请坐,

现在我一直在谈论百分比

,我会坚持一会儿

我的厨房里有三个鸡蛋和两个 o

香肠

,这意味着我有 60% 的

鸡蛋,因为它是五分之三

我觉得自己像点心,我去

厨房吃香肠

现在我有 75% 的鸡蛋,因为它是

四分之三 观察者可能有,如果他看到

巴拉卡特先生

进入厨房并且潜在的

鸡蛋从 60% 上升到 75%,

如果这个人不知道任何其他

可能是

他们得出的结论,问题是这是一个

愚蠢的例子,但是 实际发生

在现实世界中 假设您有一个

拥有不同

警察辖区的城市,并且在场的

一个人启动了一项新计划,该计划

实际上成功

地减少了该城市所在地区的总体犯罪率,

对其他辖区的犯罪率意味着

什么? 尽管实际犯罪数量保持不变,但上升

也许

您的公司您的部门做得更好

,突然其他人

看起来更糟,尽管他们仍然

像以前一样富有成效

当然,相对而言,您可以说

他们没有改善,

但老实说,这不是

人们看到百分比上升或下降时的想法,

这就是为什么我实际上要

谈论百分比的另一个问题是

当百分比或 百分比会

发生变化,因为如果我说百分比

下降 10%,

我的意思是这样的,它

从 15% 下降到 5,所以它减少

了 10 我实际上的意思是

它从 50 下降到 13.5,你可能会

注意到到 13.5

以及 10 15 的百分比是 1.5,

所以 15 下降了 1.5,即

减少了 10

%,这两个实际上都是有效的点,

这意味着如果有人真的

谈论百分比的变化,

那么知道他们在谈论哪种情况非常重要

让我回到

一个医学例子

如果你现在做化疗,在某些情况下癌症的死亡率会减半

表达治愈有时

比疾病本身更糟糕

,当

你做出决定时 可能感兴趣的

是这是从百分之二十下降到百分之十

还是从百分之二下降到百分之一

两者都是一半但是如果你只被告知它是

一半

并且你必须做出决定你

没有必要的所有

信息

有时根据数字

呈现给您的方式做出明智的决定,

它会产生与您完全不同的

情绪反应

,我将在这里举一个例子,

假设从百万分之 300 增加到 450

,这不

应该是 一场气候变化辩论

就像这个数字

从 300

上升到 450

你的大脑会自动

对上面和下面的数字产生不同的反应,

因为我们的大脑

处理

大于我们有多少手指的数字时并不能很好地工作,

所以请记住这一点 w 当一个数字

呈现给你时,

要么是你不知道的单位,要么是在

你不知道的情况下,

也许有人走过来对你说,

嘿,我可以硬拉 100 公斤

,你认为好吧,我不知道是什么

硬拉只有 100 公斤,我知道

这听起来很重

,然后你让人们真正

知道硬拉是什么,并且知道

100 公斤的硬拉一点

也不令人印象深刻,所以当

得到关于某事的信息时

您对此

知之甚少,如果我听到

发电机产生一定

量的

能量或功率的东西,即使对我来说,您也可能

需要

查找它 其他能源

现在其他问题再次像我说的那样,

如果你没有给出任何

背景,例如,如果我把这个数字

放在董事会上,

根据联合国儿童基金会的数据,2018 年有 400 万婴儿死亡,

显然每个人都知道这很糟糕,

这个数字是 雷亚尔 很大,所以你

认为这太可怕了,

我们真的应该

做点什么,问题是

你有没有其他数字可以比较

因为一年前这个

数字略高

,而你之前的数字在 1990 年仍然更高

,它甚至 高出两倍以上

,最重要的

是这些是绝对数字,因此尽管

人口在 1990 年

至 2020 年之间增长,但不到

一岁

死亡的婴儿数量在全球范围内占一半以上

,问题是你不知道这一点,

除非你 看看上下文中的数字,

如果我只给你看 400 万,

你的大脑会说 400 万个婴儿,这

显然很糟糕,所以我们应该对此采取

一些措施,这很可怕,

但事实上

它正在

变得更好,而且是最好的

现在在准备这次演讲时,我向我的

兄弟征求意见,我应该包括什么

他的回答是你

要告诉他们的一切都是绝对

明显的,而且 与

你进行这次谈话完全无关,因为

每个人都知道这一点,

他拥有数学博士学位,所以他对此有

非常特殊

的观点,所以我问了我姐姐

,她更有帮助,

说我应该在结束时 我的

时间会给你具体的建议,

否则这是一个

毫无意义的练习,

所以在阅读或被告知平均值时,首先有三条建议

你应该有的回答

是你在谈论

记住的平均值 我们的大脑真的很

喜欢认为

应该发生平均值以及其他任何

问题,

但我希望我向您展示了在大多数情况下

实际上不太可能

发生平均值

,因此您应该期待其他事情

会发生,

但是在哪里 正如我所说

的百分比变化百分比

下降了

我已经拥有的百分比的 10% 或 10%

如果你愿意,它大部分时间都非常有帮助 你被赋予了一个潜在的

东西来要求绝对值

以做出更明智的决定

,最后你需要数字的背景,

如果你只是被告知一个数字

这是一个国家的国内生产总值

与前几年无关

我应该这样做,所以

要求比较数字,

虽然我说了三个建议,

但最重要的建议是

一个数字,

除非那个数字是 42,否则几乎永远不会说出整个故事。谢谢