Why you dont like the sound of your own voice Rbecca Kleinberger

If you ask evolutionary biologists

when did humans become humans,

some of them will say that,

well, at some point
we started standing on our feet,

became biped and became
the masters of our environment.

Others will say that because our brain
started growing much bigger,

that we were able to have
much more complex cognitive processes.

And others might argue
that it’s because we developed language

that allowed us to evolve as a species.

Interestingly, those three
phenomena are all connected.

We are not sure how or in which order,

but they are all linked

with the change of shape of a little bone
in the back of your neck

that changed the angle
between our head and our body.

That means we were able to stand upright

but also for our brain
to evolve in the back

and for our voice box to grow
from seven centimeters for primates

to 11 and up to 17 centimetres for humans.

And this is called
the descent of the larynx.

And the larynx is the site of your voice.

When baby humans are born today,
their larynx is not descended yet.

That only happens
at about three months old.

So, metaphorically, each of us here

has relived the evolution
of our whole species.

And talking about babies,

when you were starting to develop
in your mother’s womb,

the first sensation that you had
coming from the outside world,

at only three weeks old,
when you were about the size of a shrimp,

were through the tactile sensation

coming from the vibrations
of your mother’s voice.

So, as we can see, the human voice
is quite meaningful and important

at the level of the species,

at the level of the society –

this is how we communicate
and create bonds,

and at the personal
and interpersonal levels –

with our voice, we share much more
than words and data,

we share basically who we are.

And our voice is indistinguishable
from how other people see us.

It is a mask that we wear in society.

But our relationship with our own voice
is far from obvious.

We rarely use our voice for ourselves;
we use it as a gift to give to others.

It is how we touch each other.

It’s a dialectical grooming.

But what do we think about our own voice?

So please raise your hand

if you don’t like the sound of your voice
when you hear it on a recording machine.

(Laughter)

Yeah, thank you, indeed,

most people report not liking
the sound of their voice recording.

So what does that mean?

Let’s try to understand that
in the next 10 minutes.

I’m a researcher at the MIT Media Lab,

part of the Opera of the Future group,

and my research
focuses on the relationship

people have with their own voice
and with the voices of others.

I study what we can learn
from listening to voices,

from the various fields,

from neurology to biology,
cognitive sciences, linguistics.

In our group we create
tools and experiences

to help people gain a better
applied understanding of their voice

in order to reduce the biases,

to become better listeners,

to create more healthy relationships

or just to understand themselves better.

And this really has to come
with a holistic approach on the voice.

Because, think about all
the applications and implications

that the voice may have,
as we discover more about it.

Your voice is a very complex phenomenon.

It requires a synchronization
of more than 100 muscles in your body.

And by listening to the voice,

we can understand possible failures
of what happens inside.

For example:

listening to very specific
types of turbulences

and nonlinearity of the voice

can help predict
very early stages of Parkinson’s,

just through a phone call.

Listening to the breathlessness
of the voice

can help detect heart disease.

And we also know that the changes of tempo
inside individual words

is a very good marker of depression.

Your voice is also very linked
with your hormone levels.

Third parties listening to female voices

were able to very accurately
place the speaker

on their menstrual cycle.

Just with acoustic information.

And now with technology
listening to us all the time,

Alexa from Amazon Echo

might be able to predict
if you’re pregnant

even before you know it.

So think about –

(Laughter)

Think about the ethical
implications of that.

Your voice is also very linked
to how you create relationships.

You have a different voice
for every person you talk to.

If I take a little snippet
of your voice and I analyze it,

I can know whether you’re talking
to your mother, to your brother,

your friend or your boss.

We can also use, as a predictor,
the vocal posture.

Meaning, how you decide to place
your voice when you talk to someone.

And you vocal posture,
when you talk to your spouse,

can help predict not only if,
but also when you will divorce.

So there is a lot to learn
from listening to voices.

And I believe this has to start
with understanding

that we have more than one voice.

So, I’m going to talk
about three voices that most of us posses,

in a model of what I call the mask.

So when you look at the mask,

what you see is a projection
of a character.

Let’s call that your outward voice.

This is also the most classic way
to think about the voice,

it’s a way of projecting
yourself in the world.

The mechanism for this projection
is well understood.

Your lungs contract your diaphragm

and that creates a self-sustained
vibration of your vocal fold,

that creates a sound.

And then the way you open and close
the cavities in you mouth,

your vocal tract is going to
transform the sound.

So everyone has the same mechanism.

But voices are quite unique.

It’s because very subtle differences
in size, physiology, in hormone levels

are going to make very subtle
differences in your outward voice.

And your brain is very good

at picking up those subtle differences
from other people’s outward voices.

In our lab, we are working
on teaching machines

to understand those subtle differences.

And we use deep learning to create
a real-time speaker identification system

to help raise awareness
on the use of the shared vocal space –

so who talks and who never
talks during meetings –

to increase group intelligence.

And one of the difficulties with that
is that your voice is also not static.

We already said that it changes
with every person you talk to

but it also changes generally
throughout your life.

At the beginning
and at the end of the journey,

male and female voices are very similar.

It’s very hard to distinguish

the voice of a very young girl
from the voice of a very young boy.

But in between, your voice
becomes a marker of your fluid identity.

Generally, for male voices
there’s a big change at puberty.

And then for female voices,

there is a change at each pregnancy
and a big change at menopause.

So all of that is the voice
other people hear when you talk.

So why is it that we’re so
unfamiliar with it?

Why is it that it’s not
the voice that we hear?

So, let’s think about it.

When you wear a mask,
you actually don’t see the mask.

And when you try to observe it,
what you will see is inside of the mask.

And that’s your inward voice.

So to understand why it’s different,

let’s try to understand the mechanism
of perception of this inward voice.

Because your body has many ways
of filtering it differently

from the outward voice.

So to perceive this voice,
it first has to travel to your ears.

And your outward voice
travels through the air

while your inward voice
travels through your bones.

This is called bone conduction.

Because of this, your inward voice
is going to sound in a lower register

and also more musically harmonical
than your outward voice.

Once it travels there,
it has to access your inner ear.

And there’s this other mechanism
taking place here.

It’s a mechanical filter,

it’s a little partition
that comes and protects your inner ear

each time you produce a sound.

So it also reduces what you hear.

And then there is a third filter,
it’s a biological filter.

Your cochlea – it’s a part of your
inner ear that processes the sound –

is made out of living cells.

And those living cells
are going to trigger differently

according to how often
they hear the sound.

It’s a habituation effect.

So because of this,

as your voice is the sound
you hear the most in your life,

you actually hear it less
than other sounds.

Finally, we have a fourth filter.

It’s a neurological filter.

Neurologists found out recently

that when you open your mouth
to create a sound,

your own auditory cortex shuts down.

So you hear your voice

but your brain actually never listens
to the sound of your voice.

Well, evolutionarily
that might make sense,

because we know cognitively
what we are going to sound like

so maybe we don’t need
to spend energy analyzing the signal.

And this is called a corollary discharge

and it happens for every motion
that your body does.

The exact definition
of a corollary discharge

is a copy of a motor command
that is sent by the brain.

This copy doesn’t create any motion itself

but instead is sent
to other regions of the brain

to inform them of the impending motion.

And for the voice, this corollary
discharge also has a different name.

It is your inner voice.

So let’s recapitulate.

We have the mask, the outward voice,

the inside of the mask, your inward voice,

and then you have your inner voice.

And I like to see this one
as the puppeteer

that holds the strings
of the whole system.

Your inner voice is

the one you hear
when you read a text silently,

when you rehearse
for an important conversation.

Sometimes is hard to turn it off,

it’s really hard to look at the text
written in your native language,

without having this inner voice read it.

It’s also the voice
that refuse to stop singing

the stupid song you have in your head.

(Laughter)

And for some people
it’s actually impossible to control it.

And that’s the case
of schizophrenic patients,

who have auditory hallucinations.

Who can’t distinguish at all
between voices coming from inside

and outside their head.

So in our lab, we are also
working on small devices

to help those people
make those distinctions

and know if a voice
is internal or external.

You can also think about the inner voice
as the voice that speaks in your dream.

This inner voice can take many forms.

And in your dreams, you actually unleash
the potential of this inner voice.

That’s another work
we are doing in our lab:

trying to access
this inner voice in dreams.

So even if you can’t always control it,

the inner voice –
you can always engage with it

through dialogue, through inner dialogues.

And you can even see this inner voice

as the missing link
between thought and actions.

So I hope I’ve left you
with a better appreciation,

a new appreciation of all of your voices

and the role it plays
inside and outside of you –

as your voice is a very critical
determinant of what makes you humans

and of how you interact with the world.

Thank you.

(Applause)

如果你问进化生物学家

，人类是什么时候变成人类的，

他们中的一些人会说，

嗯，在某个时候，
我们开始站起来，

变成两足动物，
成为我们环境的主人。

其他人会说，因为我们的大脑
开始变得更大

，我们能够拥有
更复杂的认知过程。

其他人可能会争辩
说，这是因为我们开发了语言

，使我们能够作为一个物种进化。

有趣的是，这三种
现象都是相互关联的。

我们不确定如何或以何种顺序，

但它们都

与你脖子后面的一块小骨头的形状变化有关，

它改变了
我们头部和身体之间的角度。

这意味着我们能够直立站立，

而且我们的大脑也
能在后面进化

，我们的语音盒也
能从灵长类动物的 7 厘米长

到 11 厘米，人类的长到 17 厘米。

这被称为
喉的下降。

喉是你发声的地方。

当人类婴儿今天出生时，
他们的喉部还没有下降。

这只发生
在大约三个月大的时候。

所以，打个比方，我们每个人

都重温
了我们整个物种的进化。

说到婴儿，

当你
在你母亲的子宫里开始发育时

，你第一次感受到
来自外界的感觉

，只有三周大，
当你大约有一只虾那么大的时候，

是通过触觉

来的来自
你母亲声音的振动。

所以，正如我们所看到的，人类的声音

在物种

层面、社会层面是非常有意义和重要的——

这就是我们沟通
和建立联系的方式

，在个人
和人际层面——

与我们的声音，我们分享的
不仅仅是文字和数据，

我们基本上分享的是我们自己。

我们的声音与
其他人对我们的看法没有区别。

这是我们在社会上戴的面具。

但我们与自己声音的关系
远非显而易见。

我们很少为自己发声；
我们把它当作礼物送给别人。

这就是我们彼此接触的方式。

这是一种辩证的梳理。

但我们如何看待自己的声音？

因此，

如果您不喜欢在录音机上听到自己的声音
，请举手。

（笑声）

是的，谢谢，事实上，

大多数人报告说他们不喜欢
他们录音的声音。

那是什么意思？

让我们
在接下来的 10 分钟内尝试理解这一点。

我是麻省理工学院媒体实验室的研究员，该实验室

是未来歌剧小组的成员

，我的研究
重点是

人们与自己
的声音和他人的声音之间的关系。

我研究我们可以
从聆听声音中学到什么，

从各个领域，

从神经学到生物学、
认知科学、语言学。

在我们的团队中，我们创造了
工具和体验

来帮助人们更好
地理解他们的声音

，以减少偏见

，成为更好的倾听者

，建立更健康的关系，

或者只是为了更好地了解自己。

这确实需要
对声音进行整体处理。

因为，

随着我们对它的发现越来越多，想想声音可能具有的所有应用和影响。

你的声音是一个非常复杂的现象。

它需要
您体内 100 多块肌肉的同步。

通过聆听声音，

我们可以了解
内部发生的可能失败的事情。

例如：

聆听非常特定
类型的湍流

和声音的非线性

可以帮助预测
帕金森氏症的早期阶段，

只需通过电话即可。

听声音的呼吸困难

可以帮助检测心脏病。

而且我们也知道，单个单词中节奏的变化

是抑郁症的一个很好的标志。

你的声音也
与你的荷尔蒙水平密切相关。

聆听女性声音的第三方

能够非常准确
地将扬声器

置于月经周期。

只是有声学信息。

现在，随着技术一直在
倾听我们的声音，

来自 Amazon Echo 的 Alexa

甚至可以在您知道之前预测
您是否怀孕

。

所以想一想——

（笑声）

想一想这对道德的
影响。

你的声音也与
你如何建立关系密切相关。

与
您交谈的每个人都有不同的声音。

如果我截取
你的一小段声音并加以分析，

我就能知道你是在
和你的母亲、你的兄弟、

你的朋友还是你的老板说话。

我们还可以使用
声音姿势作为预测指标。

意思是，
当您与某人交谈时，您如何决定发出声音。

当你和你的配偶说话时，你的发声姿势

不仅可以帮助预测
你是否会离婚，还可以预测你何时会离婚。

因此，从聆听声音中可以学到很多东西
。

我相信这必须从

了解我们有不止一个声音开始。

所以，我将在我称之为面具的模型中
谈论我们大多数人拥有的三种声音

。

所以当你看面具的时候，

你看到的是一个角色的投影
。

让我们称之为你的外在声音。

这也是最经典
的声音思考

方式，是一种在世界上投射自己的方式
。

这种投射的机制
是众所周知的。

你的肺收缩你的横膈膜

，这会产生声带的自我维持
振动，

从而产生声音。

然后你打开和关闭
嘴里的空腔的方式，

你的声道将
改变声音。

所以每个人都有相同的机制。

但声音非常独特。

这是因为
体型、生理和激素水平

的细微
差别会在你的外在声音中产生非常细微的差别。

你的大脑非常

擅长从其他人的外在声音中识别出那些细微的差异
。

在我们的实验室中，我们正在
努力教

机器理解这些细微的差异。

我们使用深度学习来
创建实时说话者识别系统，

以帮助提高
对共享声音空间使用的认识——

即谁在会议期间说话和谁从不
说话——

以提高群体智能。

这样做的困难之一
是你的声音也不是静止的。

我们已经说过，它会
随着你交谈的每个人

而改变，但它也会
在你的一生中普遍改变。

在旅程的开始
和结束时，

男性和女性的声音非常相似。

很难区分

一个非常年轻的女孩
的声音和一个非常年轻的男孩的声音。

但在这两者之间，你的声音
成为你流动身份的标志。

一般来说，男性的声音
在青春期会有很大的变化。

然后对于女性的声音，

每次怀孕都会
有变化，更年期会有很大的变化。

所以所有这些都是
其他人在你说话时听到的声音。

那么为什么我们对它如此
陌生呢？

为什么
不是我们听到的声音？

所以，让我们考虑一下。

当你戴上面具时，
你实际上看不到面具。

当你尝试观察它时，
你会看到面具里面。

这就是你内心的声音。

所以要理解它为什么不同，

让我们试着理解
这种内在声音的感知机制。

因为你的身体有很多
过滤它的方式，

不同于外在的声音。

所以要感知这个声音，
它首先必须传到你的耳朵里。

你向外的声音
在空气中传播，

而你向内的声音
在你的骨骼中传播。

这称为骨传导。

正因为如此，你内心的
声音会比你的外在声音发出更低的音域，

并且在音乐上更和谐
。

一旦它到达那里，
它就必须进入你的内耳。

这里还有另一种
机制。

这是一个机械过滤器，

它是一个小隔板
，

每次你发出声音时都会保护你的内耳。

所以它也会减少你听到的东西。

然后是第三个过滤器，
它是一个生物过滤器。

你的耳蜗——它
是处理声音的内耳的一部分——

是由活细胞组成的。

这些活
细胞会根据

它们听到声音的频率不同触发。

这是一种习惯效应。

因此，

由于您的声音是
您一生中听到最多的声音，因此

您实际上听到的
声音比其他声音少。

最后，我们有第四个过滤器。

这是一个神经过滤器。

神经学家最近

发现，当你张开嘴
发出声音时，

你自己的听觉皮层就会关闭。

所以你听到了你的声音，

但你的大脑实际上从来没有听过
你的声音。

好吧，从进化
上讲，这可能是有道理的，

因为我们从认知上
知道我们会发出什么样的声音，

所以也许我们
不需要花费精力分析信号。

这被称为必然放电

，它发生在你身体的每一个动作
中。必然放电

的确切定义

是大脑发送的运动命令的副本
。

该副本本身不会产生任何运动

，而是被发送
到大脑的其他区域，

以告知它们即将发生的运动。

而对于声音，这种必然的
放电也有不同的名称。

这是你内心的声音。

所以让我们回顾一下。

我们有面具，外在的声音，

面具的里面，你内心的声音，

然后你有你内在的声音。

我喜欢把这个
看作

操纵整个系统的木偶。

你内心的声音

是你
在默读一段文字时听到的，当你

为重要的谈话排练时听到的。

有时很难将其关闭，

如果不让这种内心的声音读出来，就很难看到
用你的母语写的文字

。

这也是
拒绝停止

唱你脑海中愚蠢歌曲的声音。

（笑声

）对于某些人
来说，实际上是无法控制的。

精神分裂症患者就是这种
情况，

他们有幻听。

谁根本
分不清脑内

和脑外的声音。

因此，在我们的实验室中，我们
也在研究小型设备，

以帮助这些人
做出这些区分，

并了解声音
是内部的还是外部的。

你也可以把内心
的声音想象成你梦中的声音。

这种内在的声音可以采取多种形式。

在你的梦想中，你实际上释放
了这种内在声音的潜力。

这
是我们实验室正在做的另一项工作：

试图
在梦中获得这种内心的声音。

所以即使你不能总是控制它

，内在的声音——
你总是可以

通过对话，通过内在的对话与它互动。

你甚至可以将这种内在的声音

视为
思想和行动之间缺失的环节。

所以我希望我给你
留下了更好的欣赏，

对你所有的声音

以及它在你内外所扮演的角色的新欣赏
——

因为你的声音是一个非常关键的
决定因素，决定了你是什么

以及你如何与世界互动。

谢谢你。

（掌声）