Three Ways to Extend the Life of Moores Law

Transcriber: yoojae lee
Reviewer: David DeRuwe

My mom is 76, and I am her tech support.

The most frequent issue she runs into
with her iPad and smartphone is data.

She has accumulated so much data
that she’s running out of space.

Now, on the other end
of our generational spectrum,

here comes my teenage daughter,

who took 50 pictures of the exact
same pair of her Converse shoes

just to be able to pick the best one
to post on her Instagram.

Consequently, my family’s entire
iCloud storage was used up.

For me, this is not just a personal
problem, but also a work challenge.

You see, I work at Facebook
as an engineering director,

and it is my job to ensure
there is enough space,

which is what I call cloud
capacity, for every user.

Today, my company
has about 2.6 billion users.

Let’s imagine each of these users
uploaded one photo.

If I print these photos
into our four-by-sixes

and stack them side by side,

the prints will have enough

so that they could circle
our entire beautiful Earth

not just once, but a total of 10 times.

To support these users,
we not only need to store all the data,

but we also need to process them,
which means we need computers.

Lots of them.

A technology research company
called IDC made a prediction

that data stored in the cloud
would reach 100 zettabytes by 2025.

That is a 100 trillion billion bytes.

At this rate, simply adding more computers
will no longer be a sustainable solution.

You may wonder:

Data has been growing fast
like this for quite a while,

so why is it a big deal?

You’re right.

I was never worried because I had
a superpower in my pocket,

and it’s called Moore’s Law.

So what is Moore’s Law?

It is a prediction that says
computers get faster for the same cost.

In 1965, Gordon Moore,
the co-founder of Intel,

predicted that the number of transistors

on the silicon chip
would double every year.

The semiconductor industry
took his prediction as their North Star,

and they met the challenge.

Ten years later, Mr. Moore
revised his prediction

for the number of transistors
to double every two years.

The semiconductor industry
met that challenge again.

When I first came to the US for graduate
school in 1994 from Inner Mongolia,

my professor told us about Moore’s Law,

and he basically said there’s no need
to worry about how to keep up

with the growing data
because Moore’s Law was taking care of it.

This has been true
for the last 50 years till recently.

There is now a physical limit
of the silicon chip getting reached,

and therefore, we now start
to see the slowdown.

To give you a rough example,

on a silicon chip
the size of my fingernail,

there can be up to 50 billion transistors.

50 billion - that’s 10 times the number
of trees we have on this planet.

Recognizing that Moore’s law
may no longer help us as significantly,

the entire cloud infrastructure
industry worked together

looking for new methods
to bend down that demand curve.

So here you can see, this green line here,
indicating our computer supply,

and this blue line is the increasing
demand to store and to process the data.

As you can see, the supply
and the demand are mostly kept up,

but you also start to see a gap,
and this gap is widening.

This means my dear mom may have
to delete her precious videos and photos.

This means you and I may have
to pay more for our iCloud storage.

This also means more and more
data centers are going to be built,

consuming more power, cooling,
and space on the planet.

Luckily, my colleagues
and I do have a few methods here,

each of them bending down
the demand curve a little bit at a time.

And here I’d like to share
three of them with you:

The first method we use
to bend the demand curve

is by introducing hardware accelerators.

Hardware accelerators
are like specialists in the hospital.

While a general doctor can be very good
triaging and admitting a lot of patients,

she scales herself by referring specific
patients to the specialist.

In the data processing world,

these can be video encoding

or an intensive artificial
intelligence calculation.

Adding the hardware
accelerator to a computer

is like adding specialist support
to a general doctor,

and working together,
they become much more efficient

by focusing on what they’re best at.

To give you a rough example,

if I tell my computer to recognize
the image of my daughter’s shoe photo,

it would have taken about a day
using a regular CPU,

but with hardware accelerators,
this can be done under 30 minutes.

Hardware accelerators help
to bend the demand curve.

The second method to bend the demand curve

is by using software
to optimize our cloud platform.

Now, platform optimization
is like playing the game of Tetris.

The different colors of tiles

are like different types
of computers in my data center.

Some are for storage,
while the others are for compute.

I would spread and stack them

so that there’s as little
wasted space as possible,

which means I get to have less idle
computers and a higher utilization rate.

This Tetris game
is so fancy that it’s even 3D,

which means I get to leverage time zones.

When the users from the continent
of Asia went offline,

I can possibly use
those computers to process data

for our North American users.

And just like any gamers,

I would keep track of my gaming scores
from one day to the next,

and each day I get to bend
that demand curve a little bit more.

Now the third method, also the newest,
is to change how the data is used,

therefore, where the data is stored.

For example, there is a technology
called “on-device compute.”

Instead of uploading
all the data up to the cloud

and then process them up there,

we would process the data
locally on your devices,

either your iPhone or your laptop,

and from there, only a subset of the data

after processing
gets uploaded to the cloud.

If you imagine my data centers
literally as a box,

this approach is literally
thinking outside the box.

By reducing the amount of data
uploaded onto the cloud,

we get to bend that demand curve
a little bit more.

To sum it up, as you can see,
there are multiple technology inventions

being used to address
this steep growth curve of data.

As a result of applying
hardware accelerators,

software optimization,
and on-device compute,

all the cloud computers
are holding strong for all that usage.

If I can give you an example,

in the last eight minutes or so
since I started this talk,

more than 4000 hours worth of video
got uploaded to YouTube,

about 120 million messages
were sent by text,

and my daughter may have just uploaded
another 50 photos of hers.

Keeping up with all the data growth
will continue to be a challenge,

but I am optimistic
that we will continue to have

new inventions being
developed along the way.

I’m committed to support
all of my cloud users,

ensuring there is enough space
to carry their stories.

I’m committed to be my mom’s tech support,
guarding her photos and videos,

but I do plan to go talk to my daughter

about her thousands
of the same shoe photos.

Someday, even when I no longer
have the support from Moore’s Law,

I’m confident that my colleagues and I

will find ways to operate millions
of our cloud computers,

and operate them well,

because these computers
are indeed so important to us.

They carry not only all the data,
but also our memories,

our laughter,

and the stories of our lives.

抄写员:yoojae lee
审稿人:David DeRuwe

我妈妈今年 76 岁,我是她的技术支持。

她在
使用 iPad 和智能手机时遇到的最常见问题是数据问题。

她已经积累了太多的数据
,以至于她的空间已经用完了。

现在,在
我们这一代人的另一端

,我十几岁的女儿来了,

她拍了 50 张
与她完全相同的匡威鞋的照片,

只是为了能够挑选出最好的一款
来发布在她的 Instagram 上。

结果,我家的整个
iCloud 存储空间都用完了。

对我来说,这不仅仅是个人
问题,也是工作挑战。

你看,我在 Facebook
担任工程总监,

我的工作是确保每个用户
都有足够的空间,

这就是我所说的云
容量。

今天,我的公司
有大约 26 亿用户。

让我们想象这些用户中的每一个都
上传了一张照片。

如果我将这些照片打印
成我们的四乘六

并并排堆叠,

这些照片将足以

让它们不仅绕
我们整个美丽的地球

一周,而且总共绕十圈。

为了支持这些用户,
我们不仅需要存储所有数据,

还需要处理它们,
这意味着我们需要计算机。

其中很多。

一家
名为 IDC 的技术研究公司预测

,到 2025 年,存储在云中的数据
将达到 100 泽字节,

即 100 万亿字节。

按照这个速度,简单地增加更多的计算机
将不再是一个可持续的解决方案。

您可能想知道:

数据一直
像这样快速增长有一段时间了

,为什么它很重要?

你说得对。

我从不担心,因为
我口袋里有一种超能力

,这就是摩尔定律。

那么什么是摩尔定律?

这是一个预测,说
计算机以相同的成本变得更快。

1965 年,
英特尔的联合创始人戈登·摩尔

预测硅芯片上的晶体管数量

将每年翻一番。

半导体行业
将他的预测视为北极星

,他们迎接了挑战。

十年后,摩尔先生
修改了他

对晶体管数量
每两年翻一番的预测。

半导体行业
再次遇到了这一挑战。 1994

年我第一次从内蒙古来美国读研究生的时候

我的教授给我们讲了摩尔定律

,他基本上说
不用担心如何

跟上不断增长的数据,
因为摩尔定律在处理它 .

直到最近,过去 50 年都是如此。

现在已经达到硅芯片的物理极限

,因此,我们现在
开始看到放缓。

举个粗略的例子,


我指甲大小的硅芯片上,

可以有多达 500 亿个晶体管。

500 亿 - 这是
我们在这个星球上拥有的树木数量的 10 倍。

认识到摩尔定律
可能不再对我们有那么大的帮助

,整个云基础设施
行业共同努力

寻找新的方法
来降低需求曲线。

所以在这里你可以看到,这条绿线
表示我们的计算机供应量

,这条蓝线表示
存储和处理数据的需求不断增加。

正如你所看到的,供应
和需求大部分都保持不变,

但你也开始看到差距,
而且这个差距正在扩大。

这意味着我亲爱的妈妈可能
不得不删除她珍贵的视频和照片。

这意味着您和我可能需要
为我们的 iCloud 存储支付更多费用。

这也意味着将建造越来越多的
数据中心,在地球上

消耗更多的电力、冷却
和空间。

幸运的是,我和我的同事们
在这里确实有一些方法,

它们中的每
一个都会一次一点地降低需求曲线。

在这里,我想与您分享其中的
三个:

我们用来弯曲需求曲线的第一种方法

是引入硬件加速器。

硬件
加速器就像医院里的专家。

虽然普通医生可以很好地
对很多患者进行分类和收治,

但她通过将特定
患者推荐给专家来扩大自己的规模。

在数据处理领域,

这些可以是视频编码

或密集的
人工智能计算。

将硬件
加速器添加到

计算机就像
为普通医生添加专家支持,

并且

通过专注于他们最擅长的事情,他们一起工作变得更有效率。

举个粗略的例子,

如果我让我的计算机识别
我女儿的鞋子照片的图像,使用普通 CPU

大约需要一天的时间

但使用硬件加速器,
这可以在 30 分钟内完成。

硬件加速器
有助于弯曲需求曲线。

弯曲需求曲线的第二种方法

是使用
软件优化我们的云平台。

现在,平台
优化就像玩俄罗斯方块游戏。

不同颜色的

瓷砖就像
我的数据中心中不同类型的计算机。

一些用于存储,
而另一些用于计算。

我会将它们分散并堆叠起来,

以便尽可能少地
浪费空间,

这意味着我可以减少闲置
计算机并提高利用率。

这款俄罗斯方块游戏
非常精美,甚至是 3D 的,

这意味着我可以利用时区。

当亚洲大陆的用户
下线时,

我可以使用
这些计算机

为我们的北美用户处理数据。

就像任何游戏玩家一样,

我会
从一天到另一天跟踪我的游戏分数,

并且每天我都会进一步弯曲
这条需求曲线。

现在第三种方法,也是最新的,
是改变数据的使用方式

,即数据的存储位置。

例如,有一种
称为“设备上计算”的技术。

我们不会将
所有数据上传到云端

然后在那里进行处理,

而是会
在您的设备(

iPhone 或笔记本电脑)上本地处理数据,

然后从那里,处理后的数据只有一部分

上传到 云。

如果您将我的数据中心想象
成一个盒子,那么

这种方法实际上就是
在盒子外面思考。

通过减少
上传到云端的数据量,

我们可以进一步弯曲需求
曲线。

总而言之,正如您所看到的,
有多种技术发明

被用来解决
这种陡峭的数据增长曲线。

由于应用了
硬件加速器、

软件优化
和设备端计算,

所有云计算机
都在所有这些使用方面保持强劲。

如果我能举个例子,

在我开始这个演讲后的最后八分钟左右

价值超过 4000 小时的视频
被上传到 YouTube,

大约 1.2 亿条信息
是通过文本发送的,

而我女儿可能刚刚上传了
另一个 她的 50 张照片。

跟上所有数据的增长
仍将是一项挑战,

但我乐观地
认为,我们将继续开发

新的发明

我致力于支持
我所有的云用户,

确保有足够的空间
来承载他们的故事。

我致力于成为我妈妈的技术支持,
保护她的照片和视频,

但我确实计划去和我的女儿

谈谈她的
数千张相同的鞋子照片。

总有一天,即使我不再
有摩尔定律的支持,

我相信我和我的同事们

会找到方法来操作我们数以百万
计的云计算机,

并且将它们操作好,

因为这些计算机
对我们来说确实如此重要。

它们不仅承载了所有数据,还承载了
我们的记忆、

我们的笑声

和我们生活中的故事。