Massivescale online collaboration Luis von Ahn

how many of you have had to fill out

some sort of web form or even asked to

read a distorted sequence of characters

like this how many of you found it

really really annoying okay outstanding

so I invented that or I was one of the

people who did it that thing is called a

CAPTCHA and the reason is there’s to

make sure that you the entity filling

out the form are actually a human and

not some sort of computer program that

was written to submit the form millions

and millions of times the reason it

works is because humans at least non

visually impaired humans have no trouble

reading these distorted squiggly

characters whereas computer programs

simply can’t do it as well yet so for

example in the case of Ticketmaster the

reason you have to type these distorted

characters is to prevent scalpers from

writing a program that can buy millions

of tickets two at a time they’re not

captures are used all over the Internet

and since they’re used so often a lot of

times the precise sequence of random

characters that are shown to the user is

not so fortunate so this is an example

from the Yahoo registration page the

random characters that happen to be

shown to the user where waait which of

course spell a word but the best part is

the message that the Yahoo help this got

about 20 minutes later

this person thought may need it way this

of course is not a bad as this poor

person who okay now captcha project is

something that we did here at carnagie

mellon over ten years ago and it’s been

used everywhere let me not tell you

about a project that we did a few years

later which is sort of the next

evolution of CAPTCHAs just a project

that we call recaptcha which is

something that we started here at

carnegie mellon then we turned it into a

startup company and then about a year

and a half ago google actually acquired

this company so let me tell you what

this project started ok so this project

started from the following realization

it turns out that approximately 200

million CAPTCHAs are typed every day by

people around the world ok when I first

heard this I was quite proud of myself I

thought look at the impact that my

research has had but then I started

feeling bad so here’s the thing each

time you type a CAPTCHA essentially you

waste 10 seconds of your time and if you

multiply that by 200 million you get

that humanity as a whole is wasting

about 500,000 hours every day typing

these annoying CAPTCHAs so then I

started feeling bad

and then I started thinking well of

course we can’t just get rid of CAPTCHAs

music it’s the security of the web sort

of depends on them but then I started

thinking is there any way we can use

this pepper for something that is good

for Humanity so see here’s the thing

while you’re typing a CAPTCHA during

those 10 seconds your brain is doing

something amazing your brain is doing

something that computers can not yet do

so can we get you to do use for work for

those 10 seconds another way of putting

it is there’s some humongous problem

that we cannot yet get computers to

solve but that somehow we can split into

tiny 10 second chunks such that each

time somebody solves a CAPTCHA they

solve a little bit of this problem and

the answer to that is yes and this is

what we’re doing now so what you may not

know is that nowadays while you’re

typing a CAPTCHA not only are you

authenticating yourself as a human but

in addition you’re actually helping us

to digitize books okay so let me explain

how this works so there’s a lot of

projects out there trying to digitize

books google has won the Internet

Archive has won amazon now with the

kindle store in the digitize books

basically the way this works is you

start with an old book like a physical

thing you’ve seen those things right

like a Laker hope so you start with a

book and then you scan it now scanning a

book is like taking a digital photograph

of every page of the book it gives you

an image for every page of the book this

is an image with text for every page of

the book the next step in the process is

that the computer needs to be able to

decipher all of the words in this image

that’s done using a technology called

OCR for optical character recognition

which takes a picture of text and tries

to figure out what the text is in there

now the problem is that OCR is not

perfect especially for older books where

the ink has faded and the pages have

turned yellow OCR cannot recognize a lot

of the words for example the things that

we’re in more than 50 years ago the

computer cannot recognize about thirty

percent of the words so what we’re doing

now is we’re taking all of the words

that the computer cannot recognize and

we’re getting people to read them for us

while they’re typing a CAPTCHA on the

Internet okay so next time you type a

CAPTCHA these words that you’re typing

are actually words that are coming from

books that are being digitized that the

computer could not recognize I know the

reason we have two words nowadays

instead of one is because you see one of

the words is a word that we that the

system just got out of a book it didn’t

know what it was and it’s going to

present it to you but since it doesn’t

know the answer for it it cannot grade

it for you so what we do is we give you

another word one for which the system

does know the answer okay we don’t tell

you which ones which and we say please

type both and if you type the correct

word for the one for which the system

already knows the answer

it assumes you’re human and it also get

some confidence that you tap the other

word correctly and if we repeat this

process to like 10 different people and

all of them agree on what the new word

is then we get one more word digitized

accurately so this is how the system

works and basically since we released it

about three or four years ago a lot of

websites that started switching from the

old capture where people waste at the

time to the new capture where people are

helping to digitize books so for example

Ticketmaster so every time you buy

tickets on Ticketmaster you have to

digitize a book Facebook every time you

add a friend or poke somebody you help

to digitize a book Twitter and about

350,000 other sites are all using

reCAPTCHA and in fact the number of

sites that are using reCAPTCHA so high

that the number of words that we’re

digitizing per day is really really

large it’s about 100 million a day which

is the equivalent of about two and a

half million books a year and this is

all being done one word at a time by

just people typing CAPTCHAs on the

Internet

now of course since we’re doing so many

words per day funny things can happen

and this is especially true because see

now we’re giving people two randomly

chosen English words next to each other

ok so funny things can happen so for

example we present that this word it’s

the word Christians there’s nothing

wrong with it but if you presented along

with another randomly chosen word bad

things can happen but it’s even worse

because the particular website where we

show this actually happened to be called

the Embassy of the kingdom of God

yep here’s another really bad one at

John Edwards com

so we keep on insulting people are left

and right every day now of course we’re

not just insulting people see here’s the

thing since we’re presenting two

randomly chosen words there’s

interesting things can happen so this is

actually has given rise to a really big

internet meme that tens of thousands of

people have participated in which is

called CAPTCHA art I’m sure some of you

have heard about it here’s here’s how it

works okay imagine your is in you’re

using the internet and you see a capture

that you think is somewhat peculiar like

this capture then what you’re supposed

to do is you take a screenshot of it

then of course you fill out the CAPTCHA

because you helped us digitize a book

but then first you take a screenshot and

then you draw something that is related

to it that’s how it works there are tens

of thousands of these some of them are

very cute

some of them are funnier

and some of them like paleontological SH

fizzle they contain Snoop Dogg okay so

this is my favorite number of reCAPTCHA

so this is the favorite thing that I

like about this whole project this is

the number of distinct people that have

helped us digitize at least one word out

of a book to recapture 750 million which

is a little over ten percent of the

world’s population has helped us

digitize human knowledge and it is words

it is numbers like these that motivate

my research agenda so the question that

motivates my research is the following

if you look at humanity’s large-scale

achievements these really big things

that humanity has gotten together and

done like historically like for example

building the pyramids of Egypt or the

Panama Canal or putting a man on the

moon there’s a curious fact about them

and it is that they were all done with

about the same number of people it’s

weird they’re all done with about a

hundred thousand people and the reason

for that is because before the internet

coordinating more than 100,000 people

let alone paying them was essentially

impossible but see now with the internet

I’ve just shown you a project where

we’ve gotten 750 million people to help

us digitize human knowledge so the

question that motivates my research is

if we can put a man on the moon with a

hundred thousand what can we do with 100

million so based on this question we’ve

had a lot of different projects that

we’ve been working on let me tell you

about one that I’m most excited about

this is something we’ve been so send me

quietly working on for the last year and

a half or so it hasn’t yet been launched

its called duolingo since it hasn’t been

launched

I can trust the other that okay so this

is a project here’s how it started it

started with me posting a question to my

graduate student Severin hacker okay

that’s Severin hacker so I post a

question to my graduate student by the

way is that you did hear me correctly

his last name is hacker so I post this

question to him how can we get a hundred

million people translating the web into

every major language for free okay so

there’s a lot of things to say about

this question first of all translating

the web so right now the web is

partitioned into multiple languages a

large fraction of it is in English if

you don’t know any English you can’t

access it but there’s large fractions in

other different languages and if you

don’t know those languages you can’t

access it so i would like to translate

all of the web or at least most of the

web into every major language okay so

that’s that’s what I would like to do

now some of you may say well why can’t

we use computers to translate so why

can’t we use machine translation machine

translation nowadays it starting to

translate some sentences here and there

why can’t we use it to translate the

whole web well the problem with that is

that it’s not yet good enough and it

probably won’t be for the next 15 to 20

years it makes a lot of mistakes even

when it doesn’t make a mistake since it

makes so many mistakes you don’t know

whether to trust it or not so let me

show you an example of something that

was translated with a machine it’s

actually it was a forum post with

somebody who’s trying to ask a question

about JavaScript it was translated from

Japanese into English so I’ll just let

you read this person starts apologizing

from the fact that it’s translated with

a computer okay so the next sentence is

going to be the preamble to the question

so he’s just explaining something I

remember it’s a question about

JavaScript

okay then

then comes the first part of the

question

then comes my favorite part of the

question

and then comes the ending which is my

favorite part of the whole thing

okay so computer translation not yet

good enough okay so back to the question

so we need people to translate the whole

web okay so now the next question you

may have is well why can’t we just pay

people to do is we could pay

professional language translators

translate the whole web we could do that

unfortunately it would be extremely

expensive for example translating a tiny

tiny fraction of the whole web Wikipedia

into one other language Spanish it you

know the Wikipedia exists in Spanish but

it’s very small compared to the size of

English is about twenty percent is the

size of English if we wanted to

translate the other eighty percent into

Spanish it would cost at least 50

million dollars and this is even at the

most exploitive outsourcing country out

there so to be very expensive so what we

want to do is wanna get 100 million

people translating the web into every

major language for free okay now if this

is what you want to do you pretty

quickly realize you’re going to run into

two pretty big hurdles two big obstacles

okay the first one is a lack of

bilinguals okay so I don’t even know if

there exists a hundred million people

out there using the web who are

bilingual enough to help us translate

that’s a big problem the other problem

that you’re going to run into is a lack

of motivation how are we going to

motivate people to actually translate

the web for free this is normally you

have to pay people to do this so now

we’re going to motivate him to do it for

free now when we were starting to think

about this we were blocked by these two

things but then we realized there’s

actually a way to solve both of these

problems with the same solution there’s

a way to kill two birds with one stone

and that is to transform language

translation into something that millions

of people want to do and that also helps

with the problem lack of bilinguals and

that is language education okay so it

turns out that today there are over 1.2

billion people learning a foreign

language people really really want to

learn a foreign language and it’s not

just because they’re being forced to do

so in school for example in the United

States alone there over 5 million people

who have paid over five hundred dollars

for software to learn a new language

okay so people really really want to

learn a new language so what we’ve been

working on for the last year and a half

is a new website it’s called duolingo

where the basic idea is people learn a

new language for free while

simultaneously translating the web and

so basically they’re learning by doing

okay so the way this works is whenever

just a beginner we give you very very

simple sentences there’s of course a lot

of very simple sentences on the web we

give you very very simple sentences

along with what each word means okay and

as you translate them and as you see how

other people translate them you start

learning the language and as you as you

get more and more advanced we give you

more and more complex sentences to

translate but at all times you’re

learning by doing ok now the crazy thing

about this master method is that it

actually really works okay first of all

people are really really learning a

language we’re mostly done building it

and now we’re testing it people really

can learn a language with it and and

they learn it about as well as the

leading language learning software so

people really do learn a language and

not only do they learn it as well but

actually it’s way more interesting

because you see we do a lingual people

are actually learning with real content

as opposed to learning with made up

sentences people are learning with real

content which is inherently interesting

they the other thing so people really do

learn a language but perhaps more

surprisingly the translations that we

get from people using the site even

though they’re just beginners the

translations that we get are as accurate

as those professional language

translators which is very surprising so

let me show you one example this is

sentence that was translated from german

into english the top is the german the

middle is an english translation that

was done by somebody who is a

professional language translator who we

paid twenty cents a word for this

translation and the bottom is a

translation by users of duolingo none of

whom knew any German before they started

using the site if you can see it’s

pretty much perfect now of course we

play a trick here to make the

translations as good as professional

language translators we combine the

translations of multiple beginners to

get the quality of a single professional

translator ok now even though we’re

combining the translations the the site

actually can translate pretty fast so

let me show you this is our estimates of

how fast we could translate wikipedia

from english into spanish remember this

is 50 million dollars worth of value ok

so if we wanted to translate Wikipedia

into Spanish we could do it in five

weeks with a hundred thousand active

users and we could do it in about 80

hours with a million active users since

all the projects that my group has

worked on so far have gotten millions of

users were hopeful that we’ll be able to

translate extremely fast with this

project now the thing that I’m most

excited about with duolingo is that I

think this provides a fair business

model for language education so here’s

the thing the current business model for

language education is the student pace

in particular the student pays rosetta

stone five hundred dollars

that’s the current business model the

problem with this business model is that

ninety-five percent of the world’s

population doesn’t have five hundred

dollars so it’s extremely unfair towards

the poor okay this is totally biased or

so rich now let’s see in duolingo

because while you learn you’re actually

creating value you are translating stuff

which for example we could charge

somebody for translations so this is how

we could monetize this since people are

creating value while you’re learning

they don’t have to pay with their money

they pay with their time but the magical

thing here is that they’re paying with

their time but that is time that would

have have to be in spent anyways

learning the language okay so the nice

thing about dueling was I think it

provides a fair business model one that

doesn’t discriminate against poor people

so here’s the site

here’s the site we haven’t yet launched

but if you go there you can sign up to

be part of our private beta which are

probably going to start in about three

or four weeks we haven’t yet launched

this duel and go by the way I’m the one

talking here but actually duolingo is

the work of a really awesome team some

of whom are here so thank you

你们中有多少人不得不填写

某种网络表格,甚至被要求

阅读这样一个扭曲的字符序列

你们中有多少人觉得这

真的很烦人 好吧 很出色

所以我发明了这个 或者我是这样做的

人之一 它被称为

验证码,原因是

要确保您

填写表格的实体实际上是一个人,而

不是某种计算机程序,它

被编写成数百万次提交表格

的原因,它的

工作原理 是因为人类至少没有

视力障碍的人可以毫无困难地

阅读这些扭曲的波浪形

字符,而计算机程序

根本无法做到这一点,因此

例如在 Ticketmaster 的情况下,

您必须输入这些扭曲的

字符的原因是为了防止黄牛 从

编写一个可以一次购买数

百万张门票的程序,他们没有被

捕获在整个互联网上都被使用

,因为他们经常被使用很多

次预

向用户显示的随机字符序列

并不那么幸运,所以这

是雅虎注册页面中的一个示例

,碰巧向用户显示的随机字符

在哪里等待

当然拼写一个单词,但最好的部分

是 大约 20 分钟后收到雅虎帮助的消息,

这个人认为可能需要它,

这当然不是坏事,因为这个可怜的

人现在还好验证码项目

是我们十年前在卡纳基梅隆做的事情

,而且一直

到处都在使用,我不会告诉你

几年后我们做的一个项目,

它是 CAPTCHA 的下一个

演变,只是一个

我们称之为 recaptcha 的项目,

这是我们在卡内基梅隆大学开始的,

然后我们把它变成了一家

初创公司 公司,然后大约

一年半前,谷歌实际上收购了

这家公司,所以让我告诉你

这个项目是从什么开始的,所以这个项目

是从以下实现开始

的 世界各地

的人们每天输入大约 2 亿个验证码

好吧,当我第一次

听到这个消息

时,我为自己

感到非常自豪

输入 CAPTCHA 基本上你

浪费了 10 秒的时间,如果你

将其乘以 2 亿,你会

发现整个人类每天都在浪费

大约 500,000 小时输入

这些烦人的 CAPTCHA,所以我

开始感觉很糟糕

,然后我开始想好

当然,我们不能只是摆脱 CAPTCHA

音乐,网络的安全

性取决于它们,但后来我开始

思考有没有什么办法可以将

这种胡椒用于对人类有益的事情,

所以当你看到这里的东西

‘在

这 10 秒内输入验证码 你的大脑正在做

一些令人惊奇的事情 你的大脑正在做

一些计算机还不能做的事情

我们可以让你在

这 10 秒内完成工作吗 换句话说,

有一些巨大的

问题我们还不能让计算机来

解决,但是我们可以以某种方式将其分成

10 秒的小块,这样

每次有人解决验证码时,他们就

解决了这个问题的一点点,并得到

了答案 是的,这

就是我们现在正在做的事情,所以你可能不

知道的是,现在你在

输入 CAPTCHA 时,不仅是在

验证自己是一个人,而且

实际上是在帮助我们

将书籍数字化 所以让我解释一下它是

如何工作的,所以有很多

项目试图将书籍数字化

谷歌赢得了互联网

档案馆现在赢得了亚马逊的

Kindle 商店数字化书籍

基本上这种工作方式是你

从一本旧书开始,比如 一个物理的

东西,你已经看到了这些东西,

就像湖人队的希望一样,所以你从一

本书开始,然后你扫描它现在扫描一

本书就像给书的每一页拍一张数码照片

,它会给你

一个图像 e 对于书的每一页,这

是一个包含书每一页的文本的图像,

该过程的下一步

是计算机需要能够

破译该图像中的所有单词,这

是使用称为 OCR 的技术完成

的 光学字符识别

,它拍摄文本图片并

试图找出其中的文本

现在问题是 OCR 并不

完美,特别是

对于墨水褪色和页面

变黄的旧书籍 OCR 无法识别

很多 例如,

我们在 50 多年前所处的事物中,

计算机无法识别大约 30

% 的单词,所以我们现在正在做的

是,我们正在提取所有

计算机无法识别的单词,而

我们’ 让人们

在互联网上输入 CAPTCHA 时为我们阅读它们

好的,所以下次您输入 CAPTCHA 时,您输入的

这些

单词实际上是来自

正在数字化的书籍中的单词,

com puter 无法识别我知道

我们现在有两个词

而不是一个词的原因是因为你看到其中

一个词是我们

系统刚刚从一本书中取出的词,它不

知道它是什么,它会

将其呈现给您,但由于它不

知道它的答案,因此无法

为您评分,所以我们所做的是我们给您

另一个单词一个系统

确实知道答案的单词 好的,我们不会告诉

您哪个是哪个 我们说请

同时输入,如果您

为系统

已经知道答案的

那个输入正确的单词,它会假定您是人类,并且它也会

对您正确点击另一个单词有信心

,如果我们重复此

过程以 就像 10 个不同的人

,他们都同意新词是什么,

然后我们再准确地数字化一个词

,这就是系统的

工作原理,基本上自从我们大约三四年前发布它以来,

很多

网站开始从 人们的

旧捕获 把时间浪费在

人们

帮助数字化书籍的新

捕获上

Twitter 和大约

350,000 个其他网站都

在使用 reCAPTCHA,事实上

,使用 reCAPTCHA 的网站数量如此之多

,以至于我们每天数字化的单词

数量非常

大,每天大约 1 亿个

,相当于 每年大约有两百

五十万本书,而这

一切都是

通过人们在互联网上输入

验证码来完成的 因为现在看,

我们给人们两个随机

选择的英文单词

ok 所以有趣的事情可能会发生所以

例如我们提出这个词它

是词 Christians there’s nothing

wron g 用它,但是如果你

和另一个随机选择的词一起出现,

坏事可能会发生,但更糟糕

的是,因为我们展示这个的特定网站

实际上恰好被称为

上帝王国的大使馆,

是的,这是 John Edwards 的另一个非常糟糕的网站

com

所以我们现在每天都在不断地侮辱人们

当然我们

不只是在侮辱人们看到这就是

事情因为我们呈现了两个

随机选择的词

可能会发生有趣的事情所以这

实际上已经引起了 成千上万的人参与的非常大的

互联网模因,

这被

称为验证码艺术我相信你们中的一些人

已经听说过它这就是它的

工作原理好吧想象你在

使用互联网并且看到捕获

你认为

这个捕获有点奇怪,那么

你应该做的是截取它,

然后你当然要填写验证码,

因为你帮助我们数字化了一个 bo 好的,

但是首先您截取屏幕截图,

然后绘制与之相关的东西,

这就是它的工作原理,其中有

成千上万个,其中一些

非常可爱

,其中一些更有趣

,其中一些像古生物学的 SH

fizzle 它们包含 Snoop Dogg 好吧,

这是我最喜欢的 reCAPTCHA 数字,

所以这是整个项目中我最喜欢的地方

这是

帮助我们将书中至少一个单词数字化

以重新获得 7.5 亿字的不同人的

数量 世界上超过百分之十

的人口帮助我们

将人类知识数字化,

正是这些数字激发了

我的研究议程,所以

如果你看看人类的大规模

成就,那么激发我研究的问题如下

人类聚集在一起

并像历史一样完成的大事,例如

建造埃及金字塔或

巴拿马运河或将一个人放在

m上 关于他们有一个奇怪的事实

,那就是他们都完成了

大约相同数量的人这很

奇怪他们都完成了大约

十万人,

原因是因为在互联网之前

协调了超过 100,000 人

更不用说付钱给他们了,

但现在看看互联网

我刚刚向你展示了一个项目,

我们已经让 7.5 亿人帮助

我们将人类知识数字化,

所以激发我研究的问题是

我们是否可以让一个人 10 万的月亮

我们可以用 1 亿做什么

所以基于这个问题,我们

已经有很多不同的项目,

我们一直在努力让我告诉

你一个我最

兴奋的事情

在过去一年半左右的时间里,我们一直在悄悄地工作

它还没有推出

它叫做duolingo,因为它还没有

推出

我可以相信另一个好的,所以这

是一个项目,这里是如何 它开始了

首先是我向我的研究生 Severin 黑客发布了一个问题,

好吧

,那是 Severin 黑客,所以我

向我的研究生发布了一个问题,

顺便说一句,你确实没听错

他的姓是黑客,所以我把这个

问题发布给他怎么能 我们有 1

亿人免费将网络翻译成

各种主要语言,好吧,

所以关于这个问题有很多话要说,

首先

翻译网络,所以现在网络被

划分为多种语言,其中

很大一部分是 英语,如果

你不懂任何英语,你就无法

访问它,但有很大一部分是

其他不同语言的,如果你

不知道这些语言,你就无法

访问它,所以我想翻译

所有的网络或在 至少大部分

网络都可以翻译成每种主要语言

,这就是我现在想做的事情

,你们中的一些人可能会说,为什么

我们不能使用计算机翻译,那么为什么

我们现在不能使用机器翻译机器

翻译呢? 几天它开始在

这里和那里翻译一些句子

为什么我们不能用它来很好地翻译

整个网络问题

是它还不够好而且它

可能不会在接下来的 15 到 20

即使它没有犯错也有很多错误因为它

犯了很多错误你不知道

是否相信它所以让我

给你看一个用机器翻译的东西的例子

它实际上是一个 论坛帖子

有人试图问一个关于 JavaScript 的问题

,它是从

日语翻译成英文的,所以我会让

你读这个人开始道歉

,因为它是用计算机翻译的,

所以下一句

将是 问题的序言,

所以他只是在解释一些

事情 整个事情的一部分

好的 所以计算机翻译还

不够好 好的 回到问题

所以我们需要人们翻译整个

网络

我们是否可以付钱给

专业的语言翻译人员

翻译我们可以做到的整个网络,

不幸的是,这将非常

昂贵,例如将

整个网络维基百科的一小部分翻译

成另一种语言西班牙语,你

知道维基百科以西班牙语存在,但

它非常小 与英语的大小相比,

大约是英语的 20%

如果我们想

将另外 80% 翻译成

西班牙语,至少要花费 5000

万美元,而这甚至是

最剥削的外包国家

,所以非常 昂贵,所以我们

想做的是让 1 亿

人免费将网络翻译成各种

主要语言

retty

很快意识到你会遇到

两个相当大的障碍 两个大障碍

好吧 第一个是缺乏

双语 好吧 所以我什至不知道是否

有 1 亿

人在使用网络并且

足够双语 帮助我们翻译

这是一个大问题

你会遇到的另一个问题是

缺乏动力 我们如何

激励人们真正

免费翻译网络 这通常是你

必须付钱给人们这样做 现在

我们要激励他

免费去做现在当我们开始

思考这个问题时,我们被这两件事所阻碍,

但后来我们意识到

实际上有一种方法可以

用相同的解决方案来解决这两个问题

用一块石头杀死两只鸟

,这就是将语言

翻译转变为

数百万人想做的事情,这也

有助于解决缺乏双语者的问题,

这就是语言教育,所以

事实证明 今天有超过 12

亿人学习

外语 人们真的很想

学习外语,这

不仅仅是因为他们在学校被迫这样

做,例如

仅在美国就有超过 500 万人

拥有

为学习一门新语言的

软件支付了超过 500 美元的费用

人们

翻译网络的同时免费学习一门新语言,

所以基本上他们是通过做事来学习的,

所以这种工作方式是

每当初学者时,我们都会给你非常非常

简单的句子,当然还有

很多非常简单的句子 网络我们

给你非常非常简单的句子

以及每个单词的含义,

当你翻译它们时,当你看到

其他人如何翻译它们时,你开始

学习语言和 当你

变得越来越先进时,我们会为你提供

越来越多的复杂句子来

翻译,但在任何时候你都在

通过做来学习,现在

这个大师方法的疯狂之处在于它

实际上真的很好用,首先

人们是 真的真的在学习一门

语言,我们基本上已经完成了它的构建

,现在我们正在测试它,人们真的

可以用它学习一门语言,而且

他们学习它以及

领先的语言学习软件,所以

人们真的学习一门语言,而

不是 只是他们也学了它,但

实际上它更有趣,

因为你看到我们做一个语言的

人实际上是在学习真实的内容

而不是用虚构的

句子学习人们正在学习真实的

内容,这本质上是有趣的,

他们是另一件事所以 人们确实确实

学习了一门语言,但也许更

令人惊讶的是,我们

从使用该网站的人那里得到的翻译,

即使他们只是初学者

,我们得到的翻译 t

和那些专业的语言翻译一样准确,

这非常令人惊讶,所以

让我给你看一个例子,这是

从德语翻译

成英语的句子,顶部是德语,

中间

是由专业人士完成的英文翻译

我们

为这个翻译支付了 20 美分的语言翻译

,底部是

duolingo 用户的翻译,

在他们开始使用该网站之前,他们都不懂德语,

如果你能看到它

现在已经非常完美了,当然我们

在这里玩一个把戏 为了使

翻译与专业语言翻译一样好,

我们结合

了多个初学者的翻译,以

获得单个专业翻译的质量,

即使我们正在

结合翻译,该网站

实际上可以翻译得非常快,所以

让我告诉你这个 是

我们对将维基百科

从英语翻译成西班牙语的速度的估计吗?记住这

是 5000 万美元的麦汁 h 的价值还可以,

所以如果我们想将 Wikipedia 翻译

成西班牙语,我们可以在 5

周内完成,拥有 10 万活跃

用户,我们可以在大约 80

小时内完成,拥有 100 万活跃用户,

因为我的团队参与了所有项目

到目前为止,已经有数百万

用户希望我们能够

通过这个项目进行极快的翻译

现在我对

duolingo 最兴奋的是,我

认为这为语言教育提供了一个公平的商业

模式,所以这里

是 当前

语言教育的商业模式是学生的进度

,尤其是学生支付

五百美元

这就是当前的商业模式

这种商业模式的问题

是世界上百分之九十五的

人口没有五百

美元 所以这对穷人非常不公平

好吧这完全有偏见或

如此富有现在让我们看看duolingo,

因为当你学习时你实际上是在

创造价值 正在翻译的东西

,例如我们可以向

某人收取翻译费用,所以这就是

我们可以通过这种方式获利的方式,因为人们正在

创造价值,而你正在学习

他们不必

用他们的时间支付他们的钱,但

这里的神奇之处 是他们付出了

自己的时间,但那是

无论如何都必须花在

学习语言上的时间,所以

决斗的好处是我认为它

提供了一种不歧视穷人的公平商业模式

所以这里是网站

这里是我们尚未推出的网站,

但如果你去那里,你可以注册

成为我们的私人测试版的一部分,这

可能会在大约三

到四个星期内开始我们还没有推出

这场决斗和 顺便说一句,我是在

这里说话的人,但实际上 duolingo

是一个非常棒的团队的作品

,其中一些人在这里,所以谢谢