Redefining the dictionary Erin McKean

now I have any of y’all ever looked up

this word you know in a dictionary but

yeah that’s what I thought um how about

this word

you know I’ll show it to you

lexicography the practice of compiling

dictionaries known as we’re very

specific that word compile the

dictionary is not carved out of a piece

of granite out of a lump of rock it’s

made up of lots of little bits little

discrete that spelled D is CR e te bits

and those bits are words now one of the

perks of being a lexicographer besides

getting to come to Ted is that you get

to say really fun words like

lexicographical lexicographical has this

great pattern is called a double dactyl

and by just by saying double dactyl I’ve

sent the geek needle all the way into

the red

but lexicographical is the same pattern

as higgledy-piggledy right it’s a fun

word to say and I get to say it a lot

now one of the non perks of being a

lexicographer is that people don’t

usually have a kind of warm fuzzy

snuggly image of the dictionary right

nobody hugs their dictionaries but what

people really often think about the

dictionary is they think more like this

just to let you know I do not have a

lexicographical whistle but people think

that my job is to let the good words

make that difficult left-hand turn into

the dictionary and keep the bad words

out but the thing is I don’t want to be

a traffic cop for one thing I just do

not do uniforms and for another deciding

what words are good and what words are

bad is actually not very easy and it’s

not very fun and when part of your job

are not easy or fun you kind of look for

an excuse not to do them so if I had to

think of some kind of occupation as a

metaphor for my work I would much rather

be a fisherman I want to throw my big

net into the deep blue ocean of English

and see what marvelous creatures I can

drag up from the bottom but why do

people want me to direct traffic when I

would much rather go fishing well

I blame the Queen well why do I blame

the Queen well first of all I blame the

Queen because it’s funny but secondly I

blame the Queen because dictionaries

have really not changed our idea of what

a dictionary is has not changed since

her reign the only thing that Queen

Victoria would not be abused by in

modern dictionaries is our inclusion of

the f-word which has happened in

American Dictionary since 1965 so

there’s this guy right Victorian era

James Murray first after the Oxford

English Dictionary I do not have that

hat I wish I had that hat so he’s really

responsible for a lot of what we

consider modern in dictionaries today

when a guy who looks like that in that

hat is the face of modernity you have a

problem and so James Murray could get a

job on any dictionary today there would

be virtually no learning curve and of

course people are saying okay Computers

Computers what about computers the thing

about computers is I love computers I

mean I’m a huge geek I love computers I

would go on a hunger strike before I let

them take away google book search from

me but computers don’t do much else

other than speed up the process of

compiling dictionaries they don’t change

the end result because what a dictionary

is is it’s Victorian design merged with

a little bit of modern propulsion its

steampunk what we have is an electric

velocipede you know we have Victorian

design with an engine on it that’s all

the design has not changed and okay what

about online dictionaries right online

dictionaries must be different this is

the Oxford English Dictionary online one

of the best online dictionaries this is

my favorite word by the way erinaceus

pertaining to the hedgehog family of the

nature of a hedgehog very useful word

so look at that

online dictionaries right now our paper

thrown up on a screen this is flat look

how many links there are in the actual

entry - right

those little buttons I have them all

expanded except for the date chart so

there’s not very much going on here

there’s not a lot of click enos and in

fact online dictionaries replicate

almost all the problems of print

except for search ability and when you

improve search ability you actually take

away the one advantage of print which is

serendipity serendipity is when you find

things you weren’t looking for because

finding what you are looking for is so

damn difficult now when you think about

this what we have here is a hand but

problem does everybody know the hand but

problem woman’s making a hand for a big

family dinner she goes to cut the butt

off the hammer throw it away and she

looked at this piece of ham she’s like

this was a perfectly good piece of ham

why am i throwing this away stop well my

mom always did this so she calls it mom

and she says mom why’d you cut the butt

off the hand when you’re making a ham

she says I don’t know my mom always did

it so they called grandma my grandma

says my pan was too small

so it’s not that we have good words and

bad words we have a pan that’s too small

you know that ham but is delicious

there’s no reason to throw it away the

bad words see when people think about a

place and they don’t find a place on the

map they think this map sucks when they

find a nice spot or a bar and it’s not

the guidebook they’re like ooh this must

place must be cool it’s not in the

guidebook when they find a word that’s

not in the dictionary they think this

must be a bad word why it’s more likely

to be a bad dictionary why are you

blaming the ham for being too big for

the pan so you can’t get a smaller ham

the English language is as big as it is

so if you have a ham but problem and

you’re thinking about the hand but

problem the conclusion that it leads you

to is inexorable and counterintuitive

paper is the enemy of words how can this

be I mean I love books I really love

books some of my best friends are books

but

the book is not the best shape for the

dictionary they’re like oh my people are

going to take away my my my beautiful

paper dictionaries no there will still

be paper dictionaries when we had cars

when cars became the dominant mode of

transportation

we didn’t round up all the horses and

shoot them you know there’s still going

to be paper dictionaries but it’s not

going to be the dominant cake dictionary

the book-shaped dictionaries not going

to be the only shape dictionaries come

in and it’s not going to be the

prototype for the shapes dictionaries

come in so think about it this way if

you have an artificial constraint

artificial constraints lead to arbitrary

distinctions and a skewed worldview

what if biologists could only study

animals that made people go oh right

what if we made aesthetic judgments

about animals and only the ones we

thought were cute were the ones that we

could study we know a whole lot about

charismatic megafauna and not very much

about much else and I think this is a

problem I think we should study all the

words because when you think about words

you can make beautiful expressions from

very humble parts lexicography is really

more about material science we are

studying the tolerances of the materials

that you use to build the structure of

your expression your speeches and your

writing and then often people say to me

well okay how do I know that this word

is real that I think okay if we think

words are the tools that we use to build

the expressions of our thoughts how can

you say that screwdrivers are better

than hammers how can you say that a

sledgehammer is better than a ball-peen

hammer

they’re just the right tool for the job

and some people say to me how do I know

if a word is real you know anybody who’s

read a children’s book knows that love

makes things real if you love a word use

it that makes it real being in the

dictionary is an artificial

it doesn’t make a word any more real

than any other real any other way if you

love a word it becomes real so if we’re

not worrying about directing traffic if

we’ve transcended paper if we are

worrying less about control and more

about description then we can think of

the English language as being this

beautiful mobile and any time one of

those little parts of the mobile changes

is touched anytime you touch a word you

use it in a new context you give it a

new connotation you verb it you make the

mobile move you didn’t break it it’s

just in a new position and that new

position can be just as beautiful now if

you’re no longer a traffic cop the

problem with being a traffic cop is that

there can only be so many traffic cops

in any one intersection or the cars get

confused right but if your goal is no

longer to direct the traffic but maybe

to count the cars that go by then more

eyeballs are better you can ask for help

if you ask for help

you get more done and we really need

help

Library of Congress 17 million books of

which half are in English if only one

out of every 10 of those books had a

word that’s not in the dictionary in it

that would be equivalent to more than

two unabridged dictionaries and I find

an undeclared word like a word like on

dictionary for example in almost every

book I read what about newspapers

newspaper archive goes back to 1759

fifty-eight point 1 million newspaper

pages if only one in 100 of those pages

had an undeclared word on it it would be

an entire other OAD that’s a more than

500 thousand more words so that’s that’s

a lot I don’t really been talking about

magazines I’m not talking about blogs

and I find more new words on Boing Boing

in a given week than I do in newsmaker

time there’s a lot going on there and

I’m not even talking about pol SME which

is the greedy habit some words have of

taking more than one meaning for

themselves

so if you think of the word set a set

can be a Badgers burrow a set can be one

of the pleats and an Elizabethan rough

and there’s one number definition in the

OED the OED has 33 different numbered

definitions for set tiny little word 33

numbered definitions one of them is just

labeled miscellaneous technical senses

you know do you know what that says to

me that says to me it was Friday

afternoon and somebody wanted to go down

the pub that’s a lexicographical cop-out

to say miscellaneous technical senses so

we have all these words and we really

need help and the thing is we can we can

ask for help ask you for helps not that

hard I mean lexicography is not rocket

science

see I just gave you a lot of words and a

lot of numbers and this is more of a

visual explanation if we think of the

dictionary as being the map of the

English language these bright spots are

what we know about and the dark spots

are where we are in the dark if that was

the map of all the words in American

English we don’t know very much and we

don’t even know the shape of the

language if this was the dictionary if

this with the map of American English

look we have a kind of lumpy idea of

Florida but there’s no California we’re

missing California from American English

we just don’t know enough and we don’t

even know that we’re missing California

we don’t even see that there’s a gap on

the map so I’m again lexicography is not

rocket science but even if it were

rocket science is being done by

dedicated amateurs these days you know

it can’t be that hard to find some words

so now scientists in other disciplines

are really asking people to help and

they’re doing a good job of it for

instance there’s eBird where amateur

bird watchers can upload information

about their bird sightings and then

ornithologist can go and to help track

populations migrations etc and there’s

this guy Mike Oates my Coates lives in

the UK he’s a director of an

electroplating company he’s found more

than 140 comets he’s got so many comets

they named a comment after him it’s kind

of out past Mars it’s a hike I don’t

think he’s getting his picture taken

there any time soon but he found 104

Komets without a telescope he downloaded

data from the NASA Soho satellite and

that’s how he found them we can find

comments without a telescope shouldn’t

we be able to find words now y’all know

where I’m going with this because I’m

going to the internet which is where

everybody goes and the Internet is great

for collecting words because the

Internet’s full of collectors and this

is a little-known technological fact

about the internet but the Internet is

actually made up of words and enthusiasm

and words and enthusiasm actually

happened to be the recipe for

lexicography isn’t that great

so the problem there are a lot of good

word collecting sites out there now but

the problem with some of them is that

they’re not scientific enough they show

the word but they don’t show any context

where did it came from who said it what

newspaper was it in what book because a

word is like an archaeological artifact

if you don’t know the provenance or the

source of the artifact it’s not science

it’s a pretty thing to look at so a word

without its source is like a cut flower

you know it’s pretty to look out for a

while but then it dies it dies too fast

so this whole time I’ve been saying the

dictionary though a dictionary the

dictionary not a dictionary or

dictionaries and that’s because well

people use the dictionary to stand for

the whole language they use it cynically

and one of the problems of knowing a

word like Sanok Dhaka Kelly is that you

really want an excuse to say Sanok

topically and so this whole talk has

just been an excuse to get me to the

point where I could say syntactically to

all of you so I’m really sorry but when

you use a part of something like the

dictionary is a part of the language or

a flag stands for the United States it’s

a symbol of the country then you’re

using it cynically but thing is we could

make the dictionary the whole language

if we get a bigger pan then we can put

all the words in we can put in all the

meanings doesn’t everybody want more

meaning in their lives

and we can make the dictionary not just

be a symbol of a language we can make it

be the whole language and you see what

I’m really hoping for is that my son who

turns seven this month I want him to

barely remember that this is the form

factor that dictionaries used to come in

this is what dictionaries used to look

like I want him to think of this kind of

dictionary as an 8-track tape it’s a

format that died because it wasn’t

useful enough it wasn’t really what

people needed and the thing is if we can

put in all the words no longer have that

artificial distinction between good and

bad we can really describe the language

like scientists we could leave the

aesthetic judgments to the writers and

the speakers if we can do that then I

can spend all my time fishing and I

don’t have to be a traffic cop anymore

thank you very much for your kind

attention

现在我有你们中的任何一个人曾经

在字典中查找过这个你知道的词,但是

是的,这就是我的想法,你知道这个词怎么样,

我会向你展示

词典编纂词典的做法,

因为我们非常

具体来说,编译

字典不是从

一块花岗岩上雕刻出来的,它是

由许多小块组成的

,拼写为 D 是 CR e te bits

,这些小块是现在的

单词之一 作为一名词典编纂者,

除了来到 Ted 之外,你还

可以说像词典词典这样非常有趣的词

词典词典有这种

很棒的模式被称为双 dactyl

并且仅仅通过说双 dactyl 我已经

把极客针一直送进

了红色

但是词典编纂

与 higgledy-piggledy 是相同的模式,这是一个有趣的

词,我现在可以说很多

,作为一个词典编纂者的非特权之一

是人们

通常没有一种温暖的模糊

依偎的形象 的 字典对

没有人拥抱他们的字典,但

人们真正经常想到的

字典是他们这样想

只是为了让你知道我没有

字典哨,但人们

认为我的工作是让好词

变得困难 手

翻字典,把坏词

拒之门外,但问题是我不想

当交通警察,一方面我

不穿制服,另一方面决定

什么词好什么词

坏实际上不是 很容易,

也不是很有趣,当你的部分工作不容易或不有趣时,你会找

借口不去做,所以如果我不得不

把某种职业

作为我工作的隐喻,我更愿意

做一个渔夫 我想把我的大

网扔进深蓝色的英语海洋

,看看我能从海底拖上什么奇妙的生物

,但为什么

人们要我指挥交通,而我

宁愿去钓鱼,

我却责怪女王 那么为什么 我要怪女王吗?

首先我怪

女王,因为这很有趣,其次我

怪女王,因为

字典真的没有改变我们对字典是什么的看法

自她统治以来没有改变

维多利亚女王唯一不会成为的东西 在

现代词典中被滥用的是我们包含了

自 1965 年以来在美国词典中出现的 f 字,

所以这个人是维多利亚时代的

James Murray 在牛津英语词典之后首先出现的

我没有那

顶帽子我希望我有那顶帽子所以他是 真正

对我们

今天在字典中认为现代的许多内容

负责 曲线,

当然人们都说好吧 计算机

计算机 计算机怎么样 计算机的事情

是我爱计算机 我的

意思是我是一个巨大的极客 我爱计算机 我

会去 在我让

他们从我这里拿走谷歌图书搜索之前绝食,

但是计算机

除了加快编译字典的过程之外没有做太多其他事情,

它们不会

改变最终结果,因为字典

是什么,它融合了维多利亚时代的设计

一点点现代推进力 它的

蒸汽朋克 我们拥有的是电动

自行车 你知道我们有维多利亚时代的

设计,上面有一个引擎,所有

的设计都没有改变,好吧,

在线词典怎么样,在线

词典必须不同,这

是牛津英语

在线词典 最好的在线词典之一 这是

我最喜欢的词

实际条目中有多少个链接

那些小按钮,我把它们都

展开了,除了日期图表,

所以不是很 这里

发生了很多事情 没有很多点击 enos

实际上在线词典复制了

几乎所有的印刷问题,

除了搜索能力,当你

提高搜索能力时,你实际上

带走了印刷的一个优势,即

偶然性 偶然性就是当你找到

你没有在寻找的东西,因为现在

找到你正在寻找的东西是如此

的困难,当你想到

这一点时,我们这里有一只手,但

问题是每个人都知道这只手,但

问题女人正在为一个大家庭聚餐做一只手

去砍掉锤子的屁股

把它扔掉 她

看着这块火腿 她

觉得这是一块非常好的火腿

为什么我要把它扔掉 停好 我

妈妈总是这样做 所以她称它为

妈妈 她说 妈妈

,你做火腿的时候为什么要把屁股从手上切下来

她说我不知道我妈妈总是这样做

,所以他们打电话给奶奶,我奶奶

说我的锅

太小了,不是我们有好话

我们有一个太小的平底锅

你知道火腿但是很好吃

没有

理由扔掉 当他们

找到一个不错的地方或酒吧,而这

不是指南时,他们会喜欢 哦,这个

地方一定很酷,它不在

指南中 当他们发现

字典中没有的词时,他们认为这

一定是一个坏词,为什么它是 更有

可能是一本糟糕的字典为什么你要

责怪火腿对锅来说太大

了所以你不能得到一个更小的火腿如果你有一个火腿但是有问题你就不能得到一个更小的

火腿英语

思考手但

问题 它导致你的结论

是无情的和违反直觉的

纸是文字的敌人 这怎么

可能 我的意思是我喜欢书 我真的很喜欢

书 我最好的一些朋友是书,

但书不是最好的

字典的形状他们就像哦,我的人

会拿走我我漂亮的

纸质词典不,

当我们有汽车时,仍然会有纸质词典

当汽车成为主要的

交通工具时,

我们没有围捕所有的马并

射杀它们,你知道仍然

会有纸质的 字典,但它

不会成为占主导地位的蛋糕

字典书本形字典

不会成为唯一的形状

字典,它也不会

成为形状

字典的原型,所以如果你有一个这样的想法

人为约束

人为约束导致任意

区分和歪曲的

世界观 如果生物学家只能研究

那些让人们走对

的动物

会怎么样

很多关于

魅力超凡的巨型动物,而不是

太多关于其他的,我认为这是一个

问题,我认为我们应该研究所有的

词 因为当你思考单词时,

你可以从

非常不起眼的部分做出漂亮的表达词典实际上

更多的是关于材料科学我们正在

研究

你用来构建表达结构的材料的公差

你的演讲和

写作然后人们经常说 对我来说

好吧 好吧 我怎么知道这个词

是真实

大锤比圆

头锤好,

它们是完成这项工作的合适工具

,有些人对我说,我

怎么知道一个词是不是

真实的 喜欢一个词 使用

它使它成为真实 在

字典中是一种人造的

它不会使一个词比任何其他真实的词更真实

如果你

喜欢一个词它就会变得真实 所以如果我们

不是 如果

我们已经超越了纸质,那么担心引导流量如果我们

不那么担心控制而更多地

担心描述,那么我们可以

将英语视为这款

漂亮的手机,并且任何时候只要您触摸

到手机的这些小部分中的任何一个

变化 触摸一个词 你

在一个新的语境中使用它 你给它一个

新的内涵 你动词它 你做出了

移动 你没有破坏它 它

只是在一个新的位置,如果

你是 不再是交通警察

成为交通警察的问题在于,

在任何一个十字路口只能有这么多交通警察,否则汽车会

混淆,但是如果您的目标

不再是指挥交通,而是可能

要计算那些 到那时,更多的

眼球更好 你可以寻求帮助

如果你寻求帮助

你会完成更多工作,我们真的需要

帮助

国会图书馆 1700 万本书,

其中一半是英文的,如果

每 10 本书中只有一本 有一个

不在字典中的词

,相当于

两个以上未删节的字典,我发现

一个未声明的词

,例如字典上的词,例如在

我读的几乎每一本书中,报纸

报纸档案可以追溯到 1759 年

50 年 - 8 点 100 万张报纸

页面,如果其中只有 100 个页面上

有一个未声明的字,那将是

一个完整的其他 OAD,超过

50 万个字,所以这

就是很多我并没有真正谈论

杂志 我不是在谈论博客

,我在给定的一周内在 Boing Boing 上发现的新词

比在新闻制作人

时间里发现的要多,那里发生了很多事情,

我什至没有谈论 pol SME,这

是某些词的贪婪习惯

为自己赋予多个含义,

因此如果您想到 set 一词,那么一组

可以是 Badgers burrow,一组可以

是褶皱之一和伊丽莎白时代

的原石,并且 OED 中有一个数字定义,

OED 有 33 种差异

为设置的小词租编号定义 33 个

编号定义 其中一个只是

标记为杂项技术意义

你知道吗 你知道这对

我说什么 对我说这是星期五

下午 有人

想去酒吧 那是一个词典警察

  • 说杂项技术意义,所以

我们有所有这些词,我们真的

需要帮助,问题是我们可以

寻求帮助 向你寻求帮助并不那么

难我的意思是词典编纂不是火箭

科学,

我只是给了你很多 单词和

大量数字,

如果我们将

字典视为英语语言的地图,

这更像

是一种视觉解释 是

美国英语中所有单词的地图,

我们不太了解,我们

甚至不知道语言的形状

如果这是字典如果

这是与美国英语的地图一起

看,我们有一种 佛罗里达州的模糊概念,

但没有加利福尼亚我们

在美国英语中错过了加利福尼亚

我们只是不够

了解,我们甚至不知道我们正在错过加利福尼亚

我们甚至没有看到地图上的差距

所以 我再次重申,词典编纂不是

火箭科学,但即使是

火箭科学,

这些天也是由敬业的业余爱好者完成的,你

知道找到一些词并不难,

所以现在其他学科的科学家

真的在寻求人们的帮助和

他们做得很好,

例如有 eBird,业余

观鸟者可以在其中上传有关观鸟的信息

,然后

鸟类学家可以去帮助跟踪

种群迁徙等,还有

这个人 Mike Oates 我的科茨住

在英国,他是 一家

电镀公司的主管 他发现了

140 多颗彗星 他有这么多彗星

他们以他的名字命名了一条评论

有点过了火星 这是一次徒步旅行 我不

认为他在那里拍照

我很快,但他在

没有望远镜的情况下找到了 104 个 Komets 他

从美国宇航局 Soho 卫星下载了数据,

这就是他找到它们的方法 我们可以在

没有望远镜的情况下找到评论

我们现在不应该找到文字吗你们都

知道我要去哪里 之所以这样,是因为我

要去互联网,这是

每个人都会去的地方,互联网非常

适合收集文字,因为

互联网上到处都是收集者,这

是关于互联网的一个鲜为人知的技术事实

,但互联网

实际上是由文字组成的 和热情

,文字和热情实际上

恰好是

词典编纂的秘诀并不是那么好,

所以问题是现在有很多好的

单词收集网站,但

其中一些的问题是

它们不够科学,他们 显示

这个词,但他们没有显示任何上下文

它是从哪里来的

的神器的来源它不是科学

它看起来很漂亮所以一个

没有它的来源的词就像一朵切花

你知道它很漂亮看

一会儿但它死了它死得太快

所以这整个时间我 一直在说

字典虽然

字典不是字典或

字典,那是因为

人们使用字典来代表

他们玩世不恭地使用它的整个语言

,而认识

像 Sanok Dhaka Kelly 这样的单词的问题之一是你

真的

想找个借口说

Sanok 语言的一部分

或国旗代表美国,它

是国家的象征,然后你就

玩世不恭地使用它,但问题是我们可以

让字典成为整个语言,

如果我们得到一个更大的平底锅,那么我们可以把

所有的 我们可以把所有的

意思都表达出来 不是每个人都希望

他们的生活

有更多的意义吗?我们可以让字典

不仅仅是一种语言的象征,我们可以让它

成为整个语言,你明白

我真正希望的是什么 因为

我儿子这个

月就七岁了 磁带它是一种

已经消亡的格式,因为它

不够有用,它并不是

人们真正需要的,问题是如果我们

能把所有的词都放进去,不再有

好坏之间的人为区别,

我们就可以真正描述这种语言

像科学家一样,我们可以把

审美判断留给作家

和演讲者,如果我们能做到,那么

我可以把所有的时间都花在钓鱼上,我

不必再当交通警察了

,非常感谢你的

关注