Did Shakespeare write his plays Natalya St. Clair and Aaron Williams

“Some are born great,

some achieve greatness,

and others have greatness thrust
upon them”, quoth William Shakespeare.

Or did he?

Some people question whether Shakespeare
really wrote the works that bear his name,

or whether he even existed at all.

They speculate that Shakespeare
was a pseudonym for another writer,

or a group of writers.

Proposed candidates
for the real Shakespeare

include other famous playwrights,
politicians and even some prominent women.

Could it be true that the greatest writer
in the English language

was as fictional as his plays?

Most Shakespeare scholars
dismiss these theories

based on historical
and biographical evidence.

But there is another way to test
whether Shakespeare’s famous lines

were actually written by someone else.

Linguistics, the study of language,

can tell us a great deal about the way
we speak and write

by examining syntax, grammar,
semantics and vocabulary.

And in the late 1800s,

a Polish philosopher
named Wincenty Lutosławski

formalized a method known as stylometry,

applying this knowledge to investigate
questions of literary authorship.

So how does stylometry work?

The idea is that each writer’s style
has certain characteristics

that remain fairly uniform
among individual works.

Examples of characteristics include
average sentence length,

the arrangement of words,

and even the number of occurrences
of a particular word.

Let’s look at use of the word thee
and visualize it as a dimension, or axis.

Each of Shakespeare’s works
can be placed on that axis,

like a data point, based on the number
of occurrences of that word.

In statistics, the tightness
of these points

gives us what is known as the variance,
an expected range for our data.

But, this is only a single characteristic
in a very high-dimensional space.

With a clustering tool
called Principal Component Analysis,

we can reduce the multidimensional space
into simple principal components

that collectively measure the variance
in Shakespeare’s works.

We can then test the works
of our candidates

against those principal components.

For example,

if enough works of Francis Bacon
fall within the Shakespearean variance,

that would be pretty strong evidence

that Francis Bacon and Shakespeare
are actually the same person.

What did the results show?

Well, the stylometrists who carried
this out have concluded

that Shakespeare is none other
than Shakespeare.

The Bard is the Bard.

The pretender’s works just don’t match up
with Shakespeare’s signature style.

However, our intrepid
statisticians did find

some compelling evidence
of collaborations.

For instance, one recent study concluded

that Shakespeare worked with playwright
Christopher Marlowe on “Henry VI,”

parts one and two.

Shakespeare’s identity is only one of
the many problems stylometry can resolve.

It can help us determine
when a work was written,

whether an ancient text is a forgery,

whether a student has committed plagiarism,

or if that email you just received
is of a high priority or spam.

And does the timeless poetry
of Shakespeare’s lines

just boil down to numbers and statistics?

Not quite.

Stylometric analysis may reveal what makes
Shakespeare’s works structurally distinct,

but it cannot capture the beauty of
the sentiments and emotions they express,

or why they affect us the way they do.

At least, not yet.

“有些人天生伟大,

有些人成就伟大,

而另一些人
则将伟大强加于他们”,威廉·莎士比亚说。

还是他?

有人质疑莎士比亚是否
真的写了以他的名字命名的作品,

或者他是否真的存在过。

他们推测莎士比亚
是另一位作家

或一群作家的化名。

真正莎士比亚的提名候选人

包括其他著名剧作家、
政治家,甚至一些杰出的女性。

最伟大

英语作家真的像他的戏剧一样虚构吗?

大多数莎士比亚学者

基于历史
和传记证据驳斥了这些理论。

但是还有另一种方法可以
检验莎士比亚的名言

是否真的是别人写的。

语言学,对语言的研究,

可以

通过检查句法、语法、
语义和词汇来告诉我们很多关于我们说话和写作的方式。

在 1800 年代后期,

一位名叫 Wincenty Lutosławski 的波兰哲学家

正式提出了一种称为文体测量法的方法,

将这种知识应用于
调查文学作者身份问题。

那么文体测量是如何工作的呢?

这个想法是每个作家的风格
都具有某些特征,这些特征在个人作品

中保持相当一致

特征的示例包括
平均句子长度、

单词的排列,

甚至
特定单词的出现次数。

让我们看一下 thee 这个词的用法,
并将其可视化为一个维度或轴。

莎士比亚的每一部作品
都可以放在那个轴上,

就像一个数据点,基于
那个词的出现次数。

在统计学中,
这些点的紧密程度

为我们提供了所谓的方差,
即我们数据的预期范围。

但是,这只是
一个非常高维空间中的一个特征。

使用
称为主成分分析的聚类工具,

我们可以将多维空间
简化为简单的主成分

,这些主成分共同衡量
莎士比亚作品中的方差。

然后,我们可以针对这些主要组件测试
我们的候选人的作品

例如,

如果弗朗西斯培根的足够多的作品都
属于莎士比亚的差异,

那将

是弗朗西斯培根和
莎士比亚实际上是同一个人的有力证据。

结果表明了什么?

好吧,执行此操作的测字师
得出的结论

是,莎士比亚就是
莎士比亚。

吟游诗人就是吟游诗人。

伪装者的作品
与莎士比亚的标志性风格不符。

然而,我们勇敢的
统计学家确实发现了

一些令人信服
的合作证据。

例如,最近的一项研究得出

结论,莎士比亚与剧作家
克里斯托弗·马洛在《亨利六世》

第一和第二部分合作。

莎士比亚的身份只是
文体学可以解决的众多问题之一。

它可以帮助我们
确定作品的创作时间、

古代文本是否是伪造的、

学生是否抄袭,

或者您刚刚收到的电子邮件
是否属于高优先级或垃圾邮件。

莎士比亚诗句中永恒
的诗句是否

可以归结为数字和统计数据?

不完全的。

文体分析可能揭示
莎士比亚作品在结构上与众不同的原因,

但它无法捕捉到
它们所表达的情感和情感之美,

或者它们为什么以它们的方式影响我们。

至少,现在还没有。