Not all scientific studies are created equal David H. Schwartz

Studies have shown that

taking vitamins is good for your health

and bad for your health.

That newly discovered herb can improve your memory

or destroy your liver.

Headlines proclaim a promising new cancer treatment

and never mention it again.

On a daily basis,

we are bombarded with attention-grabbing news,

backed up by scientific studies,

but what are these studies?

How are they performed?

And how do we know whether they’re reliable?

When it comes to dietary or medical information,

the first thing to remember

is that while studies on animals or individual cells

can point the way towards further research,

the only way to know how something will affect humans

is through a study involving human subjects.

And when it comes to human studies,

the scientific gold standard is

the randomized clinical trial, or RCT.

The key to RCTs is that the subjects are randomly assigned

to their study groups.

They are often blinded to make them more rigorous.

This process attempts to ensure

that the only difference between the groups

is the one the researchers are attempting to study.

For example,

when testing a new headache medication,

a large pool of people with headaches

would be randomly divided into two groups,

one receiving the medication

and another receiving a placebo.

With proper randomization,

the only significant overall difference

between the two groups

will be whether or not they received the medication,

rather than other differences that could affect results.

Randomized clinical trials are incredible tools,

and, in fact, the US Food and Drug Administration

often requires at least two to be conducted

before a new drug can be marketed.

But the problem is that an RCT is not possible

in many cases,

either because it’s not practical

or would require too many volunteers.

In such cases,

scientists use an epidemiological study,

which simply observes people going about their usual behavior,

rather than randomly assigning active participants

to control invariable groups.

Let’s say we wanted to study

whether an herbal ingredient on the market

causes nausea.

Rather than deliberately giving people something

that might make them nauseated,

we would find those who already take the ingredient

in their everyday lives.

This group is called the cohort.

We would also need a comparison group

of people who do not have exposure to the ingredient.

And we would then compare statistics.

If the rate of nausea is higher in the herbal cohort,

it suggests an association

between the herbal supplement and nausea.

Epidemiological studies are great tools

to study the health effects of almost anything,

without directly interfering in people’s lives

or assigning them to potentially dangerous exposures.

So, why can’t we rely on these studies

to establish causal relationships

between substances and their effects on health?

The problem is

that even the best conducted epidemiological studies

have inherent flaws.

Precisely because the test subjects

are not randomly assigned to their groups.

For example, if the cohort in our herbal study

consisted of people who took the supplement

for health reasons,

they may have already had higher rates of nausea

than the other people in the sample.

Or the cohort group could’ve been composed of

people who shop at health food stores

and have different diets

or better access to healthcare.

These factors that can affect results,

in addition to the factor being studied,

are known as confounding variables.

These two major pitfalls,

combined with more general dangers,

such as conflicts of interest or selective use of data,

can make the findings of any particular epidemiological study suspect,

and a good study must go out of its way

to prove that its authors have taken steps

to eliminate these types of errors.

But even when this has been done,

the very nature of epidemiological studies,

which examine differences between preexisting groups,

rather than deliberately inducing changes within the same individuals,

means that a single study

can only demonstrate a correlation

between a substance and a health outcome,

rather than a true cause and effect relationship.

At the end of the day,

epidemiological studies have served as excellent guides to public health,

alerting us to critical health hazards,

such as smoking, asbestos, lead, and many more.

But these were demonstrated through

multiple, well-conducted epidemiological studies,

all pointing in the same direction.

So, the next time you see a headline

about a new miracle cure

or the terrible danger posed by an everyday substance,

try to learn more about the original study

and the limitations inherent in any epidemiological study or clinical trial

before jumping to conclusions.

研究表明,

服用维生素对您的健康有益

,对您的健康有害。

这种新发现的草药可以改善你的记忆力

或破坏你的肝脏。

头条新闻宣布了一种有前途的新癌症治疗方法,

并且再也没有提及它。

每天,

我们都会被引人注意的新闻轰炸,这些新闻

有科学研究的支持,

但这些研究是什么?

它们是如何执行的?

我们怎么知道它们是否可靠?

谈到饮食或医学信息时

,首先要记住的

是,虽然对动物或单个细胞的研究

可以为进一步研究指明方向,

但了解某事物将如何影响人类的唯一方法

是通过涉及人类受试者的研究。

在人体研究方面

,科学的黄金标准

是随机临床试验或 RCT。

RCT 的关键是受试者被随机分配

到他们的研究组。

他们经常被蒙蔽以使他们更加严格。

这个过程试图确保

组之间的唯一区别

是研究人员试图研究的那个。

例如,

在测试一种新的头痛药物时,

一大群头痛患者

将被随机分为两组,

一组接受药物治疗

,另一组接受安慰剂治疗。

通过适当的随机化

,两组之间唯一显着的总体差异

将是他们是否接受了药物治疗,

而不是可能影响结果的其他差异。

随机临床试验是令人难以置信的工具

,事实上,美国食品和药物管理局

通常要求至少进行两次试验

才能将一种新药推向市场。

但问题是,

在许多情况下,RCT 是不可能的,

要么因为它不实用,

要么需要太多的志愿者。

在这种情况下,

科学家们使用流行病学研究,

它只是观察人们的日常行为,

而不是随机分配积极的参与者

来控制不变的群体。

假设我们想

研究市场上的一种草药成分是否

会引起恶心。

与其故意给人们一些

可能使他们恶心的东西,

我们会发现那些已经

在日常生活中服用这种成分的人。

这个组被称为队列。

我们还需要一组

没有接触过这种成分的人。

然后我们将比较统计数据。

如果草药组的恶心率较高,

则表明

草药补充剂与恶心之间存在关联。

流行病学研究是

研究几乎任何事物对健康影响的重要工具,

不会直接干扰人们的生活

或将他们分配给潜在的危险暴露。

那么,为什么我们不能依靠这些研究

来建立

物质及其对健康影响之间的因果关系呢?

问题是

,即使是进行得最好的流行病学研究

也存在固有缺陷。

正是因为测试对象

不是随机分配到他们的组中的。

例如,如果我们的草药研究中的队列

由因健康原因服用补充剂的人组成

那么他们的恶心率可能已经

高于样本中的其他人。

或者,队列组可能由

在健康食品商店购物

并有不同饮食

或更好地获得医疗保健的人组成。

除了正在研究的因素之外,这些可能影响结果的因素

被称为混杂变量。

这两个主要陷阱,

再加上更普遍的危险,

例如利益冲突或选择性使用数据,

可能会使任何特定流行病学研究的结果令人怀疑

,一项好的研究必须不遗余力

地证明其作者已经采取了

消除这些类型的错误的步骤。

但即使这样做了,

流行病学研究的本质

是检查先前存在的群体之间的差异,

而不是故意在同一个体内部引起变化,这

意味着一项研究

只能证明

物质与健康结果之间的相关性,

而不是真正的因果关系。

归根结底,

流行病学研究已成为公共卫生的极好指南,

提醒我们注意严重的健康危害,

例如吸烟、石棉、铅等等。

但这些都通过

多项开展良好的流行病学研究得到证明,

所有研究都指向同一个方向。

因此,下次当您看到

有关新的灵丹妙药

或日常物质造成的可怕危险的标题时,请

尝试更多地了解原始研究

以及任何流行病学研究或临床试验中固有的局限性,

然后再下结论。