Collaboration has no limits

exactly a year ago

when india had reported its 100th covert

positive case

a user on reddit posted this to the

site’s r

india subreddit the anonymous user

described that they were preparing a

database on google sheets

to collect as much information as

possible regarding the transmission of

the coronavirus

in india i was mindlessly doomed

scrolling through my news feed

when i first discovered this post and

this one sentence really bothered me

doing this from the start of the

epidemic will be valuable data in the

future

what a weird way to spend time doing

quarantine right

but i opened the spreadsheet and it

looked fairly simple

100 of transmission data right from the

first reported case on january 30th

2020 to then present hundredth case

each row was tagged with the source

which was either a state bulletin

or a press release reporting on behalf

of medical institutions

since every transmission was recorded

adding up all the numbers gave an

up-to-date overview

of the number of people across four

different categories

confirmed with those who tested positive

active were those who were actively

carrying the virus

recovered with those who were immune to

the virus

and deceased were those who fought

really really hard

but unfortunately couldn’t make it

these calculations were automated and

stored in the second sheet

in the third sheet there was a breakdown

of these statistics

group by states and in the fourth sheet

there were instructions for newcomers to

enter new data

wait a minute so anybody on the internet

could potentially modify the contents of

the sheet

now i don’t know about you guys but this

is what i was expecting to happen to the

page in just a few hours

but i gave it a shot i read the

instructions

and there was a call for volunteers to

help with data entry

and at the bottom there was a link to

join a group chat

it was the weekend in the middle of a

worldwide quarantine

so i figured why not spare some time and

help with whatever this person was

trying to accomplish

so i hopped onto the group chat and very

quickly realized that

i wasn’t the only person who walked in

hoping to help

there were about 300 people already in

there

talking about all the different ways

each one of them could help

i noticed a pinned message at the top

summarizing the roadmap

and describing the kind of help required

to maintain this initiative for the next

coming days

i didn’t know anybody in the group and

i am pretty sure nobody knew each other

as well

but people just seemed to pick whatever

they could help with from the list

and branched out to smaller groups that

were focused on a single type of work

at that time these were data operations

and web development

in the data operations group people

volunteered to keep an eye on press

releases

from various sources and share them with

the group whenever there was an update

after verifying the source someone would

then document it on the sheet

a few others would then double check the

new entry to make sure they had gotten

it right

and the updates would then get published

similarly in the web development group

people volunteered to keep an eye on the

website source code

whenever people found bugs on the

website they would file issues

and describe what was acting weird after

figuring out a solution

someone would then update the code that

fixed the bug

a few others would then double check the

updated code to make sure they too had

gotten it right

and the changes would then be reflected

on the website

pretty straightforward right so when i

joined the kovan 19 outbreak was in its

early phase of the pandemic

the data operations group made about 10

updates every day

and i found that to be super

overwhelming the web development group

they were talking about fixing bugs and

adding new things to the website

so i dropped them a message saying hey

i could help with that over the weekend

the team came together and made a lot of

improvements to the code base

one of the things that we agreed to from

the very beginning

was to keep things as simple as possible

we knew that this was going to be a

collaborative crowdsourced effort

so we believe that using simpler tools

that a good majority of people already

knew

or perhaps could be learned easily would

make it a more inclusive space for

anybody to walk in and participate so

with this philosophy in mind

this was how the tech infrastructure

looked like

we stuck with google sheets and used it

as a primary database

anybody could easily create read

update and delete things off the

spreadsheet

yep we actually use google sheets as our

database

we redesigned the website on top of an

open source framework called react.js

that made it easier for people to create

and share interactive components without

worrying too much about how the rest of

the website worked

everyone from seasoned developers to

absolute beginners

created maps and graphs and were able to

easily plug them into the website

finally every 10 minutes a script took

snapshots of all the sheets

converted them into structured data and

published the data to this url

whenever anyone visited the website

it referred to the data present in this

url and populated the page with the

latest aggregated statistics

this helped to keep the project’s total

expenditure

to about zero rupees and allowed 300

volunteers to collaborate

without any friction in the process at

the start of the week we began to see

over 8000 people

visiting the website in just a single

day that’s

a lot of people and as much as we were

worried about scaling the tech

infrastructure

to support 300 volunteers none of us had

any previous experiences

with scaling reliability or

accountability

to the general public one trivial bug

and you could end up causing chaos at a

time when it’s crucial for

everybody to remain calm in

less than 24 hours six million people

were on the front page to keep up with

the numbers

that brought in a lot of questions on

social media like is this official

who are you guys how are you doing this

by the way

in no time a new bunch of incoming

volunteers

branched out to take care of social

media we created an account representing

the initiative

reached out to as many people as

possible and clarified with answers

since all the work done was transparent

in open source

it helped people vouch for their

credibility whenever we made mistakes

people were quick to correct us this

very quickly grew into a medium of

communications between the volunteers

and the general public they were able to

run multiple awareness campaigns

data analysis features in qa sessions

with experts

to bring a scientific and data-driven

approach to the situation

questions then turned into where can i

find a testing center in mother

what’s the curfew time in chennai i’d

like to volunteer how do i get started

social media can be incredibly volatile

when talking about a subject like

cover 19 which stands at the

intersection of health

data politics and all the emotional

uncertainties that was brought about by

the lockdown

however helplessness and vulnerability

to the crisis

drove a lot of these questions

withholding updates

only feared uncertainty fear

and even the spread of dangerous

misinformation

being transparent empathetic and

objective with their communications

significantly help people

be at ease soon many joined together

and kept us on our doors by tagging us

to press releases

and eventually many cheered us on to

keep the good work going

a couple of weeks in the energy was

still running high and many more still

joined the group

hoping to help these were some of the

things that were made possible

essentials help people find the nearest

testing

centers foot banks shelters

open supermarkets etc localization

people from different states came

together and translated the website into

these many regional languages

behind the scenes the group chat also

opened up conversations and

collaborations

between researchers journalists

economists and many more

while development had run its course and

had eventually started to ease down a

little

the real work had begun for the folks in

the data operations group

they were absolutely floored with the

rise in the number of people testing

positive

and fighting the virus to give you some

context

each of the 28 states and eight union

territories had separate channels of

communication

and reporting with varying degrees of

granularity

and structure even though the reporting

was half as hard

the team only had to record a handful of

cases to begin with

the months following may the team had

moved on from aggregating by states to

aggregating by districts

there are about 700 districts in india

and with four data points collected for

each

that’s about 2 800 data points collected

per day even though many districts and

states were unable to release all the

details

the data collected was still vast so

the team had to improvise on the fly by

re-evaluating their operations multiple

times

adapting to the changing nature of

primary sources and sometimes

even breaking the limits of google

sheets for instance

some of them even built custom big text

recognition scripts

to sift through multiple pdfs as they

came in and quickly turn them into

something they could make sense of

but at the end of the day regardless of

all the tech magic

and software gymnastics the final

entries were input by

volunteers in order to avoid any glaring

mistakes

it’s true that technology has made

collaborating with multiple people

easier than ever before however

computers don’t feel the heavy toll

it comes along with adding thousands of

rows to a spreadsheet

each roman there gave us a little

glimpse into the lives of thousands of

indians getting through this awful

pandemic

and given the grim nature of the

volunteers work

getting to some of the worst days wasn’t

easy

and i’m sure it wasn’t easy for you as

well

all of us entered a very painful period

in our country’s history

when over 50 000 people’s lives were at

stake

every day the economy was severely

tanking

people were losing their jobs and most

importantly

a lot of people were dying for some

they were friends and for some they were

family

we lost a lot of people along the way

and

that sucks but what surprises me the

most is that

people still showed up none of us in the

group had met each other before

and every interaction was online yet

there was the sense of familiarity

that brought us all together in the

midst of this very uneventful moment of

our lives

people checked in on each other across

different countries and time zones

and there existed this sense of empathy

care and selflessness things i

never would have thought this witness

happen over the spreadsheet

but it was there

row in there felt personal to each one

of us

at some point and that’s been the

invisible culture that has sustained

this initiative

for more than a year now today

marks one year since someone posted this

on the internet

hoping that doing this from the start of

this pandemic

would be valuable data in the future

throughout this journey

overnight in india.org has become a

significant source of information for

research

state governments and even the very

recently released

economic survey of india eventually

this initiative will have to stop which

is the goal

but until then we will continue to take

it

one day at a time over the last year

so many of you have helped us in so many

different ways

and we sincerely appreciate all the help

you’ve helped us believe

that collaboration has no limits

就在一年前,

当印度报告了其第 100 例隐蔽的

阳性病例时

,reddit 上的一位用户将其发布到该

网站的 r

india subreddit 上,匿名用户

描述说,他们正在

谷歌表格上准备一个数据库,

以收集尽可能多的

有关传播的信息。

印度的冠状病毒当我第一次发现这篇文章时,我注定要盲目地

滚动浏览我的新闻提要

这句话真的让我很困扰,

从流行病开始就这样做

将是未来的宝贵数据

花时间做隔离的方式真是太奇怪了

但我打开了电子表格,

从 2020

年 1 月 30 日的第一个报告病例

到现在的第 100 个病例,它看起来相当简单 100 个传输数据,

每一行都标有来源

,该来源是州公告

或代表代表报告的新闻稿

医疗机构,

因为每次传输都被记录下来

,所有数字

加起来提供了最新的概览 w

在四个不同类别的人数中,

与那些被检测为阳性的人确认

为积极

携带病毒的

人与那些对病毒免疫

并死亡的人是那些

非常

努力但不幸未能成功的人

这些计算是自动进行的,并

存储在第二张表

中,第三张表中按州

对这些统计数据进行了细分

,第四张表

中指示新手

输入新数据

稍等,因此互联网上的任何人

都可能会修改

表格的内容

现在我不了解你们,但这

是我期望

在短短几个小时内发生在页面上的内容,

但我试了一下,我阅读了

说明

,并呼吁志愿者提供

帮助 数据输入

,底部有一个加入群聊的链接,

那是周末在

全球隔离期间,

所以我想为什么不抽点时间 我和

这个人试图完成的任何事情都提供帮助,

所以我跳进了群聊,

很快意识到

我不是唯一一个走进来

希望帮助的

人,那里已经有大约 300 人在

谈论所有不同的方式

他们每个人都可以提供帮助

其他也是如此,

但人们似乎只是

从列表中挑选出他们可以提供帮助的任何东西,

并扩展到

当时专注于单一类型工作的较小小组,

这些

小组是人们自愿参与的数据运营小组中的数据运营和 Web 开发

密切关注

来自各种来源的新闻稿,并在核实来源后

有更新时与小组分享,

然后有人将其记录在案

然后其他一些人会仔细检查

新条目以确保他们

做对了

,然后更新将

在网络开发组中以类似方式发布,

人们自愿在人们发现错误时密切关注

网站源代码

网站,他们会提交问题

并描述在

找出解决方案之后发生了什么奇怪的

事情,然后有人会更新

修复错误

的代码,其他一些人会仔细检查

更新的代码,以确保他们也

做对了

,然后更改会 反映

在网站上

非常简单,所以当我

加入 kovan 19 爆发

处于大流行的早期阶段时,

数据运营组每天进行大约 10 次

更新

,我发现他们正在谈论

的 Web 开发组超级压倒

修复错误

并向网站添加新内容,

所以我给他们发了一条消息,说嘿,

我可以在周末提供帮助,

团队 c 聚在一起并对代码库进行了很多

改进

我们从一开始就同意的一件事

是让事情尽可能简单

我们知道这将是一个

协作的众包工作,

所以我们相信使用更简单的

大多数人已经

知道

或可能很容易学习的工具将

使它成为一个更具包容性的空间,

任何人都可以走进并参与,

因此牢记这一理念,

这就是

我们坚持使用谷歌表格并使用的技术基础设施的样子 它

作为主要数据库,

任何人都可以轻松地创建读取

更新并从

电子表格中删除内容

是的,我们实际上使用谷歌表格作为我们的

数据库,

我们在名为 react.js 的开源框架之上重新设计了网站

,使人们更容易创建

和 共享交互式组件,而

不必过多担心网站的其余部分是如何

工作的,

从经验丰富的开发人员到

绝对的初学者

创建了地图和图表,并且能够

轻松地将它们最终每 10 分钟插入到网站中,

一个脚本拍摄

所有工作表的快照,

将它们转换为结构化数据,并

在任何人访问该网站时将数据发布到该 URL,

它引用了存在的数据 这个

url 并在页面上填充了

最新的汇总统计数据

这有助于将项目的总

支出

保持在零卢比左右,并允许 300

名志愿者

在这个过程中没有任何摩擦地进行协作

。在本周开始,我们开始看到

超过 8000 人

访问 网站在

一天之内

就有很多人,尽管我们

担心扩展技术

基础设施

以支持 300 名志愿者,但我们之前没有任何人有

任何

扩展可靠性或

对公众负责的经验

一个微不足道的错误

,你可以结束 在

每个人在

不到 24 小时内保持冷静至关重要的时候造成混乱 600 万 n

人们在头版

关注数字,这些数字

在社交媒体上引发了很多问题,

比如这位官员是

谁,你们是谁?顺便说一下,你们是怎么做到的

为了照顾社交

媒体,我们创建了一个代表该倡议的帐户,该帐户

向尽可能多的人

传达了答案,

因为所做的所有工作

在开源中都是透明的,

它可以帮助人们

在我们犯错时保证

他们的可信度 纠正我们这

很快成为

志愿者和公众之间的沟通媒介,

他们能够

在与专家的 QA 会议中开展多项宣传活动数据分析功能,

从而为情况问题带来科学和数据驱动的方法,

然后变成 我在哪里可以

找到母亲的测试中心

钦奈的宵禁时间是几点 我

想当志愿者 我如何开始

社交 l

在谈论像第 19 期这样的主题时,媒体可能会非常不稳定,该主题

处于健康

数据政治和封锁

带来的所有情绪不确定性的交叉点,

但是无助和

对危机的脆弱性导致

了很多这样的问题

隐瞒了更新

只担心不确定性 恐惧

甚至危险的

错误信息

的传播 透明的 善解人意和

客观的沟通

极大地帮助

人们放松 很快许多人团结起来

,通过给我们贴上新闻稿的标签,让我们保持在门口

,最终许多人为我们欢呼

保持

好几个星期的工作精力

仍然很高,还有更多的人

加入了

这个小组,希望能提供帮助

来自不同国家的人

聚集在一起翻译 在幕后将网站翻译成

许多区域语言

群聊还

开启了

研究人员、记者、经济学家等人之间的对话和合作,

而发展已经顺利进行

并最终开始缓和

一些真正的工作已经开始为人们

在数据操作组中,

他们

对检测

呈阳性

和与病毒作斗争的人数的增加感到非常震惊,以便为您提供一些

背景信息,28 个州和 8 个联邦直辖区中的每一个

都有不同程度的沟通和报告渠道,

并且 结构 尽管报告

的难度是原来的一半,

但团队只需要在接下来的几个月中记录少数

案例即可。

团队已经

从按州汇总转向

按地区汇总

印度大约有 700 个地区

和四个数据 为

每个

收集的点,每天收集大约 2 800 个数据点

e 尽管许多地区

和州无法公布所有

细节,

但收集到的数据仍然非常庞大,

因此团队不得不通过

多次重新评估他们的运营来

适应主要来源不断变化的性质

,有时

甚至打破限制,从而即兴发挥 例如,谷歌

表格中的

一些甚至构建了自定义的大文本

识别脚本,

以便

在它们进入时筛选多个 pdf,并迅速将它们变成

他们可以理解的东西,

但最终不管

所有的技术魔法

和软件 体操 最后的

条目是由志愿者输入

的,以避免任何明显的

错误

确实,技术使

与多人协作

比以往任何时候都更容易,但是

计算机并没有感觉到在

电子表格中添加数千行所带来的沉重负担

罗马那里让我们

瞥见了成千上万

印度人度过这场可怕的

流行病的生活

鉴于志愿者工作的严酷性质,

度过

一些最糟糕的日子并不

容易

,我相信对你来说

也不容易 人们的生命每天都处于

危险之中

经济严重下滑

人们正在失去工作,最

重要

的是,很多人正在死去,有些

人是朋友,有些人是

家人,

我们一路上失去了很多人

这很糟糕,但 最让我惊讶的

是,

人们仍然出现在这个

小组中,我们之前没有见过面

,每一次互动都是在线的,但

在我们生活中这个非常平静的时刻,有一种熟悉

感将我们聚集在一起

人们在

不同的国家和时区互相检查

,存在这种同情

关怀和无私的感觉,

我从未想过这个见证

会发生在电子表格上,

但是 在那里

,我们每个人都

在某个时候感到很私人,这是一种

无形的文化,使

这一倡议

持续了一年多,今天

标志着自从有人在互联网上发布此消息以来的一年

希望从

这场大流行的开始

将是未来整个旅程中的宝贵数据

india.org 一夜之间已成为研究州政府的

重要信息来源

,甚至

最近发布

的印度经济调查最终

将不得不停止这一举措

目标,

但在那之前,我们将继续

在过去一年中一天一天地完成它

,你们中的许多人以许多不同的方式帮助了我们

,我们真诚地感谢

你们帮助我们的所有帮助,

相信合作没有限制