Department of Mathematics, Bowdoin College, Brunswick, Maine, United States of America.
Odum School of Ecology, University of Georgia, Athens, Georgia, United States of America.
PLoS Biol. 2022 Jun 22;20(6):e3001685. doi: 10.1371/journal.pbio.3001685. eCollection 2022 Jun.
Historically, emerging and reemerging infectious diseases have caused large, deadly, and expensive multinational outbreaks. Often outbreak investigations aim to identify who infected whom by reconstructing the outbreak transmission tree, which visualizes transmission between individuals as a network with nodes representing individuals and branches representing transmission from person to person. We compiled a database, called OutbreakTrees, of 382 published, standardized transmission trees consisting of 16 directly transmitted diseases ranging in size from 2 to 286 cases. For each tree and disease, we calculated several key statistics, such as tree size, average number of secondary infections, the dispersion parameter, and the proportion of cases considered superspreaders, and examined how these statistics varied over the course of each outbreak and under different assumptions about the completeness of outbreak investigations. We demonstrated the potential utility of the database through 2 short analyses addressing questions about superspreader epidemiology for a variety of diseases, including Coronavirus Disease 2019 (COVID-19). First, we found that our transmission trees were consistent with theory predicting that intermediate dispersion parameters give rise to the highest proportion of cases causing superspreading events. Additionally, we investigated patterns in how superspreaders are infected. Across trees with more than 1 superspreader, we found preliminary support for the theory that superspreaders generate other superspreaders. In sum, our findings put the role of superspreading in COVID-19 transmission in perspective with that of other diseases and suggest an approach to further research regarding the generation of superspreaders. These data have been made openly available to encourage reuse and further scientific inquiry.
从历史上看,新出现和重新出现的传染病已经在全球范围内造成了大规模、致命和昂贵的疫情爆发。通常,疫情爆发的调查旨在通过重建疫情传播树来确定谁感染了谁,该树将人与人之间的传播可视化为人与人之间的网络,其中节点代表个体,分支代表人与人之间的传播。我们编译了一个名为 OutbreakTrees 的数据库,其中包含 382 个已发表的标准化传播树,这些树包含 16 种直接传播疾病,大小从 2 例到 286 例不等。对于每棵树和每种疾病,我们计算了几个关键统计数据,例如树的大小、平均二次感染人数、分散参数以及被认为是超级传播者的病例比例,并研究了这些统计数据在每次爆发过程中以及在对爆发调查的完整性的不同假设下是如何变化的。我们通过两个简短的分析来证明数据库的潜在用途,这些分析解决了各种疾病(包括 2019 年冠状病毒病(COVID-19))的超级传播者流行病学方面的问题。首先,我们发现我们的传播树与理论一致,该理论预测中等分散参数会导致最高比例的病例引发超级传播事件。此外,我们还研究了超级传播者感染的模式。在有超过 1 个超级传播者的传播树中,我们初步支持了超级传播者产生其他超级传播者的理论。总的来说,我们的研究结果将 COVID-19 传播中的超级传播者作用与其他疾病进行了比较,并提出了一种方法来进一步研究超级传播者的产生。这些数据已经公开提供,以鼓励重复使用和进一步的科学探究。