Gilbert and Rose Mary Chagoury School of Medicine, Lebanese American University, Byblos, Lebanon.
Independent Researcher, Montclair, USA.
BMC Infect Dis. 2022 Apr 1;22(1):322. doi: 10.1186/s12879-022-07147-2.
The world has been dealing with the SARS-CoV-2 pandemic since December 2019, and considerable effort has gone into tracking the spread of the virus by gathering testing statistics and generating viral genomic sequences. Unfortunately, there is neither a single comprehensive resource with global historical testing data nor a centralized database with summary statistics of the identified genomic variants. We merged several pre-aggregated historical testing datasets and complemented them with our own manually extracted data, which consist of 6852 historical test statistics from 76 countries/states that, at the date of submission, were not reported in any other dataset, making our dataset the most comprehensive to date. We also analyzed all publicly deposited SARS-CoV-2 genomic sequences in GISAID and annotated their variants. Both datasets can be accessed through our interactive dashboard, which also provides important insights into outbreak trends across countries and states. The dashboard is available at https://bioinfo.lau.edu.lb/gkhazen/covid19. A daily updated version of the datasets can be downloaded from github.com/KhazenLab/covid19-data.
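The merging step described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name `merge_testing_data`, the `(region, date)` key, the rule that earlier sources take priority, and all numbers are assumptions made purely for illustration.

```python
from typing import Dict, Iterable, Tuple

# Hypothetical record key: (region name, ISO date). Values are cumulative tests.
Key = Tuple[str, str]


def merge_testing_data(
    sources: Iterable[Dict[Key, int]], manual: Dict[Key, int]
) -> Dict[Key, int]:
    """Combine pre-aggregated sources, then fill remaining gaps from manual data."""
    merged: Dict[Key, int] = {}
    for source in sources:
        for key, total_tests in source.items():
            # Assumed priority rule: the first source to report a key wins.
            merged.setdefault(key, total_tests)
    for key, total_tests in manual.items():
        # Manual entries only complement; they never overwrite a source value.
        merged.setdefault(key, total_tests)
    return merged


# Made-up example values, not taken from the published dataset.
source_a = {("Lebanon", "2020-04-01"): 1000}
source_b = {("Lebanon", "2020-04-01"): 990, ("Lebanon", "2020-04-02"): 1200}
manual = {("Lebanon", "2020-04-02"): 1190, ("Lebanon", "2020-04-03"): 1350}

merged = merge_testing_data([source_a, source_b], manual)
```

In this sketch the manually extracted rows contribute only the dates that no pre-aggregated source covers, which mirrors the idea of complementing existing datasets rather than replacing them.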