Zhao Wen-Ming, Song Shu-Hui, Chen Mei-Li, Zou Dong, Ma Li-Na, Ma Ying-Ke, Li Ru-Jiao, Hao Li-Li, Li Cui-Ping, Tian Dong-Mei, Tang Bi-Xia, Wang Yan-Qing, Zhu Jun-Wei, Chen Huan-Xin, Zhang Zhang, Xue Yong-Biao, Bao Yi-Ming
China National Center for Bioinformation & National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
China National Center for Bioinformation & National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
Yi Chuan. 2020 Feb 20;42(2):212-221. doi: 10.16288/j.yczz.20-030.
An ongoing outbreak of a novel coronavirus infection in Wuhan, China, since December 2019 has led to 31,516 infections and 638 deaths across 25 countries (as of 16:00 on February 7, 2020). The World Health Organization named the virus causing this pneumonia the 2019 novel coronavirus (2019-nCoV). To promote data sharing and make all relevant information on 2019-nCoV publicly available, we constructed the 2019 Novel Coronavirus Resource (2019nCoVR, https://bigd.big.ac.cn/ncov). 2019nCoVR comprehensively integrates genomic and proteomic sequences and their metadata from the Global Initiative on Sharing All Influenza Data, the National Center for Biotechnology Information, China National GeneBank, the National Microbiology Data Center, and the China National Center for Bioinformation (CNCB)/National Genomics Data Center (NGDC). It also incorporates a wide range of relevant information, including scientific literature, news, and popular science articles, and provides visualizations of genome variation analysis results based on all collected 2019-nCoV strains. Moreover, by linking seamlessly with related databases in CNCB/NGDC, 2019nCoVR offers virus data submission and sharing services for raw sequence reads and assembled sequences. In this report, we describe data deposition, management, release, and use in 2019nCoVR, which lays an important foundation for studies on virus classification and origin, genome variation and evolution, rapid detection, drug development, and precision prevention and treatment of pneumonia.