Department of Computer Science and Engineering, University of Louisville, Louisville, Kentucky, United States of America.
Kentucky Biomedical Research Infrastructure Network Bioinformatics Core, University of Louisville, Louisville, Kentucky, United States of America.
PLoS One. 2020 Nov 5;15(11):e0241535. doi: 10.1371/journal.pone.0241535. eCollection 2020.
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is an RNA virus whose genome consists of approximately 30,000 bases. As part of testing efforts, whole-genome sequencing of human isolates has made over 1,600 complete genomes publicly available through GenBank. We performed a comparative analysis of these sequences to detect common mutations within the population. Analysis of the assembled genomes yields 417 variants that occur in at least 1% of the complete genomes: 229 within the 5' untranslated region (UTR), 152 within the 3' UTR, 2 within intergenic regions, and 34 within coding sequences.
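The 1% threshold described above can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `common_variants`, its parameters, and the toy sequences are hypothetical, and the code assumes that every genome has already been aligned to a reference of the same length and that only single-base substitutions are of interest.

```python
from collections import Counter


def common_variants(reference, genomes, min_fraction=0.01):
    """Return {(position, ref_base, alt_base): fraction} for substitutions
    found in at least `min_fraction` of the genomes.

    Assumption: each genome is pre-aligned to `reference` and has the same
    length. Positions are 1-based.
    """
    counts = Counter()
    for genome in genomes:
        if len(genome) != len(reference):
            raise ValueError("genomes must be aligned to the reference")
        for pos, (ref_base, base) in enumerate(zip(reference, genome), start=1):
            # Count only unambiguous substitutions; skip matches, N calls, and gaps.
            if base != ref_base and base in "ACGT":
                counts[(pos, ref_base, base)] += 1

    total = len(genomes)
    # Keep variants whose frequency meets the threshold (1% by default).
    return {v: n / total for v, n in counts.items() if n / total >= min_fraction}


# Toy data (hypothetical): 2 of 100 genomes carry a C-to-T change at position 2.
toy_result = common_variants("ACGT", ["ACGT"] * 98 + ["ATGT"] * 2)
```

Counting over pre-aligned sequences keeps the example short; a real analysis would also need to handle insertions, deletions, and annotation of each variant's genomic region.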