Rahimian Karim, Arefian Ehsan, Mahdavi Bahar, Mahmanzar Mohammadamin, Kuehu Donna Lee, Deng Youping
Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran.
Department of Microbiology, School of Biology, College of Science, University of Tehran, Tehran, Iran.
NAR Genom Bioinform. 2023 Apr 24;5(2):lqad037. doi: 10.1093/nargab/lqad037. eCollection 2023 Jun.
The coronavirus disease 19 (COVID-19) is a highly pathogenic viral infection of the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), resulted in the global pandemic of 2020. A lack of therapeutic and preventive strategies has quickly posed significant threats to world health. A comprehensive understanding of SARS-CoV-2 evolution and natural selection, how it impacts host interaction, and phenotype symptoms is vital to develop effective strategies against the virus. The SARS2Mutant database (http://sars2mutant.com/) was developed to provide valuable insights based on millions of high-quality, high-coverage SARS-CoV-2 complete protein sequences. Users of this database have the ability to search for information on three amino acid substitution mutation strategies based on gene name, geographical zone, or comparative analysis. Each strategy is presented in five distinct formats which includes: (i) mutated sample frequencies, (ii) heat maps of mutated amino acid positions, (iii) mutation survivals, (iv) natural selections and (v) details of substituted amino acids, including their names, positions, and frequencies. GISAID is a primary database of genomics sequencies of influenza viruses updated daily. SARS2Mutant is a secondary database developed to discover mutation and conserved regions from the primary data to assist with design for targeted vaccine, primer, and drug discoveries.
新型冠状病毒肺炎(COVID-19)是由新型严重急性呼吸综合征冠状病毒2(SARS-CoV-2)引起的一种高致病性病毒感染,导致了2020年的全球大流行。缺乏治疗和预防策略迅速对世界卫生构成了重大威胁。全面了解SARS-CoV-2的进化和自然选择、它如何影响宿主相互作用以及表型症状对于制定有效的抗病毒策略至关重要。SARS2Mutant数据库(http://sars2mutant.com/)的开发是为了基于数百万条高质量、高覆盖率的SARS-CoV-2完整蛋白质序列提供有价值的见解。该数据库的用户能够根据基因名称、地理区域或比较分析搜索三种氨基酸替代突变策略的信息。每种策略以五种不同的格式呈现,包括:(i)突变样本频率,(ii)突变氨基酸位置的热图,(iii)突变存活率,(iv)自然选择,以及(v)替代氨基酸的详细信息,包括它们的名称、位置和频率。GISAID是一个每天更新的流感病毒基因组序列的主要数据库。SARS2Mutant是一个二级数据库,旨在从原始数据中发现突变和保守区域,以协助设计靶向疫苗、引物和药物。