College of Veterinary Medicine, Western University of Health Sciences, 309 East Second Street, Pomona, CA 91766, USA.
Central Research Facility, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India.
Genomics. 2020 Nov;112(6):5331-5342. doi: 10.1016/j.ygeno.2020.11.003. Epub 2020 Nov 5.
To understand SARS-CoV-2 microevolution, this study explored the genome-wide frequency, gene-wise distribution, and molecular nature of all point-mutations detected across its 71,703 RNA-genomes deposited in GISAID till 21 August 2020. Globally, nsp1/nsp2 and orf7a/orf3a were the most mutation-ridden non-structural and structural genes respectively. Phylogeny of 4618 spatiotemporally-representative genomes revealed that entities belonging to the early lineages are mostly spread over Asian countries, including India, whereas the recently-derived lineages are more globally distributed. Of the total 20,163 instances of polymorphism detected across global genomes, 12,594 and 7569 involved transitions and transversions, predominated by cytidine-to-uridine and guanosine-to-uridine conversions, respectively. Positive selection of nonsynonymous mutations (dN/dS >1) in most of the structural, but not the non-structural, genes indicated that SARS-CoV-2 has already harmonized its replication/transcription machineries with the host metabolism, while it is still redefining virulence/transmissibility strategies at the molecular level. Mechanistic bases and evolutionary/pathogenicity-related implications are discussed for the predominant mutation-types.
为了了解 SARS-CoV-2 的微观进化,本研究探索了截至 2020 年 8 月 21 日在 GISAID 中存储的其 71703 个 RNA 基因组中所有点突变的全基因组频率、基因分布和分子性质。在全球范围内,非结构蛋白 nsp1/nsp2 和结构蛋白 orf7a/orf3a 分别是突变最严重的非结构和结构基因。对 4618 个具有时空代表性的基因组的系统发育分析表明,早期谱系的实体主要分布在亚洲国家,包括印度,而最近衍生的谱系则分布更广。在全球基因组中检测到的 20163 个多态性实例中,转换和颠换分别占 12594 个和 7569 个,分别以胞嘧啶到尿嘧啶和鸟嘌呤到尿嘧啶的转换为主。大多数结构基因中的非同义突变(dN/dS>1)都受到正选择,而非结构基因则没有,这表明 SARS-CoV-2 已经使其复制/转录机制与宿主代谢相协调,而它仍在分子水平上重新定义毒力/传染性策略。讨论了主要突变类型的机制基础和与进化/致病性相关的影响。