Wang Hongru, Pipes Lenore, Nielsen Rasmus
Department of Integrative Biology, UC Berkeley, Berkeley, CA 94707, USA.
Department of Statistics, UC Berkeley, Berkeley, CA 94707, USA.
Virus Evol. 2020 Dec 30;7(1):veaa098. doi: 10.1093/ve/veaa098. eCollection 2021 Jan.
Human severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is most closely related, by average genetic distance, to two coronaviruses isolated from bats, RaTG13 and RmYN02. However, there is a segment of high amino acid similarity between human SARS-CoV-2 and a pangolin-isolated strain, GD410721, in the receptor-binding domain (RBD) of the spike protein, a pattern that can be caused either by recombination or by convergent amino acid evolution driven by natural selection. We perform a detailed analysis of the synonymous divergence, which is less likely to be affected by selection than amino acid divergence, between human SARS-CoV-2 and related strains. We show that the synonymous divergence between the bat-derived viruses and SARS-CoV-2 is larger than that between GD410721 and SARS-CoV-2 in the RBD, providing strong additional support for the recombination hypothesis. However, the synonymous divergence between the pangolin strain and SARS-CoV-2 is also relatively high, which is not consistent with a recent recombination between them; instead, it suggests a recombination into RaTG13. We also find a 14-fold increase in the dN/dS ratio from the lineage leading to SARS-CoV-2 to the strains of the current pandemic, suggesting that the vast majority of nonsynonymous mutations currently segregating within the human strains have a negative impact on viral fitness. Finally, we estimate the time to the most recent common ancestor of SARS-CoV-2 and RaTG13 or RmYN02, based on synonymous divergence, to be 51.71 years (95% CI, 28.11-75.31) and 37.02 years (95% CI, 18.19-55.85), respectively.
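As a rough illustration of how a divergence time can be read off synonymous divergence, the sketch below assumes a strict molecular clock, under which pairwise divergence accumulates along both lineages since their common ancestor, so T = dS / (2 × rate). The abstract does not describe the authors' estimation procedure, so this is a generic textbook relation rather than their method. The function names and the input divergence and rate values are hypothetical; only the confidence-interval bounds are taken from the abstract.

```python
def tmrca_years(ds_pairwise, rate_per_site_per_year):
    """Time to the most recent common ancestor under a strict clock.

    ds_pairwise: synonymous substitutions per synonymous site between two strains.
    rate_per_site_per_year: synonymous substitution rate per site per year.
    Divergence accumulates on both lineages, hence the factor of 2.
    """
    if rate_per_site_per_year <= 0:
        raise ValueError("substitution rate must be positive")
    return ds_pairwise / (2.0 * rate_per_site_per_year)


def ci_midpoint_and_half_width(lower, upper):
    """Midpoint and half-width of an interval, for checking its symmetry."""
    return (lower + upper) / 2.0, (upper - lower) / 2.0


# Hypothetical inputs, chosen only to show the arithmetic (not values from the paper).
example_t = tmrca_years(ds_pairwise=0.1, rate_per_site_per_year=1e-3)

# Interval bounds as reported in the abstract.
ratg13_mid, ratg13_half = ci_midpoint_and_half_width(28.11, 75.31)
rmyn02_mid, rmyn02_half = ci_midpoint_and_half_width(18.19, 55.85)

print(f"hypothetical TMRCA: {example_t:.2f} years")
print(f"RaTG13 interval: midpoint {ratg13_mid:.2f}, half-width {ratg13_half:.2f}")
print(f"RmYN02 interval: midpoint {rmyn02_mid:.2f}, half-width {rmyn02_half:.2f}")
```

The midpoints of the two reported intervals coincide with the reported point estimates (51.71 and 37.02 years), which is what one would expect from symmetric intervals around the estimate.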