Zhou Yezhen, Zhang Shihao, Chen Jiayi, Wan Chengsong, Zhao Wei, Zhang Bao
Biosafety Level-3 Laboratory, School of Public Health, Southern Medical University, Guangzhou 510515, China.
Nan Fang Yi Ke Da Xue Xue Bao. 2020 Feb 29;40(2):152-158. doi: 10.12122/j.issn.1673-4254.2020.02.02.
To analyze the evolution and variation of SARS-CoV-2 during the epidemic starting at the end of 2019.
We downloaded the full-length genome sequence of SARS-CoV-2 from the databases of GISAID and NCBI. Using the software for bioinformatics including MEGA-X, BEAST, and TempEst, we constructed the genomic evolution tree, inferred the time evolution signal of the virus, calculated the tMRCA time of the virus and analyzed the selection pressure of the virus during evolution.
The phylogenetic tree showed that SARS-CoV-2 belonged to the Sarbecovirus subgenus of β Coronavirus genus together with bat coronavirus BetaCoV/bat/Yunnan/RaTG13/2013, bat-SL-CoVZC45, bat-SL-CoVZXC21 and SARS-CoV. The genomic sequences of SARS-CoV-2 isolated from the ongoing epidemic showed a weak time evolution signal with an average tMRCA time of 73 days (95% CI: 38.9-119.3 days). No positive time evolution signal was found between SARS-CoV-2 and BetaCoV/bat/Yunnan/RaTG13/2013, but the former virus had a strong positive temporal evolution relationship with bat-SL-CoVZC45 and SARS-CoV. The major cause for mutations of SARS-CoV-2 was the pressure of purification selection during the epidemic.
SARS-CoV-2 may have emerged as early as November, 2019, originating most likely from bat-associated coronavirus. This finding may provide evidence for tracing the sources and evolution of the virus.
分析2019年底开始流行的严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的进化与变异情况。
从全球共享流感数据倡议组织(GISAID)和美国国立生物技术信息中心(NCBI)数据库下载SARS-CoV-2全长基因组序列。使用包括MEGA-X、BEAST和TempEst在内的生物信息学软件构建基因组进化树,推断病毒的时间进化信号,计算病毒的最近共同祖先(tMRCA)时间,并分析病毒进化过程中的选择压力。
系统发育树显示,SARS-CoV-2与蝙蝠冠状病毒BetaCoV/bat/Yunnan/RaTG13/2013、蝙蝠严重急性呼吸综合征相关冠状病毒ZC45(bat-SL-CoVZC45)、蝙蝠严重急性呼吸综合征相关冠状病毒ZXC21(bat-SL-CoVZXC21)以及严重急性呼吸综合征冠状病毒(SARS-CoV)同属于β冠状病毒属的Sarbecovirus亚属。从当前流行疫情中分离出的SARS-CoV-2基因组序列显示出较弱的时间进化信号,平均tMRCA时间为73天(95%可信区间:38.9 - 119.3天)。在SARS-CoV-2与BetaCoV/bat/Yunnan/RaTG13/2013之间未发现正向时间进化信号,但前者与bat-SL-CoVZC45和SARS-CoV存在较强的正向时间进化关系。SARS-CoV-2发生突变的主要原因是疫情期间的纯化选择压力。
SARS-CoV-2可能早在2019年11月就已出现,最有可能起源于与蝙蝠相关的冠状病毒。这一发现可能为追踪病毒的来源和进化提供证据。