Bergna Annalisa, Ventura Carla Della, Marzo Rossella, Ciccozzi Massimo, Galli Massimo, Zehender Gianguglielmo, Lai Alessia
Department of Biomedical and Clinical Sciences Luigi Sacco, University of Milan, Italy.
Unit of Clinical Pathology and Microbiology, University Campus Bio-Medico of Rome, Italy.
Infez Med. 2022 Mar 1;30(1):109-118. doi: 10.53854/liim-3001-13. eCollection 2022.
Aim of this study was to reconstruct the phylogeography of virus (VARV) in the XX century, using 47 VARV whole genome sequences available in public databases, through two different methods for ancestral character reconstruction: a frequently used Bayesian framework and a fast maximum-likelihood (ML) based method. The substitution rate of the whole VARV genome was estimated to be between 6.7×10 and 1.1×10 substitutions/site/year. Both ML and Bayesian methods gave similar trees topology, showing two distinct monophyletic groups: one (known as P1) including the great part of and the second (P2) including West African and American () isolates and close evolutionary rate estimations, between 6.73×10 and 1.1×10 for the whole genome. The phylogeographical reconstruction of P1 suggested that the common ancestor of the circulating in the Old World between the 1940s and the 1970s most probably originated in the Far East in the first decades of the XX century, and then spread to Indian subcontinent in the 1920s. India represented a center of further spread of VARV to eastern Africa in the 1940s and to the Middle East in the 1960s. The phylogeographic scenario obtained by the maximum-likelihood based method was congruent with that obtained by Bayesian framework, but the analysis was faster indicating the usefulness of this method in the analyses of large viral genomes. Our results may help to explain the controversial reconstructions of the history of VARV obtained using long or short timescale for calibration.
本研究的目的是利用公共数据库中现有的47个天花病毒(VARV)全基因组序列,通过两种不同的祖先特征重建方法:一种常用的贝叶斯框架和一种基于快速最大似然(ML)的方法,来重建20世纪天花病毒的系统地理学。天花病毒全基因组的替换率估计在6.7×10⁻⁶和1.1×10⁻⁶替换/位点/年之间。最大似然法和贝叶斯法都给出了相似的树形拓扑结构,显示出两个不同的单系类群:一个(称为P1)包括大部分亚洲分离株,另一个(P2)包括西非和美洲(巴西)分离株,并且全基因组的进化速率估计相近,在6.73×10⁻⁶和1.1×10⁻⁶之间。P1的系统地理学重建表明,20世纪40年代至70年代在旧世界传播的天花病毒的共同祖先很可能起源于20世纪头几十年的远东地区,然后在20世纪20年代传播到印度次大陆。印度是天花病毒在20世纪40年代进一步传播到东非以及在20世纪60年代传播到中东的一个中心。基于最大似然法获得的系统地理学情况与通过贝叶斯框架获得的情况一致,但分析速度更快,表明该方法在分析大型病毒基因组方面的有用性。我们的结果可能有助于解释使用长或短时间尺度进行校准所获得的天花病毒历史的有争议的重建。