Rabadan Raul, Levine Arnold J, Robins Harlan
Institute for Advanced Study, Einstein Drive, Princeton, NJ 08540, USA.
J Virol. 2006 Dec;80(23):11887-91. doi: 10.1128/JVI.01414-06. Epub 2006 Sep 20.
In the last few years, the genomic sequence data for thousands of influenza A virus strains, including the 1918 pandemic strain, and hundreds of isolates of the avian influenza virus H5N1, which is causing an increasing number of human fatalities, have become publicly available. This large quantity of sequence data allows us to do comparative genomics with the human and avian versions of the virus. We find that the nucleotide compositions of influenza A viruses infecting the two hosts are sufficiently different that we can determine the host at almost 100% accuracy. This assignment works at the segment level, which allows us to construct the reassortment history of individual segments within each strain. We suggest that the different nucleotide compositions can be explained by a host-dependent mutation bias. To support this idea, we estimate the fixation rates for the different polymerase segments and the ratios of synonymous to nonsynonymous changes. Additionally, we provide evidence supporting the hypothesis that the H1N1 influenza virus entered the human population just prior to the 1918 outbreak, with an earliest bound of 1910.
在过去几年中,包括1918年大流行毒株在内的数千种甲型流感病毒株以及导致人类死亡人数不断增加的数百株H5N1禽流感病毒的基因组序列数据已公开可用。如此大量的序列数据使我们能够对该病毒的人类和禽类版本进行比较基因组学研究。我们发现,感染这两种宿主的甲型流感病毒的核苷酸组成差异足够大,以至于我们能够以近100%的准确率确定宿主。这种识别在片段水平上有效,这使我们能够构建每个毒株内各个片段的重配历史。我们认为,不同的核苷酸组成可以用宿主依赖性突变偏差来解释。为支持这一观点,我们估计了不同聚合酶片段的固定率以及同义变化与非同义变化的比率。此外,我们提供了证据支持以下假设:H1N1流感病毒在1918年疫情爆发前刚刚进入人类群体,最早界限为1910年。