Rouzine I M, Coffin J M
Department of Molecular Biology and Microbiology, School of Medicine, Tufts University, Boston, Massachusetts 02111, USA.
J Virol. 1999 Oct;73(10):8167-78. doi: 10.1128/JVI.73.10.8167-8178.1999.
To study the mechanism of evolution of the human immunodeficiency virus (HIV) protease gene (pro), we analyzed a database of 213 pro sequences isolated from 11 HIV type 1-infected patients who had not been treated with protease inhibitors. Variation in pro is restricted to rare variable bases which are highly diverse and differ in location among individuals; an average variable base appears in about 16% of individuals. The average intrapatient distance per individual variable site, 27%, is similar for synonymous and nonsynonymous sites, although synonymous sites are twice as abundant. The latter observation excludes selection for diversity as an important, permanently acting factor in the evolution of pro and leaves purifying selection as the only kind of selection. Based on this, we developed a model of evolution, both within individuals and along the transmission chain, which explains variable sites as slightly deleterious mutants slowly reverting to the better-fit variant during individual infection. In the case of a single-source transmission, genetic bottlenecks at the moment of transmission effectively suppress selection, allowing mutants to accumulate along the transmission chain to high levels. However, even very rare coinfections from independent sources are, as we show, able to counteract the bottleneck effect. Therefore, there are two possible explanations for the high mutant frequency. First, the frequency of coinfection in the natural host population may be quite low. Alternatively, a strong variation of the best-adapted sequence between individuals could be caused by a combination of an immune response present in early infection and coselection.
为研究人类免疫缺陷病毒(HIV)蛋白酶基因(pro)的进化机制,我们分析了一个数据库,该数据库包含从11名未接受蛋白酶抑制剂治疗的1型HIV感染患者中分离出的213个pro序列。pro的变异仅限于罕见的可变碱基,这些碱基高度多样且在个体间位置不同;平均每个可变碱基出现在约16%的个体中。每个个体可变位点的患者内平均距离为27%,同义位点和非同义位点相似,尽管同义位点的数量是其两倍。后一观察结果排除了多样性选择作为pro进化中一个重要的、持续起作用的因素,仅留下纯化选择作为唯一的选择类型。基于此,我们建立了一个个体内部和传播链上的进化模型,该模型将可变位点解释为在个体感染期间缓慢恢复为更适应变体的轻度有害突变体。在单源传播的情况下,传播时的基因瓶颈有效地抑制了选择,使突变体能够在传播链上积累到高水平。然而,正如我们所表明的,即使是来自独立来源的非常罕见的合并感染也能够抵消瓶颈效应。因此,对于高突变频率有两种可能的解释。第一,自然宿主群体中合并感染的频率可能相当低。或者,个体之间最佳适应序列的强烈变异可能是由早期感染中存在的免疫反应和共选择的组合引起的。