Institute of Microbiology, University Hospital Center and University of Lausanne, Lausanne, Switzerland.
Retrovirology. 2011 Nov 1;8:87. doi: 10.1186/1742-4690-8-87.
The HIV-1 genome is subject to pressures that target the virus resulting in escape and adaptation. On the other hand, there is a requirement for sequence conservation because of functional and structural constraints. Mapping the sites of selective pressure and conservation on the viral genome generates a reference for understanding the limits to viral escape, and can serve as a template for the discovery of sites of genetic conflict with known or unknown host proteins.
To build a thorough evolutionary, functional and structural map of the HIV-1 genome, complete subtype B sequences were obtained from the Los Alamos database. We mapped sites under positive selective pressure, amino acid conservation, protein and RNA structure, overlapping coding frames, CD8 T cell, CD4 T cell and antibody epitopes, and sites enriched in AG and AA dinucleotide motives. Globally, 33% of amino acid positions were found to be variable and 12% of the genome was under positive selection. Because interrelated constraining and diversifying forces shape the viral genome, we included the variables from both classes of pressure in a multivariate model to predict conservation or positive selection: structured RNA and α-helix domains independently predicted conservation while CD4 T cell and antibody epitopes were associated with positive selection.
The global map of the viral genome contains positive selected sites that are not in canonical CD8 T cell, CD4 T cell or antibody epitopes; thus, it identifies a class of residues that may be targeted by other host selective pressures. Overall, RNA structure represents the strongest determinant of HIV-1 conservation. These data can inform the combined analysis of host and viral genetic information.
HIV-1 基因组受到针对病毒的选择压力,导致病毒逃逸和适应。另一方面,由于功能和结构的限制,需要保持序列的保守性。对病毒基因组中选择压力和保守性的位点进行定位,可以为理解病毒逃逸的限制提供参考,并可作为发现与已知或未知宿主蛋白发生遗传冲突的位点的模板。
为了构建 HIV-1 基因组的全面进化、功能和结构图谱,我们从洛斯阿拉莫斯数据库中获得了完整的 B 亚型序列。我们对受正选择压力、氨基酸保守性、蛋白质和 RNA 结构、重叠编码框、CD8 T 细胞、CD4 T 细胞和抗体表位以及富含 AG 和 AA 二核苷酸基序的位点进行了定位。总体而言,33%的氨基酸位置是可变的,12%的基因组受到正选择。由于相关的约束和多样化力量塑造了病毒基因组,我们将这两类压力下的变量纳入多变量模型,以预测保守性或正选择:结构 RNA 和α-螺旋结构域独立地预测保守性,而 CD4 T 细胞和抗体表位与正选择相关。
病毒基因组的全球图谱包含不受经典 CD8 T 细胞、CD4 T 细胞或抗体表位约束的正选择位点;因此,它确定了一类可能受到其他宿主选择压力影响的残基。总体而言,RNA 结构是 HIV-1 保守性的最强决定因素。这些数据可以为宿主和病毒遗传信息的综合分析提供信息。