Department of Molecular Biology and Ecology of Plants, Tel Aviv University, Tel-Aviv 69978, Israel.
BMC Evol Biol. 2013 Aug 4;13:164. doi: 10.1186/1471-2148-13-164.
Synonymous or silent mutations are usually thought to evolve neutrally. However, accumulating recent evidence has demonstrated that silent mutations may destabilize RNA structures or disrupt cis regulatory motifs superimposed on coding sequences. Such observations suggest the existence of stretches of codon sites that are evolutionary conserved at both DNA-RNA and protein levels. Such stretches may point to functionally important regions within protein coding sequences not necessarily reflecting functional constraints on the amino-acid sequence. The HIV-1 genome is highly compact, and often harbors overlapping functional elements at the protein, RNA, and DNA levels. This superimposition of functions leads to complex selective forces acting on all levels of the genome and proteome. Considering the constraints on HIV-1 to maintain such a highly compact genome, we hypothesized that stretches of synonymous conservation would be common within its genome.
We used a combined computational-experimental approach to detect and characterize regions exhibiting strong purifying selection against synonymous substitutions along the HIV-1 genome. Our methodology is based on advanced probabilistic evolutionary models that explicitly account for synonymous rate variation among sites and rate dependencies among adjacent sites. These models are combined with a randomization procedure to automatically identify the most statistically significant regions of conserved synonymous sites along the genome. Using this procedure we identified 21 conserved regions. Twelve of these are mapped to regions within overlapping genes, seven correlate with known functional elements, while the functions of the remaining four are yet unknown. Among these four regions, we chose the one that deviates most from synonymous rate homogeneity for in-depth computational and experimental characterization. In our assays aiming to quantify viral fitness in both early and late stages of the replication cycle, no differences were observed between the mutated and the wild type virus following the introduction of synonymous mutations.
The contradiction between the inferred purifying selective forces and the lack of effect of these mutations on viral replication may be explained by the fact that the phenotype was measured in single-cycle infection assays in cell culture. Such a system does not account for the complexity of HIV-1 infections in vivo, which involves multiple infection cycles and interaction with the host immune system.
同义或沉默突变通常被认为是中性进化的。然而,越来越多的证据表明,沉默突变可能会破坏 RNA 结构或干扰叠加在编码序列上的顺式调控基序。这些观察结果表明,在 DNA-RNA 和蛋白质水平上存在着密码子位点的连续区域,这些区域是进化保守的。这些连续区域可能指向蛋白质编码序列中功能重要的区域,而不一定反映对氨基酸序列的功能限制。HIV-1 基因组高度紧凑,通常在蛋白质、RNA 和 DNA 水平上具有重叠的功能元件。这种功能的叠加导致作用于基因组和蛋白质组所有层面的复杂选择压力。考虑到 HIV-1 维持如此高度紧凑基因组的限制,我们假设其基因组内会存在大量同义保守的连续区域。
我们使用了一种组合的计算实验方法来检测和描述 HIV-1 基因组中存在的强烈纯化选择对同义替换的区域。我们的方法基于先进的概率进化模型,这些模型明确考虑了位点间的同义替换率变异和相邻位点间的速率依赖性。这些模型与随机化程序相结合,自动识别基因组中最具统计学意义的保守同义位点区域。使用这种方法,我们鉴定了 21 个保守区域。其中 12 个位于重叠基因内的区域,7 个与已知的功能元件相关,而其余 4 个的功能尚不清楚。在这四个区域中,我们选择了一个与同义替换率同质性偏离最大的区域进行深入的计算和实验特征描述。在我们旨在量化复制周期早期和晚期病毒适应性的实验中,在引入同义突变后,突变病毒与野生型病毒之间没有观察到差异。
推断的纯化选择力与这些突变对病毒复制没有影响之间的矛盾可以用这样一个事实来解释,即在细胞培养的单轮感染实验中测量表型。这种系统不能解释 HIV-1 感染体内的复杂性,因为它涉及多个感染周期和与宿主免疫系统的相互作用。