Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, Switzerland.
BMC Genomics. 2009 Dec 18;10:619. doi: 10.1186/1471-2164-10-619.
The expansion of amino acid repeats is determined by a high mutation rate and can be increased or limited by selection. It has been suggested that recent expansions could be associated with the potential for adaptation to new environments. In this work, we quantify the strength of this association, as well as the contribution of potential confounding factors.
Mammalian positively selected genes have accumulated more recent amino acid repeats than other mammalian genes. However, we found little support for an accelerated evolutionary rate as the main driver for the expansion of amino acid repeats. The most significant predictors of amino acid repeats are gene function and GC content. There is no correlation with expression level.
Our analyses show that amino acid repeat expansions are causally independent of protein adaptive evolution in mammalian genomes. Neither relaxed purifying selection nor positive selection is associated with more, or more recent, amino acid repeats. The occurrence of repeats is slightly favoured by the sequence context but mainly determined by the molecular function of the gene.