Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Sweden.
Genome Biol Evol. 2011;3:1210-9. doi: 10.1093/gbe/evr094. Epub 2011 Sep 16.
Recent results from Drosophila suggest that positive selection has a substantial impact on genomic patterns of polymorphism and divergence. However, species with smaller population sizes and/or stronger population structure may not be expected to exhibit Drosophila-like patterns of sequence variation. We test this prediction and identify determinants of levels of polymorphism and rates of protein evolution using genomic data from Arabidopsis thaliana and the recently sequenced Arabidopsis lyrata genome. We find that, in contrast to Drosophila, there is no negative relationship between nonsynonymous divergence and silent polymorphism at any spatial scale examined. Instead, synonymous divergence is a major predictor of silent polymorphism, which suggests variation in mutation rate as the main determinant of silent variation. Variation in rates of protein divergence is mainly correlated with gene expression level and breadth, consistent with results for a broad range of taxa, and map-based estimates of recombination rate are only weakly correlated with nonsynonymous divergence. Variation in mutation rates and the strength of purifying selection seem to be major drivers of patterns of polymorphism and divergence in Arabidopsis. Nevertheless, a model allowing for varying negative and positive selection by functional gene category explains the data better than a homogeneous model, implying the action of positive selection on a subset of genes. Genes involved in disease resistance and abiotic stress display high proportions of adaptive substitution. Our results are important for a general understanding of the determinants of rates of protein evolution and the impact of selection on patterns of polymorphism and divergence.
最近来自果蝇的研究结果表明,正选择对基因组多态性和分化的模式有很大的影响。然而,种群规模较小和/或种群结构较强的物种可能不会表现出与果蝇类似的序列变异模式。我们使用来自拟南芥和最近测序的拟南芥 lyrata 基因组的基因组数据来检验这一预测,并确定多态性水平和蛋白质进化速率的决定因素。我们发现,与果蝇不同,在任何研究的空间尺度上,非同义分歧与沉默多态性之间没有负相关关系。相反,同义分歧是沉默多态性的主要预测因子,这表明突变率的变化是沉默变异的主要决定因素。蛋白质分歧率的变化主要与基因表达水平和广度相关,与广泛的分类群的结果一致,并且基于图谱的重组率估计值与非同义分歧的相关性较弱。突变率和纯化选择的强度变化似乎是拟南芥多态性和分化模式的主要驱动因素。尽管如此,允许按功能基因类别进行不同的负选择和正选择的模型比均匀模型更好地解释数据,这意味着正选择作用于一部分基因。与非生物胁迫和疾病抗性相关的基因显示出高比例的适应性替代。我们的研究结果对于全面了解蛋白质进化速率的决定因素以及选择对多态性和分化模式的影响具有重要意义。