Institute of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, 8057, Switzerland.
Mol Ecol. 2013 Jul;22(13):3525-38. doi: 10.1111/mec.12329. Epub 2013 Jun 15.
Sequence data for >20 000 annotated genes from 56 accessions of Medicago truncatula were used to identify potential targets of positive selection, the determinants of evolutionary rate variation and the relative importance of positive and purifying selection in shaping nucleotide diversity. Based upon patterns of intraspecific diversity and interspecific divergence, c. 50-75% of nonsynonymous polymorphisms are subject to strong purifying selection and 1% of the sampled genes harbour a signature of positive selection. Combining polymorphism with expression data, we estimated the distribution of fitness effects and found that the proportion of deleterious mutations is significantly greater for expressed genes than for genes with undetected transcripts (nonexpressed) in a previous RNA-seq experiment and greater for broadly expressed genes than those expressed in only a single tissue. Expression level is the strongest correlate of evolutionary rates at nonsynonymous sites, and despite multiple genomic features being significantly correlated with evolutionary rates, they explain less than 20% of the variation in nonsynonymous rates (dN) and <15% of the variation in either synonymous rates (dS) or dN:dS. Among putative targets of selection were genes involved in defence against pathogens and herbivores, genes with roles in mediating the relationship with rhizobial symbionts and one-third of annotated histone-lysine methyltransferases. Adaptive evolution of the methyltransferases suggests that positive selection in gene expression may have occurred through evolution of enzymes involved in epigenetic modification.
利用来自 56 个蒺藜苜蓿(Medicago truncatula)品系的 20000 多个注释基因的序列数据,鉴定潜在的正选择靶点、进化速率变化的决定因素以及正选择和纯化选择在塑造核苷酸多样性方面的相对重要性。基于种内多样性和种间分化的模式,大约 50-75%的非同义多态性受到强烈的纯化选择,1%的被抽样基因具有正选择的特征。结合多态性和表达数据,我们估计了适应度效应的分布,发现与先前的 RNA-seq 实验中未检测到转录本(无表达)的基因相比,表达基因中的有害突变比例显著更高,与仅在单个组织中表达的基因相比,广泛表达的基因中的有害突变比例更高。表达水平是非同义位点进化速率的最强相关因素,尽管多个基因组特征与进化速率显著相关,但它们仅解释了非同义速率(dN)变化的不到 20%,以及同义速率(dS)或 dN:dS 变化的不到 15%。在假定的选择靶点中,有一些参与防御病原体和食草动物的基因,有一些在与根瘤菌共生体的关系中起作用的基因,还有三分之一的注释组蛋白赖氨酸甲基转移酶。甲基转移酶的适应性进化表明,基因表达的正选择可能是通过参与表观遗传修饰的酶的进化而发生的。