Casillas Sònia, Barbadilla Antonio, Bergman Casey M
Faculty of Life Sciences, University of Manchester, Michael Smith Building, Manchester M13 9PT, UK.
Mol Biol Evol. 2007 Oct;24(10):2222-34. doi: 10.1093/molbev/msm150. Epub 2007 Jul 23.
The majority of metazoan genomes consist of nonprotein-coding regions, although the functional significance of most noncoding DNA sequences remains unknown. Highly conserved noncoding sequences (CNSs) have proven to be reliable indicators of functionally constrained sequences such as cis-regulatory elements and noncoding RNA genes. However, CNSs may arise from nonselective evolutionary processes such as genomic regions with extremely low mutation rates known as mutation "cold spots." Here we combine comparative genomic data from recently completed insect genome projects with population genetic data in Drosophila melanogaster to test predictions of the mutational cold spot model of CNS evolution in the genus Drosophila. We find that point mutations in intronic and intergenic CNSs exhibit a significant reduction in levels of divergence relative to levels of polymorphism, as well as a significant excess of rare derived alleles, compared with either the nonconserved spacer regions between CNSs or with 4-fold silent sites in coding regions. Controlling for the effects of purifying selection, we find no evidence of positive selection acting on Drosophila CNSs, although we do find evidence for the action of recurrent positive selection in the spacer regions between CNSs. We estimate that approximately 85% of sites in Drosophila CNSs are under constraint with selection coefficients (N(e)s) on the order of 10-100, and thus, the estimated strength and number of sites under purifying selection is greater for Drosophila CNSs relative to those in the human genome. These patterns of nonneutral molecular evolution are incompatible with the mutational cold spot hypothesis to explain the existence of CNSs in Drosophila and, coupled with similar findings in mammals, argue against the general likelihood that CNSs are generated by mutational cold spots in any metazoan genome.
尽管大多数非编码DNA序列的功能意义仍不明确,但后生动物基因组的大部分由非蛋白质编码区域组成。高度保守的非编码序列(CNSs)已被证明是功能受限序列(如顺式调控元件和非编码RNA基因)的可靠指标。然而,CNSs可能源于非选择性的进化过程,如具有极低突变率的基因组区域,即所谓的突变“冷点”。在这里,我们将最近完成的昆虫基因组项目的比较基因组数据与黑腹果蝇的群体遗传数据相结合,以检验果蝇属中CNS进化的突变冷点模型的预测。我们发现,与CNSs之间的非保守间隔区或编码区的4倍简并位点相比,内含子和基因间CNSs中的点突变在分歧水平上相对于多态性水平显著降低,并且罕见衍生等位基因显著过量。在控制了纯化选择的影响后,我们没有发现正选择作用于果蝇CNSs的证据,尽管我们确实发现了CNSs之间间隔区存在反复正选择作用的证据。我们估计,果蝇CNSs中约85%的位点受到选择系数(N(e)s)约为10 - 100的约束,因此,相对于人类基因组,果蝇CNSs中纯化选择作用下的位点的估计强度和数量更大。这些非中性分子进化模式与解释果蝇中CNSs存在的突变冷点假说不相符,并且与哺乳动物中的类似发现相结合,反对CNSs由任何后生动物基因组中的突变冷点产生的普遍可能性。