Department of Biology II, University of Munich, Planegg-Martinsried, Germany.
Mol Biol Evol. 2010 Jun;27(6):1226-34. doi: 10.1093/molbev/msq046. Epub 2010 Feb 11.
The detection of selection, both positive and negative, acting on a DNA sequence or class of nucleotide sites requires comparison with a reference sequence that is unaffected by selection. In Drosophila, recent findings of widespread selective constraint, as well as adaptive evolution, in both coding and noncoding regions highlight the difficulties in choosing such a reference sequence. Here, we investigate the utility of short intron sequences as a reference for the detection of selection. For a set of 119 Drosophila melanogaster genes containing 195 short introns (≤120 bp), we analyzed polymorphism and divergence at (1) 4-fold synonymous sites, (2) all sites of introns ≤120 bp, (3) all sites of introns ≤65 bp, (4) bases 8-30 of introns ≤120 bp, and (5) bases 8-30 of introns ≤65 bp. The last class of sites shows the highest levels of both interspecific divergence and intraspecific polymorphism, suggesting that these sites are under the least selective constraint. Bases 8-30 of introns ≤65 bp also have the lowest ratio of divergence to polymorphism, which may indicate that a small proportion of substitutions in the other classes of sites are the result of adaptive evolution. Although there is little signal of selection on the primary sequence of short introns, patterns of insertion-deletion polymorphism and divergence suggest that both positive and negative selection act to maintain an optimal intron length.
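The intron site classes named in the abstract (all sites, or bases 8-30, of introns up to a length cutoff) can be illustrated with a minimal sketch. This is not the authors' pipeline. The function names `select_sites` and `dp_ratio` are hypothetical, and the sketch assumes that base positions are 1-based, inclusive, and counted from the 5' end of the intron, which the abstract does not state.

```python
def select_sites(intron, max_len, first=None, last=None):
    """Return the sites of one intron that belong to a site class.

    intron  : intron sequence as a string (assumed 5' to 3')
    max_len : length cutoff in bp (e.g. 120 or 65)
    first, last : optional 1-based inclusive positions (e.g. 8 and 30);
                  if omitted, all sites of the intron are returned.
    Introns longer than max_len contribute no sites.
    """
    if len(intron) > max_len:
        return ""
    if first is None:
        return intron
    # Convert 1-based inclusive positions to a Python slice.
    return intron[first - 1:last]


def dp_ratio(divergent_sites, polymorphic_sites):
    """Ratio of divergence to polymorphism for one site class (hypothetical helper)."""
    return divergent_sites / polymorphic_sites
```

For example, `select_sites(seq, 65, 8, 30)` would yield the 23 sites of class (5) for an intron `seq` of at most 65 bp, and `select_sites(seq, 120)` would yield all sites of class (2).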