Physical and Computational Genomics Division, Genomics Research Center, Academia Sinica, Taipei, Taiwan.
Mol Biol Evol. 2014 Feb;31(2):387-96. doi: 10.1093/molbev/mst208. Epub 2013 Oct 24.
DNA methylation at CpG dinucleotides can significantly increase the rate of cytosine-to-thymine mutations and the level of sequence divergence. Although the correlations between DNA methylation and genomic sequence evolution have been widely studied, an unaddressed yet fundamental question is how DNA methylation is associated with the conservation of individual nucleotides in different sequence contexts. Here, we demonstrate that in mammalian exons, the correlations between DNA methylation and the conservation of individual nucleotides are dependent on the type of exonic sequence (coding or untranslated), the degeneracy of coding nucleotides, background selection pressure, and the relative position (first or nonfirst exon in the transcript) where the nucleotides are located. For untranslated and nonzero-fold degenerate nucleotides, methylated sites are less conserved than unmethylated sites regardless of background selection pressure and the relative position of the exon. For zero-fold degenerate (or nondegenerate) nucleotides, however, the reverse trend is observed in nonfirst coding exons and first coding exons that are under stringent background selection pressure. Furthermore, cytosine-to-thymine mutations at methylated zero-fold degenerate nucleotides are predicted to be more detrimental than those that occur at unmethylated nucleotides. As zero-fold and nonzero-fold degenerate nucleotides are very close to each other, our results suggest that the "functional resolution" of DNA methylation may be finer than previously recognized. In addition, the positive correlation between CpG methylation and the level of conservation at zero-fold degenerate nucleotides implies that CpG methylation may serve as an "indicator" of functional importance of these nucleotides.
CpG 二核苷酸中的 DNA 甲基化可以显著增加胞嘧啶向胸腺嘧啶突变的速率和序列差异水平。尽管 DNA 甲基化与基因组序列进化之间的相关性已被广泛研究,但一个尚未解决的基本问题是 DNA 甲基化如何与不同序列背景下单个核苷酸的保守性相关。在这里,我们证明在哺乳动物外显子中,DNA 甲基化与单个核苷酸保守性之间的相关性取决于外显子序列的类型(编码或非翻译)、编码核苷酸的简并度、背景选择压力以及核苷酸所处的相对位置(转录本中的第一个或非第一个外显子)。对于非翻译和非零倍简并核苷酸,无论背景选择压力和外显子的相对位置如何,甲基化位点的保守性都不如未甲基化位点。然而,对于零倍简并(或非简并)核苷酸,在严格的背景选择压力下,非第一个编码外显子和第一个编码外显子中观察到相反的趋势。此外,预测在甲基化零倍简并核苷酸上发生的胞嘧啶向胸腺嘧啶突变比在未甲基化核苷酸上发生的更具危害性。由于零倍和非零倍简并核苷酸非常接近,我们的结果表明 DNA 甲基化的“功能分辨率”可能比以前认识的更精细。此外,CpG 甲基化与零倍简并核苷酸的保守水平之间的正相关关系意味着 CpG 甲基化可能作为这些核苷酸功能重要性的“指标”。