Computational Biology and Bioinformatics Group, Max Planck Institute for Molecular Biomedicine, 48149 Münster, Germany;
Genome Res. 2013 Dec;23(12):2013-29. doi: 10.1101/gr.155960.113. Epub 2013 Oct 22.
Gene expression regulation is gated by promoter methylation states modulating transcription factor binding. The known DNA methylation/unmethylation mechanisms are sequence unspecific, but different cells with the same genome have different methylomes. Thus, additional processes bringing specificity to the methylation/unmethylation mechanisms are required. Searching for such processes, we demonstrated that CpG methylation states are influenced by the sequence context surrounding the CpGs. We used such a property to develop a CpG methylation motif discovery algorithm. The newly discovered motifs reveal "methylation/unmethylation factors" that could recruit the "methylation/unmethylation machinery" to the loci specified by the motifs. Our methylation motif discovery algorithm provides a synergistic approach to the differently methylated region algorithms. Since our algorithm searches for commonly methylated regions inside the same sample, it requires only a single sample to operate. The motifs that were found discriminate between hypomethylated and hypermethylated regions. The hypomethylation-associated motifs have a high CG content, their targets appear in conserved regions near transcription start sites, they tend to co-occur within transcription factor binding sites, they are involved in breaking the H3K4me3/H3K27me3 bivalent balance, and they transit the enhancers from repressive H3K27me3 to active H3K27ac during ES cell differentiation. The new methylation motifs characterize the pluripotent state shared between ES and iPS cells. Additionally, we found a collection of motifs associated with the somatic memory inherited by the iPS from the initial fibroblast cells, thus revealing the existence of epigenetic somatic memory on a fine methylation scale.
基因表达调控受启动子甲基化状态调控,改变转录因子结合。已知的 DNA 甲基化/去甲基化机制是无序列特异性的,但具有相同基因组的不同细胞具有不同的甲基组。因此,需要将特异性引入甲基化/去甲基化机制的其他过程。在寻找这些过程时,我们证明了 CpG 甲基化状态受 CpG 周围序列环境的影响。我们利用这一特性开发了一种 CpG 甲基化基序发现算法。新发现的基序揭示了“甲基化/去甲基化因子”,可以将“甲基化/去甲基化机制”募集到基序指定的基因座。我们的甲基化基序发现算法为不同甲基化区域算法提供了一种协同方法。由于我们的算法在同一样本中搜索常见的甲基化区域,因此它只需要一个样本即可运行。发现的基序可区分低甲基化和高甲基化区域。与低甲基化相关的基序具有高 CG 含量,其靶标出现在转录起始位点附近的保守区域,它们往往在转录因子结合位点内共同出现,参与打破 H3K4me3/H3K27me3 二价平衡,并在 ES 细胞分化过程中使增强子从抑制性 H3K27me3 转变为活性 H3K27ac。新的甲基化基序描绘了 ES 和 iPS 细胞之间共享的多能状态。此外,我们发现了一组与 iPS 从初始成纤维细胞继承的体细胞记忆相关的基序,从而揭示了在精细的甲基化尺度上存在表观遗传体细胞记忆。