Department of Vertebrate Genomics, Max-Planck-Institute for Molecular Genetics, D-14195 Berlin, Germany.
Genome Res. 2010 Oct;20(10):1441-50. doi: 10.1101/gr.110114.110. Epub 2010 Aug 27.
The generation of genome-wide data derived from methylated DNA immunoprecipitation followed by sequencing (MeDIP-seq) has become a major tool for epigenetic studies in health and disease. The computational analysis of such data, however, still falls short on accuracy, sensitivity, and speed. We propose a time-efficient statistical method that is able to cope with the inherent complexity of MeDIP-seq data with similar performance compared with existing methods. In order to demonstrate the computational approach, we have analyzed alterations in DNA methylation during the differentiation of human embryonic stem cells (hESCs) to definitive endoderm. We show improved correlation of normalized MeDIP-seq data in comparison to available whole-genome bisulfite sequencing data, and investigated the effect of differential methylation on gene expression. Furthermore, we analyzed the interplay between DNA methylation, histone modifications, and transcription factor binding and show that in contrast to de novo methylation, demethylation is mainly associated with regions of low CpG densities.
基于甲基化 DNA 免疫沉淀测序(MeDIP-seq)生成的全基因组数据已成为健康和疾病领域中表观遗传学研究的主要工具。然而,此类数据的计算分析在准确性、灵敏度和速度方面仍存在不足。我们提出了一种高效的统计方法,能够以与现有方法相当的性能来应对 MeDIP-seq 数据固有的复杂性。为了验证该计算方法,我们分析了人胚胎干细胞(hESC)向确定内胚层分化过程中 DNA 甲基化的改变。与现有的全基因组亚硫酸氢盐测序数据相比,我们展示了归一化 MeDIP-seq 数据相关性的提高,并研究了差异甲基化对基因表达的影响。此外,我们分析了 DNA 甲基化、组蛋白修饰和转录因子结合之间的相互作用,并表明与从头甲基化相反,去甲基化主要与低 CpG 密度区域相关。