HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA.
Genome Res. 2013 Mar;23(3):555-67. doi: 10.1101/gr.147942.112. Epub 2013 Jan 16.
As studies of DNA methylation increase in scope, it has become evident that methylation has a complex relationship with gene expression, plays an important role in defining cell types, and is disrupted in many diseases. We describe large-scale single-base resolution DNA methylation profiling on a diverse collection of 82 human cell lines and tissues using reduced representation bisulfite sequencing (RRBS). Analysis integrating RNA-seq and ChIP-seq data illuminates the functional role of this dynamic mark. Loci that are hypermethylated across cancer types are enriched for sites bound by NANOG in embryonic stem cells, which supports and expands the model of a stem/progenitor cell signature in cancer. CpGs that are hypomethylated across cancer types are concentrated in megabase-scale domains that occur near the telomeres and centromeres of chromosomes, are depleted of genes, and are enriched for cancer-specific EZH2 binding and H3K27me3 (repressive chromatin). In noncancer samples, there are cell-type specific methylation signatures preserved in primary cell lines and tissues as well as methylation differences induced by cell culture. The relationship between methylation and expression is context-dependent, and we find that CpG-rich enhancers bound by EP300 in the bodies of expressed genes are unmethylated despite the dense gene-body methylation surrounding them. Non-CpG cytosine methylation occurs in human somatic tissue, is particularly prevalent in brain tissue, and is reproducible across many individuals. This study provides an atlas of DNA methylation across diverse and well-characterized samples and enables new discoveries about DNA methylation and its role in gene regulation and disease.
随着 DNA 甲基化研究范围的扩大,很明显甲基化与基因表达有着复杂的关系,在定义细胞类型方面起着重要作用,并且在许多疾病中被打乱。我们使用简化代表性亚硫酸氢盐测序(RRBS)对 82 个人类细胞系和组织的多样化样本进行了大规模的单碱基分辨率 DNA 甲基化分析。整合 RNA-seq 和 ChIP-seq 数据的分析阐明了这种动态标记的功能作用。在癌症类型中广泛甲基化的基因座富含在胚胎干细胞中被 NANOG 结合的位点,这支持并扩展了癌症中干细胞/祖细胞特征的模型。在癌症类型中广泛低甲基化的 CpG 集中在兆碱基尺度的区域,这些区域位于染色体的端粒和着丝粒附近,基因缺失,并且富含癌症特异性 EZH2 结合和 H3K27me3(抑制性染色质)。在非癌症样本中,在原代细胞系和组织中以及在细胞培养诱导的差异中保留了细胞类型特异性的甲基化特征。甲基化与表达之间的关系是上下文相关的,我们发现尽管它们周围有密集的基因体甲基化,但在表达基因的体部被 EP300 结合的富含 CpG 的增强子是未甲基化的。非 CpG 胞嘧啶甲基化发生在人类体细胞组织中,特别是在脑组织中,并且在许多个体中具有可重复性。本研究提供了跨越多样化和特征良好的样本的 DNA 甲基化图谱,并为 DNA 甲基化及其在基因调控和疾病中的作用提供了新的发现。