School of Biology; Georgia Institute of Technology; Atlanta, GA USA.
Epigenetics. 2014 Apr;9(4):483-91. doi: 10.4161/epi.27654. Epub 2014 Jan 13.
CpG islands (CGIs) are commonly used as genomic markers to study the patterns and regulatory consequences of DNA methylation. Interestingly, recent studies reveal a substantial diversity among CGIs: long and short CGIs, for example, exhibit contrasting patterns of gene expression complexity and nucleosome occupancy. Evolutionary origins of CGIs are also highly heterogeneous. In order to systematically evaluate potential diversities among CGIs and ultimately to illuminate the link between diversity of CGIs and their epigenetic variation, we analyzed the nucleotide-resolution DNA methylation maps (methylomes) of multiple cellular origins. We discover novel 'clusters' of CGIs according to their patterns of DNA methylation; the stably hypomethylated CGI cluster (cluster I), sperm-hypomethylated CGI cluster (cluster II), and variably methylated CGI cluster (cluster III). These epigenomic CGI clusters are strikingly distinct at multiple biological features including genomic, evolutionary, and functional characteristics. At the genomic level, the stably hypomethylated CGI cluster tends to be longer and harbors many more CpG dinucleotides than those in other clusters. They are also frequently associated with promoters, while CGI clusters II and III mostly reside in intragenic or intergenic regions and exhibit highly tissue-specific DNA methylation. Functional ontology terms and transcriptional profiles co-vary with CGI clusters, indicating that the regulatory functions of CGIs are tightly linked to their heterogeneity. Finally, CGIs associated with distinctive biological processes, such as diseases, aging, and imprinting, occur disproportionately across CGI clusters. These new findings provide an effective means to combine existing knowledge on CGIs into a genomic context while bringing new insights that elucidate the significance of DNA methylation across different biological conditions and demography.
CpG 岛(CGIs)通常被用作研究 DNA 甲基化模式和调控后果的基因组标记。有趣的是,最近的研究揭示了 CGIs 的显著多样性:例如,长 CGIs 和短 CGIs 表现出截然不同的基因表达复杂程度和核小体占有率模式。CGIs 的进化起源也高度异质。为了系统评估 CGIs 之间的潜在多样性,并最终阐明 CGIs 多样性与其表观遗传变异之间的联系,我们分析了多个细胞来源的核苷酸分辨率 DNA 甲基化图谱(甲基组)。我们根据 DNA 甲基化模式发现了新型 CGIs“簇”;稳定低甲基化 CGI 簇(簇 I)、精子低甲基化 CGI 簇(簇 II)和可变甲基化 CGI 簇(簇 III)。这些表观基因组 CGIs 簇在多个生物学特征上存在显著差异,包括基因组、进化和功能特征。在基因组水平上,稳定低甲基化 CGI 簇往往更长,并且比其他簇含有更多的 CpG 二核苷酸。它们也经常与启动子相关联,而 CGI 簇 II 和 III 主要位于基因内或基因间区域,表现出高度组织特异性的 DNA 甲基化。功能本体论术语和转录谱与 CGI 簇共变,表明 CGIs 的调控功能与其异质性紧密相关。最后,与特定生物学过程(如疾病、衰老和印记)相关的 CGIs 在 CGI 簇中不成比例地发生。这些新发现为将现有 CGIs 知识结合到基因组背景中提供了有效手段,同时阐明了 DNA 甲基化在不同生物学条件和人口统计学中的意义。