Wake Forest School of Medicine, Winston-Salem, NC, USA.
Hum Mol Genet. 2013 Dec 15;22(24):5065-74. doi: 10.1093/hmg/ddt356. Epub 2013 Jul 29.
DNA methylation is one of several epigenetic mechanisms that contribute to the regulation of gene expression; however, the extent to which methylation of CpG dinucleotides correlates with gene expression at the genome-wide level is still largely unknown. Using purified primary monocytes from subjects in a large community-based cohort (n = 1264), we characterized methylation (>485 000 CpG sites) and mRNA expression (>48K transcripts) and carried out genome-wide association analyses of 8370 expression phenotypes. We identified 11 203 potential cis-acting CpG loci whose degree of methylation was associated with gene expression (eMS) at a false discovery rate threshold of 0.001. Most of the associations were consistent in effect size and direction of effect across sex and three ethnicities. Contrary to expectation, these eMS were not predominately enriched in promoter regions, or CpG islands, but rather in the 3' UTR, gene bodies, CpG shores or 'offshore' sites, and both positive and negative correlations between methylation and expression were observed across all locations. eMS were enriched for regions predicted to be regulatory by ENCODE (Encyclopedia of DNA Elements) data in multiple cell types, particularly enhancers. One of the strongest association signals detected (P < 2.2 × 10(-308)) was a methylation probe (cg17005068) in the promoter/enhancer region of the glutathione S-transferase theta 1 gene (GSTT1, encoding the detoxification enzyme) with GSTT1 mRNA expression. Our study provides a detailed description of the epigenetic architecture in human monocytes and its relationship to gene expression. These data may help prioritize interrogation of biologically relevant methylation loci and provide new insights into the epigenetic basis of human health and diseases.
DNA 甲基化是几种表观遗传机制之一,有助于基因表达的调控;然而,在全基因组水平上,CpG 二核苷酸的甲基化与基因表达的相关性在很大程度上仍不清楚。我们使用来自大型基于社区的队列(n=1264)中的受试者的纯化原代单核细胞,对甲基化(>485000 个 CpG 位点)和 mRNA 表达(>48K 个转录物)进行了表征,并对 8370 个表达表型进行了全基因组关联分析。我们确定了 11203 个潜在的顺式作用 CpG 基因座,这些基因座的甲基化程度与基因表达(eMS)在假发现率阈值为 0.001 时相关。大多数关联在性别和三种族裔之间的效应大小和效应方向上是一致的。与预期相反,这些 eMS 并非主要富集在启动子区域或 CpG 岛,而是富集在 3'UTR、基因体、CpG 海岸或“近海”区域,以及在所有位置都观察到甲基化与表达之间的正相关和负相关。eMS 富集了多种细胞类型的 ENCODE(DNA 元件百科全书)数据预测的调控区域,特别是增强子。检测到的最强关联信号之一(P<2.2×10(-308))是谷胱甘肽 S-转移酶 theta 1 基因(编码解毒酶)启动子/增强子区域中的一个甲基化探针(cg17005068)与 GSTT1 mRNA 表达的相关性。我们的研究提供了人类单核细胞中表观遗传结构及其与基因表达关系的详细描述。这些数据可能有助于优先研究生物学上相关的甲基化基因座,并为人类健康和疾病的表观遗传基础提供新的见解。