Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA.
Genome Biol. 2011;12(1):R10. doi: 10.1186/gb-2011-12-1-r10. Epub 2011 Jan 20.
DNA methylation is an essential epigenetic mechanism involved in gene regulation and disease, but little is known about the mechanisms underlying inter-individual variation in methylation profiles. Here we measured methylation levels at 22,290 CpG dinucleotides in lymphoblastoid cell lines from 77 HapMap Yoruba individuals, for which genome-wide gene expression and genotype data were also available.
Association analyses of methylation levels with more than three million common single nucleotide polymorphisms (SNPs) identified 180 CpG-sites in 173 genes that were associated with nearby SNPs (putatively in cis, usually within 5 kb) at a false discovery rate of 10%. The most intriguing trans signal was obtained for SNP rs10876043 in the disco-interacting protein 2 homolog B gene (DIP2B, previously postulated to play a role in DNA methylation), that had a genome-wide significant association with the first principal component of patterns of methylation; however, we found only modest signal of trans-acting associations overall. As expected, we found significant negative correlations between promoter methylation and gene expression levels measured by RNA-sequencing across genes. Finally, there was a significant overlap of SNPs that were associated with both methylation and gene expression levels.
Our results demonstrate a strong genetic component to inter-individual variation in DNA methylation profiles. Furthermore, there was an enrichment of SNPs that affect both methylation and gene expression, providing evidence for shared mechanisms in a fraction of genes.
DNA 甲基化是一种重要的表观遗传机制,参与基因调控和疾病,但个体间甲基化谱差异的机制知之甚少。在这里,我们测量了 77 名 HapMap 约鲁巴个体的淋巴母细胞系中 22290 个 CpG 二核苷酸的甲基化水平,这些个体的全基因组基因表达和基因型数据也可获得。
对甲基化水平与超过 300 万个常见单核苷酸多态性(SNP)的关联分析,在 173 个基因中鉴定出了 180 个 CpG 位点,这些基因与附近的 SNP(假定为顺式,通常在 5kb 内)相关,假发现率为 10%。最引人注目的跨信号是 disco-interacting protein 2 同源物 B 基因(DIP2B)中的 SNP rs10876043 获得的,该基因先前被认为在 DNA 甲基化中发挥作用,与甲基化模式的第一主成分有全基因组显著关联;然而,我们发现整体上只有适度的跨信号关联。正如预期的那样,我们发现启动子甲基化与 RNA-seq 测量的基因表达水平之间存在显著的负相关。最后,与甲基化和基因表达水平都相关的 SNP 存在显著重叠。
我们的结果表明,个体间 DNA 甲基化谱差异存在很强的遗传成分。此外,影响甲基化和基因表达的 SNP 富集,为部分基因中共享机制提供了证据。