Schlosberg Christopher E, VanderKraats Nathan D, Edwards John R
Center for Pharmacogenomics, Department of Medicine, Washington University in St. Louis School of Medicine, St. Louis, MO, USA.
Nucleic Acids Res. 2017 May 19;45(9):5100-5111. doi: 10.1093/nar/gkx078.
Numerous genomic studies are underway to determine which genes are abnormally regulated by DNA methylation in disease. However, we have a poor understanding of how disease-specific methylation changes affect expression. We thus developed an integrative analysis tool, Methylation-based Gene Expression Classification (ME-Class), to explain specific variation in methylation that associates with expression change. This model captures the complexity of methylation changes around a gene promoter. Using 17 whole-genome bisulfite sequencing and RNA-seq datasets from different tissues from the Roadmap Epigenomics Project, ME-Class significantly outperforms standard methods using methylation to predict differential gene expression change. To demonstrate its utility, we used ME-Class to analyze 32 datasets from different hematopoietic cell types from the Blueprint Epigenome project. Expression-associated methylation changes were predominantly found when comparing cells from distantly related lineages, implying that changes in the cell's transcriptional program precede associated methylation changes. Training ME-Class on normal-tumor pairs from The Cancer Genome Atlas indicated that cancer-specific expression-associated methylation changes differ from tissue-specific changes. We further show that ME-Class can detect functionally relevant cancer-specific, expression-associated methylation changes that are reversed upon the removal of methylation. ME-Class is thus a powerful tool to identify genes that are dysregulated by DNA methylation in disease.
目前正在进行大量基因组研究,以确定哪些基因在疾病中受到DNA甲基化的异常调控。然而,我们对疾病特异性甲基化变化如何影响基因表达了解甚少。因此,我们开发了一种综合分析工具——基于甲基化的基因表达分类法(ME-Class),以解释与表达变化相关的甲基化特异性变异。该模型捕捉了基因启动子周围甲基化变化的复杂性。利用来自路线图表观基因组计划不同组织的17个全基因组亚硫酸氢盐测序和RNA测序数据集,ME-Class在使用甲基化预测差异基因表达变化方面显著优于标准方法。为了证明其效用,我们使用ME-Class分析了来自蓝图表观基因组计划不同造血细胞类型的32个数据集。在比较远亲谱系的细胞时,主要发现了与表达相关的甲基化变化,这意味着细胞转录程序的变化先于相关的甲基化变化。在来自癌症基因组图谱的正常-肿瘤对中训练ME-Class表明,癌症特异性的与表达相关的甲基化变化不同于组织特异性变化。我们进一步表明,ME-Class可以检测到功能相关的癌症特异性、与表达相关的甲基化变化,这些变化在去除甲基化后会逆转。因此,ME-Class是一种强大的工具,可用于识别在疾病中因DNA甲基化而失调的基因。