Jiao Yinming, Widschwendter Martin, Teschendorff Andrew E
Computational Systems Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China, Department of Women's Cancer, UCL Elizabeth Garrett Anderson Institute for Women's Health and Statistical Genomics Group, Paul O'Gorman Building, UCL Cancer Institute, University College London, London WC1E 6BT, UK.
Computational Systems Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China, Department of Women's Cancer, UCL Elizabeth Garrett Anderson Institute for Women's Health and Statistical Genomics Group, Paul O'Gorman Building, UCL Cancer Institute, University College London, London WC1E 6BT, UKComputational Systems Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China, Department of Women's Cancer, UCL Elizabeth Garrett Anderson Institute for Women's Health and Statistical Genomics Group, Paul O'Gorman Building, UCL Cancer Institute, University College London, London WC1E 6BT, UK.
Bioinformatics. 2014 Aug 15;30(16):2360-6. doi: 10.1093/bioinformatics/btu316. Epub 2014 May 2.
There is a growing number of studies generating matched Illumina Infinium HumanMethylation450 and gene expression data, yet there is a corresponding shortage of statistical tools aimed at their integrative analysis. Such integrative tools are important for the discovery of epigenetically regulated gene modules or molecular pathways, which play key roles in cellular differentiation and disease.
Here, we present a novel functional supervised algorithm, called Functional Epigenetic Modules (FEM), for the integrative analysis of Infinium 450k DNA methylation and matched or unmatched gene expression data. The algorithm identifies gene modules of coordinated differential methylation and differential expression in the context of a human interactome. We validate the FEM algorithm on simulated and real data, demonstrating how it successfully retrieves an epigenetically deregulated gene, previously known to drive endometrial cancer development. Importantly, in the same cancer, FEM identified a novel epigenetically deregulated hotspot, directly upstream of the well-known progesterone receptor tumour suppressor pathway. In the context of cellular differentiation, FEM successfully identifies known endothelial cell subtype-specific gene expression markers, as well as a novel gene module whose overexpression in blood endothelial cells is mediated by DNA hypomethylation. The systems-level integrative framework presented here could be used to identify novel key genes or signalling pathways, which drive cellular differentiation or disease through an underlying epigenetic mechanism.
FEM is freely available as an R-package from http://sourceforge.net/projects/funepimod.
越来越多的研究生成了匹配的Illumina Infinium HumanMethylation450和基因表达数据,但相应地缺乏旨在进行综合分析的统计工具。这种综合工具对于发现表观遗传调控的基因模块或分子途径很重要,这些模块或途径在细胞分化和疾病中起关键作用。
在此,我们提出了一种名为功能表观遗传模块(FEM)的新型功能监督算法,用于对Infinium 450k DNA甲基化和匹配或不匹配的基因表达数据进行综合分析。该算法在人类相互作用组的背景下识别出协调的差异甲基化和差异表达的基因模块。我们在模拟数据和真实数据上验证了FEM算法,展示了它如何成功检索出一个先前已知驱动子宫内膜癌发展的表观遗传失调基因。重要的是,在同一种癌症中,FEM识别出一个新的表观遗传失调热点,位于著名的孕激素受体肿瘤抑制途径的直接上游。在细胞分化的背景下,FEM成功识别出已知的内皮细胞亚型特异性基因表达标志物,以及一个新的基因模块,其在血液内皮细胞中的过表达是由DNA低甲基化介导的。这里提出的系统水平综合框架可用于识别通过潜在的表观遗传机制驱动细胞分化或疾病的新的关键基因或信号通路。