使用基于小波的功能混合模型识别差异甲基化位点。

Identification of differentially methylated loci using wavelet-based functional mixed models.

作者信息

Lee Wonyul, Morris Jeffrey S

机构信息

Department of Biostatistics, The University of Texas M.D. Anderson Cancer Center, Houston, TX, USA.

出版信息

Bioinformatics. 2016 Mar 1;32(5):664-72. doi: 10.1093/bioinformatics/btv659. Epub 2015 Nov 11.

DOI:10.1093/bioinformatics/btv659

PMID:26559505

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4907398/

Abstract

MOTIVATION

DNA methylation is a key epigenetic modification that can modulate gene expression. Over the past decade, a lot of studies have focused on profiling DNA methylation and investigating its alterations in complex diseases such as cancer. While early studies were mostly restricted to CpG islands or promoter regions, recent findings indicate that many of important DNA methylation changes can occur in other regions and DNA methylation needs to be examined on a genome-wide scale. In this article, we apply the wavelet-based functional mixed model methodology to analyze the high-throughput methylation data for identifying differentially methylated loci across the genome. Contrary to many commonly-used methods that model probes independently, this framework accommodates spatial correlations across the genome through basis function modeling as well as correlations between samples through functional random effects, which allows it to be applied to many different settings and potentially leads to more power in detection of differential methylation.

RESULTS

We applied this framework to three different high-dimensional methylation data sets (CpG Shore data, THREE data and NIH Roadmap Epigenomics data), studied previously in other works. A simulation study based on CpG Shore data suggested that in terms of detection of differentially methylated loci, this modeling approach using wavelets outperforms analogous approaches modeling the loci as independent. For the THREE data, the method suggests newly detected regions of differential methylation, which were not reported in the original study.

AVAILABILITY AND IMPLEMENTATION

Automated software called WFMM is available at https://biostatistics.mdanderson.org/SoftwareDownload CpG Shore data is available at http://rafalab.dfci.harvard.edu NIH Roadmap Epigenomics data is available at http://compbio.mit.edu/roadmap

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

CONTACT

jefmorris@mdanderson.org.

摘要

动机

DNA甲基化是一种关键的表观遗传修饰，可调节基因表达。在过去十年中，许多研究聚焦于DNA甲基化谱分析，并研究其在癌症等复杂疾病中的变化。早期研究大多局限于CpG岛或启动子区域，但最近的研究结果表明，许多重要的DNA甲基化变化可能发生在其他区域，因此需要在全基因组范围内对DNA甲基化进行检测。在本文中，我们应用基于小波的功能混合模型方法来分析高通量甲基化数据，以识别全基因组中差异甲基化位点。与许多独立对探针进行建模的常用方法不同，该框架通过基函数建模来适应全基因组的空间相关性，并通过功能随机效应来适应样本间的相关性，这使得它能够应用于许多不同的情况，并有可能在检测差异甲基化方面具有更强的能力。

结果

我们将此框架应用于先前在其他研究中使用过的三个不同的高维甲基化数据集（CpG海岸数据、THREE数据和美国国立卫生研究院表观基因组学路线图数据）。基于CpG海岸数据的模拟研究表明，就差异甲基化位点的检测而言，这种使用小波的建模方法优于将位点独立建模的类似方法。对于THREE数据，该方法显示了新检测到的差异甲基化区域，这些区域在原始研究中并未报道。

可用性与实现

名为WFMM的自动化软件可在https://biostatistics.mdanderson.org/SoftwareDownload获取。CpG海岸数据可在http://rafalab.dfci.harvard.edu获取。美国国立卫生研究院表观基因组学路线图数据可在http://compbio.mit.edu/roadmap获取。

补充信息

补充数据可在《生物信息学》在线版获取。

联系方式

jefmorris@mdanderson.org

相似文献

Identification of differentially methylated loci using wavelet-based functional mixed models.使用基于小波的功能混合模型识别差异甲基化位点。

Bioinformatics. 2016 Mar 1;32(5):664-72. doi: 10.1093/bioinformatics/btv659. Epub 2015 Nov 11.

Genome-wide DNA methylation analysis reveals loci that distinguish different types of adipose tissue in obese individuals.全基因组DNA甲基化分析揭示了区分肥胖个体不同类型脂肪组织的基因座。

Clin Epigenetics. 2017 May 3;9:48. doi: 10.1186/s13148-017-0344-4. eCollection 2017.

pETM: a penalized Exponential Tilt Model for analysis of correlated high-dimensional DNA methylation data.pETM：一种用于分析相关高维DNA甲基化数据的惩罚指数倾斜模型。

Bioinformatics. 2017 Jun 15;33(12):1765-1772. doi: 10.1093/bioinformatics/btx064.

Global analysis of methylation profiles from high resolution CpG data.来自高分辨率CpG数据的甲基化图谱的全局分析。

Genet Epidemiol. 2015 Feb;39(2):53-64. doi: 10.1002/gepi.21874. Epub 2014 Dec 23.

Epigenomic profiling of DNA methylation in paired prostate cancer versus adjacent benign tissue.配对的前列腺癌组织与相邻良性组织中DNA甲基化的表观基因组分析。

Prostate. 2015 Dec;75(16):1941-50. doi: 10.1002/pros.23093. Epub 2015 Sep 18.

Higher order methylation features for clustering and prediction in epigenomic studies.用于表观基因组学研究中聚类和预测的高阶甲基化特征

Bioinformatics. 2016 Sep 1;32(17):i405-i412. doi: 10.1093/bioinformatics/btw432.

Genome-wide DNA methylation profiling shows a distinct epigenetic signature associated with lung macrophages in cystic fibrosis.全基因组 DNA 甲基化分析显示，囊性纤维化肺巨噬细胞存在独特的表观遗传特征。

Clin Epigenetics. 2018 Dec 10;10(1):152. doi: 10.1186/s13148-018-0580-2.

coMethDMR: accurate identification of co-methylated and differentially methylated regions in epigenome-wide association studies with continuous phenotypes.coMethDMR：用于全基因组关联研究中具有连续表型的共甲基化和差异甲基化区域的准确识别。

Nucleic Acids Res. 2019 Sep 26;47(17):e98. doi: 10.1093/nar/gkz590.

Methyl-Analyzer--whole genome DNA methylation profiling.甲基化分析器——全基因组 DNA 甲基化分析。

Bioinformatics. 2011 Aug 15;27(16):2296-7. doi: 10.1093/bioinformatics/btr356. Epub 2011 Jun 17.

Detecting multiple differentially methylated CpG sites and regions related to dimensional psychopathology in youths.检测与青少年多维精神病理学相关的多个差异甲基化 CpG 位点和区域。

Clin Epigenetics. 2019 Oct 21;11(1):146. doi: 10.1186/s13148-019-0740-z.

引用本文的文献

fSuSiE enables fine-mapping of QTLs from genome-scale molecular profiles.fSuSiE能够对来自基因组规模分子图谱的数量性状基因座进行精细定位。

bioRxiv. 2025 Aug 17:2025.08.17.670732. doi: 10.1101/2025.08.17.670732.

Identifying Differential Methylation in Cancer Epigenetics via a Bayesian Functional Regression Model.通过贝叶斯功能回归模型识别癌症表观遗传学中的差异甲基化。

Biomolecules. 2024 May 29;14(6):639. doi: 10.3390/biom14060639.

Uncovering Alterations in Cancer Epigenetics via Trans-Dimensional Markov Chain Monte Carlo and Hidden Markov Models.通过跨维度马尔可夫链蒙特卡罗和隐马尔可夫模型揭示癌症表观遗传学的改变。

bioRxiv. 2023 Jun 15:2023.06.15.545168. doi: 10.1101/2023.06.15.545168.

FUNCTION-ON-FUNCTION REGRESSION FOR THE IDENTIFICATION OF EPIGENETIC REGIONS EXHIBITING WINDOWS OF SUSCEPTIBILITY TO ENVIRONMENTAL EXPOSURES.用于识别对环境暴露具有易感性窗口的表观遗传区域的函数对函数回归法

Ann Appl Stat. 2021 Sep;15(3):1366-1385. doi: 10.1214/20-aoas1425. Epub 2021 Sep 23.

Ten Years of EWAS.十年的 EWAS 研究。

Adv Sci (Weinh). 2021 Oct;8(20):e2100727. doi: 10.1002/advs.202100727. Epub 2021 Aug 11.

Wavelet Screening identifies regions highly enriched for differentially methylated loci for orofacial clefts.小波筛选可识别出与口腔颌面部裂隙相关的差异甲基化位点高度富集的区域。

NAR Genom Bioinform. 2021 May 3;3(2):lqab035. doi: 10.1093/nargab/lqab035. eCollection 2021 Jun.

A fast wavelet-based functional association analysis replicates several susceptibility loci for birth weight in a Norwegian population.基于快速小波变换的功能关联分析在挪威人群中重复验证了多个出生体重易感性基因座。

BMC Genomics. 2021 May 2;22(1):321. doi: 10.1186/s12864-021-07582-6.

Detecting differentially methylated regions using a fast wavelet-based approach to functional association analysis.使用快速基于小波的功能关联分析方法检测差异甲基化区域。

BMC Bioinformatics. 2021 Feb 10;22(1):61. doi: 10.1186/s12859-021-03979-y.

A Unified Analysis of Structured Sonar-terrain Data using Bayesian Functional Mixed Models.使用贝叶斯函数混合模型对结构化声纳地形数据进行统一分析。

Technometrics. 2018;60(1):112-123. doi: 10.1080/00401706.2016.1274681. Epub 2017 May 25.

Detection and accurate false discovery rate control of differentially methylated regions from whole genome bisulfite sequencing.从全基因组 bisulfite 测序中检测和准确控制差异甲基化区域。

Biostatistics. 2019 Jul 1;20(3):367-383. doi: 10.1093/biostatistics/kxy007.

本文引用的文献

WAVELET-BASED GENETIC ASSOCIATION ANALYSIS OF FUNCTIONAL PHENOTYPES ARISING FROM HIGH-THROUGHPUT SEQUENCING ASSAYS.基于小波的高通量测序分析产生的功能表型的遗传关联分析

Ann Appl Stat. 2015;9(2):655-686. doi: 10.1214/14-AOAS776.

Bayesian function-on-function regression for multilevel functional data.用于多级函数型数据的贝叶斯函数对函数回归

Biometrics. 2015 Sep;71(3):563-74. doi: 10.1111/biom.12299. Epub 2015 Mar 18.

Integrative analysis of 111 reference human epigenomes.111 个人类参考基因组的综合分析。

Nature. 2015 Feb 19;518(7539):317-30. doi: 10.1038/nature14248.

A wavelet-based method to exploit epigenomic language in the regulatory region.基于小波的方法来挖掘调控区的表观基因组语言。

Bioinformatics. 2014 Apr 1;30(7):908-14. doi: 10.1093/bioinformatics/btt467. Epub 2013 Oct 4.

A Study of Mexican Free-Tailed Bat Chirp Syllables: Bayesian Functional Mixed Models for Nonstationary Acoustic Time Series.墨西哥无尾蝙蝠鸣叫音节的研究：非平稳声学时间序列的贝叶斯功能混合模型

J Am Stat Assoc. 2013 Jun 1;108(502):514-526. doi: 10.1080/01621459.2013.793118.

Review of processing and analysis methods for DNA methylation array data.DNA 甲基化芯片数据的处理和分析方法综述。

Br J Cancer. 2013 Sep 17;109(6):1394-402. doi: 10.1038/bjc.2013.496. Epub 2013 Aug 27.

WaveSeq: a novel data-driven method of detecting histone modification enrichments using wavelets.WaveSeq：一种使用小波检测组蛋白修饰富集的新型数据驱动方法。

PLoS One. 2012;7(9):e45486. doi: 10.1371/journal.pone.0045486. Epub 2012 Sep 28.

Complete pipeline for Infinium(®) Human Methylation 450K BeadChip data processing using subset quantile normalization for accurate DNA methylation estimation.采用子集分位数归一化进行 Infinium(®) Human Methylation 450K BeadChip 数据处理的完整流水线，以实现准确的 DNA 甲基化估计。

Epigenomics. 2012 Jun;4(3):325-41. doi: 10.2217/epi.12.21.

Robust classification of functional and quantitative image data using functional mixed models.使用功能混合模型对功能和定量图像数据进行稳健分类。

Biometrics. 2012 Dec;68(4):1260-8. doi: 10.1111/j.1541-0420.2012.01765.x. Epub 2012 Jun 6.

CpGassoc: an R function for analysis of DNA methylation microarray data.CpGassoc：用于分析 DNA 甲基化微阵列数据的 R 函数。

Bioinformatics. 2012 May 1;28(9):1280-1. doi: 10.1093/bioinformatics/bts124. Epub 2012 Mar 25.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验