DNA甲基化研究中细胞类型异质性校正方法的评估。

An evaluation of methods correcting for cell-type heterogeneity in DNA methylation studies.

作者信息

McGregor Kevin, Bernatsky Sasha, Colmegna Ines, Hudson Marie, Pastinen Tomi, Labbe Aurélie, Greenwood Celia M T

机构信息

McGill University, Department of Epidemiology, Biostatistics, and Occupational Health, 1020 Pine Ave. West, Montréal, H3A 1A2, QC, Canada.

Lady Davis Research Institute, Jewish General Hospital, 3755 Chemin de la Côte Sainte Catherine, Montréal, H3T 1E2, QC, Canada.

出版信息

Genome Biol. 2016 May 3;17:84. doi: 10.1186/s13059-016-0935-y.

DOI:10.1186/s13059-016-0935-y

PMID:27142380

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4855979/

Abstract

BACKGROUND

Many different methods exist to adjust for variability in cell-type mixture proportions when analyzing DNA methylation studies. Here we present the result of an extensive simulation study, built on cell-separated DNA methylation profiles from Illumina Infinium 450K methylation data, to compare the performance of eight methods including the most commonly used approaches.

RESULTS

We designed a rich multi-layered simulation containing a set of probes with true associations with either binary or continuous phenotypes, confounding by cell type, variability in means and standard deviations for population parameters, additional variability at the level of an individual cell-type-specific sample, and variability in the mixture proportions across samples. Performance varied quite substantially across methods and simulations. In particular, the number of false positives was sometimes unrealistically high, indicating limited ability to discriminate the true signals from those appearing significant through confounding. Methods that filtered probes had consequently poor power. QQ plots of p values across all tested probes showed that adjustments did not always improve the distribution. The same methods were used to examine associations between smoking and methylation data from a case-control study of colorectal cancer, and we also explored the effect of cell-type adjustments on associations between rheumatoid arthritis cases and controls.

CONCLUSIONS

We recommend surrogate variable analysis for cell-type mixture adjustment since performance was stable under all our simulated scenarios.

摘要

背景

在分析DNA甲基化研究时，存在许多不同的方法来调整细胞类型混合比例的变异性。在此，我们展示了一项广泛模拟研究的结果，该研究基于来自Illumina Infinium 450K甲基化数据的细胞分离DNA甲基化谱，以比较包括最常用方法在内的八种方法的性能。

结果

我们设计了一个丰富的多层模拟，包含一组与二元或连续表型具有真实关联的探针，受到细胞类型的混杂影响、群体参数均值和标准差的变异性、个体细胞类型特异性样本水平的额外变异性以及样本间混合比例的变异性。不同方法和模拟的性能差异相当大。特别是，假阳性的数量有时高得离谱，表明从因混杂而显得显著的信号中区分真实信号的能力有限。因此，过滤探针的方法功效较差。所有测试探针的p值QQ图表明，调整并不总是能改善分布。我们使用相同的方法来检查一项结直肠癌病例对照研究中吸烟与甲基化数据之间的关联，并且我们还探讨了细胞类型调整对类风湿性关节炎病例与对照之间关联的影响。

结论

我们推荐使用替代变量分析进行细胞类型混合调整，因为在我们所有模拟场景下其性能都很稳定。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baec/4855979/a183d1862db7/13059_2016_935_Fig1_HTML.jpg

相似文献

An evaluation of methods correcting for cell-type heterogeneity in DNA methylation studies.

Genome Biol. 2016 May 3;17:84. doi: 10.1186/s13059-016-0935-y.

The correlation of methylation levels measured using Illumina 450K and EPIC BeadChips in blood samples.

Epigenomics. 2017 Nov;9(11):1363-1371. doi: 10.2217/epi-2017-0078. Epub 2017 Aug 15.

Improving cell mixture deconvolution by identifying optimal DNA methylation libraries (IDOL).

BMC Bioinformatics. 2016 Mar 8;17:120. doi: 10.1186/s12859-016-0943-7.

Guidelines for cell-type heterogeneity quantification based on a comparative analysis of reference-free DNA methylation deconvolution software.

BMC Bioinformatics. 2020 Jan 13;21(1):16. doi: 10.1186/s12859-019-3307-2.

An evaluation of statistical methods for DNA methylation microarray data analysis.

BMC Bioinformatics. 2015 Jul 10;16:217. doi: 10.1186/s12859-015-0641-x.

High-specificity bioinformatics framework for epigenomic profiling of discordant twins reveals specific and shared markers for ACPA and ACPA-positive rheumatoid arthritis.

Genome Med. 2016 Nov 22;8(1):124. doi: 10.1186/s13073-016-0374-0.

Evaluation of the Infinium Methylation 450K technology.

Epigenomics. 2011 Dec;3(6):771-84. doi: 10.2217/epi.11.105.

Cell-type deconvolution from DNA methylation: a review of recent applications.

Hum Mol Genet. 2017 Oct 1;26(R2):R216-R224. doi: 10.1093/hmg/ddx275.

A comparison of cluster analysis methods using DNA methylation data.

Bioinformatics. 2004 Aug 12;20(12):1896-904. doi: 10.1093/bioinformatics/bth176. Epub 2004 Mar 25.

Differential CpG DNA methylation in peripheral naïve CD4 T-cells in early rheumatoid arthritis patients.

Clin Epigenetics. 2020 Apr 7;12(1):54. doi: 10.1186/s13148-020-00837-1.

引用本文的文献

A Data-Driven Epigenetic Characterization of Morning Fatigue Severity in Oncology Patients Receiving Chemotherapy: Associations With Epigenetic Age Acceleration, Blood Cell Types, and Expression-Associated Methylation.

Cancer Med. 2025 Aug;14(15):e71067. doi: 10.1002/cam4.71067.

Detection of cell-type-specific differentially methylated regions in epigenome-wide association studies.

Bioinformatics. 2025 Jul 1;41(Supplement_1):i502-i512. doi: 10.1093/bioinformatics/btaf243.

A review of the use of tumour DNA methylation for breast cancer subtyping and prediction of outcomes.

Clin Epigenetics. 2025 Jul 2;17(1):109. doi: 10.1186/s13148-025-01922-z.

Development and validation of a novel cell type estimation method for targeted bisulfite sequencing data.

Epigenomics. 2025 Apr;17(6):389-396. doi: 10.1080/17501911.2025.2479423. Epub 2025 Mar 18.

Fast matrix completion in epigenetic methylation studies with informative covariates.

Biostatistics. 2024 Oct 1;25(4):1062-1078. doi: 10.1093/biostatistics/kxae016.

Brain cell-type shifts in Alzheimer's disease, autism, and schizophrenia interrogated using methylomics and genetics.

Sci Adv. 2024 May 24;10(21):eadn7655. doi: 10.1126/sciadv.adn7655. Epub 2024 May 23.

DNA Methylation Changes in Blood Cells of Fibromyalgia and Chronic Fatigue Syndrome Patients.

J Pain Res. 2023 Nov 30;16:4025-4036. doi: 10.2147/JPR.S439412. eCollection 2023.

Peripheral blood DNA methylation and neuroanatomical responses to HDACi treatment that rescues neurological deficits in a Kabuki syndrome mouse model.

Clin Epigenetics. 2023 Oct 27;15(1):172. doi: 10.1186/s13148-023-01582-x.

Integrative genomic analyses in adipocytes implicate DNA methylation in human obesity and diabetes.

Nat Commun. 2023 May 15;14(1):2784. doi: 10.1038/s41467-023-38439-z.

Epigenetic Regulation of Inflammatory Mechanisms and a Psychological Symptom Cluster in Patients Receiving Chemotherapy.

Nurs Res. 2023;72(3):200-210. doi: 10.1097/NNR.0000000000000643. Epub 2023 Mar 17.

本文引用的文献

Adjusting for Cell Type Composition in DNA Methylation Data Using a Regression-Based Approach.

Methods Mol Biol. 2017;1589:99-106. doi: 10.1007/7651_2015_262.

Independent genomewide screens identify the tumor suppressor VTRNA2-1 as a human epiallele responsive to periconceptional environment.

Genome Biol. 2015 Jun 11;16(1):118. doi: 10.1186/s13059-015-0660-y.

Cell-composition effects in the analysis of DNA methylation array data: a mathematical perspective.

BMC Bioinformatics. 2015 Mar 21;16:95. doi: 10.1186/s12859-015-0527-y.

Functional normalization of 450k methylation array data improves replication in large cancer studies.

Genome Biol. 2014 Dec 3;15(12):503. doi: 10.1186/s13059-014-0503-2.

Cigarette smoking reduces DNA methylation levels at multiple genomic loci but the effect is partially reversible upon cessation.

Epigenetics. 2014 Oct;9(10):1382-96. doi: 10.4161/15592294.2014.969637.

Grasping nettles: cellular heterogeneity and other confounders in epigenome-wide association studies.

Hum Mol Genet. 2014 Sep 15;23(R1):R83-8. doi: 10.1093/hmg/ddu284. Epub 2014 Jun 13.

An assessment of computational methods for estimating purity and clonality using genomic data derived from heterogeneous tumor tissue samples.

Brief Bioinform. 2015 Mar;16(2):232-41. doi: 10.1093/bib/bbu002. Epub 2014 Feb 20.

Accounting for cellular heterogeneity is critical in epigenome-wide association studies.

Genome Biol. 2014 Feb 4;15(2):R31. doi: 10.1186/gb-2014-15-2-r31.

Epigenome-wide association studies without the need for cell-type composition.

Nat Methods. 2014 Mar;11(3):309-11. doi: 10.1038/nmeth.2815. Epub 2014 Jan 26.

Reference-free cell mixture adjustments in analysis of DNA methylation data.

Bioinformatics. 2014 May 15;30(10):1431-9. doi: 10.1093/bioinformatics/btu029. Epub 2014 Jan 21.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

DNA甲基化研究中细胞类型异质性校正方法的评估。

An evaluation of methods correcting for cell-type heterogeneity in DNA methylation studies.

作者信息

McGregor Kevin, Bernatsky Sasha, Colmegna Ines, Hudson Marie, Pastinen Tomi, Labbe Aurélie, Greenwood Celia M T

机构信息

McGill University, Department of Epidemiology, Biostatistics, and Occupational Health, 1020 Pine Ave. West, Montréal, H3A 1A2, QC, Canada.

Lady Davis Research Institute, Jewish General Hospital, 3755 Chemin de la Côte Sainte Catherine, Montréal, H3T 1E2, QC, Canada.