An effective processing pipeline for harmonizing DNA methylation data from Illumina's 450K and EPIC platforms for epidemiological studies.

Affiliations

Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Publication information

BMC Res Notes. 2021 Sep 8;14(1):352. doi: 10.1186/s13104-021-05741-2.

PMID:34496950

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8424820/

Abstract

OBJECTIVE

Illumina BeadChip arrays are commonly used to generate DNA methylation data for large epidemiological studies. Updates in technology over time create challenges for data harmonization within and between studies, many of which obtained data from the older 450K and newer EPIC platforms. The pre-processing pipeline for DNA methylation data is not trivial and influences downstream analyses. Incorporating different platforms adds a new level of technical variability that has not yet been taken into account by recommended pipelines. Our study evaluated how well various tools harmonize data across platform versions at each step of the pre-processing pipeline, including quality control (QC), normalization, batch effect adjustment, and genomic inflation correction. We illustrate our novel approach using 450K and EPIC data from the Diabetes Autoimmunity Study in the Young (DAISY) prospective cohort.

RESULTS

We found that normalization and probe filtering had the largest effect on data harmonization. Employing a meta-analysis was an effective and easily executed way to account for platform variability. Correcting for genomic inflation also helped with harmonization. We present guidelines for studies seeking to harmonize data from the 450K and EPIC platforms, which include using technical replicates to evaluate the numerous pre-processing steps and employing a meta-analysis.
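To make the last two ideas concrete, the sketch below shows one common way a per-probe meta-analysis across platforms and a genomic inflation check could be written. It is not the authors' code. It assumes an inverse-variance fixed-effect model and a simple genomic-control style inflation factor, either of which may differ from the tools the study actually used. The function names and the example numbers are hypothetical.

```python
import math
from statistics import median

# Median of a chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364


def fixed_effect_meta(beta_450k, se_450k, beta_epic, se_epic):
    """Combine one probe's effect estimates from two platforms.

    Assumption: inverse-variance fixed-effect weighting. Each platform is
    analyzed separately, so platform differences never enter a single model.
    Returns (pooled beta, pooled standard error, z-score).
    """
    w_450k = 1.0 / se_450k ** 2
    w_epic = 1.0 / se_epic ** 2
    beta = (w_450k * beta_450k + w_epic * beta_epic) / (w_450k + w_epic)
    se = math.sqrt(1.0 / (w_450k + w_epic))
    return beta, se, beta / se


def genomic_inflation(z_scores):
    """Inflation factor lambda: median observed chi-square over its expected median."""
    chi2 = [z * z for z in z_scores]
    return median(chi2) / CHI2_1DF_MEDIAN


def deflate(z_scores, lam):
    """Shrink z-scores by sqrt(lambda); leave them unchanged if lambda <= 1."""
    if lam <= 1.0:
        return list(z_scores)
    scale = math.sqrt(lam)
    return [z / scale for z in z_scores]


if __name__ == "__main__":
    # Hypothetical estimates for a single probe measured on both platforms.
    beta, se, z = fixed_effect_meta(0.10, 0.05, 0.20, 0.05)
    print(f"pooled beta={beta:.3f}, se={se:.4f}, z={z:.2f}")
```

In practice the pooled z-scores from all shared probes would be passed to `genomic_inflation`, and a value well above 1 would suggest residual inflation worth correcting before interpreting the results.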

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b07e/8424820/6b76de7579c7/13104_2021_5741_Fig1_HTML.jpg
