• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DNA甲基化微阵列数据中批次效应的校正:经验教训

Adjusting for Batch Effects in DNA Methylation Microarray Data, a Lesson Learned.

作者信息

Price E M, Robinson Wendy P

机构信息

BC Children's Hospital Research Institute, Vancouver, BC, Canada.

Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada.

出版信息

Front Genet. 2018 Mar 16;9:83. doi: 10.3389/fgene.2018.00083. eCollection 2018.

DOI:10.3389/fgene.2018.00083
PMID:29616078
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5864890/
Abstract

It is well-known, but frequently overlooked, that low- and high-throughput molecular data may contain batch effects, i.e., systematic technical variation. Confounding of experimental batches with the variable(s) of interest is especially concerning, as a batch effect may then be interpreted as a biologically significant finding. An integral step toward reducing false discovery in molecular data analysis includes inspection for batch effects and accounting for this signal if present. In a 30-sample pilot Illumina Infinium HumanMethylation450 (450k array) experiment, we identified two sources of batch effects: row and chip. Here, we demonstrate two approaches taken to process the 450k data in which an R function, , was applied to adjust for the non-biological signal. In the "initial analysis," the application of ComBat to an unbalanced study design resulted in 9,612 and 19,214 significant (FDR < 0.05) DNA methylation differences, despite none present prior to correction. Suspicious of this dramatic change, a "revised processing" included changes to our analysis as well as a greater number of samples, and successfully reduced batch effects without introducing false signal. Our work supports conclusions made by an article previously published in this journal: though the ultimate antidote to batch effects is thoughtful study design, every DNA methylation microarray analysis should inspect, assess and, if necessary, account for batch effects. The analysis experience presented here can serve as a reminder to the broader community to establish research questions , ensure that they match with study design and encourage communication between technicians and analysts.

摘要

众所周知,但经常被忽视的是,低通量和高通量分子数据可能包含批次效应,即系统性技术变异。实验批次与感兴趣的变量之间的混淆尤其令人担忧,因为批次效应可能会被解释为具有生物学意义的发现。减少分子数据分析中错误发现的一个不可或缺的步骤包括检查批次效应,并在存在这种信号时对其进行处理。在一项包含30个样本的Illumina Infinium HumanMethylation450(450k阵列)先导实验中,我们识别出了两种批次效应来源:行和芯片。在这里,我们展示了处理450k数据所采用的两种方法,其中应用了一个R函数来调整非生物学信号。在“初始分析”中,将ComBat应用于不平衡的研究设计导致了9612个和19214个显著(FDR < 0.05)的DNA甲基化差异,尽管在校正之前不存在差异。由于怀疑这种巨大变化,“修订处理”包括对我们的分析进行更改以及增加样本数量,并成功减少了批次效应而未引入错误信号。我们的工作支持了该期刊之前发表的一篇文章所得出的结论:尽管消除批次效应的最终方法是精心设计研究,但每次DNA甲基化微阵列分析都应检查、评估并在必要时处理批次效应。这里呈现的分析经验可以提醒更广泛的群体确立研究问题,确保它们与研究设计相匹配,并鼓励技术人员和分析人员之间的沟通。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e247/5864890/85ca97fb98d5/fgene-09-00083-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e247/5864890/b70c6d3d922c/fgene-09-00083-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e247/5864890/85ca97fb98d5/fgene-09-00083-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e247/5864890/b70c6d3d922c/fgene-09-00083-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e247/5864890/85ca97fb98d5/fgene-09-00083-g002.jpg

相似文献

1
Adjusting for Batch Effects in DNA Methylation Microarray Data, a Lesson Learned.DNA甲基化微阵列数据中批次效应的校正:经验教训
Front Genet. 2018 Mar 16;9:83. doi: 10.3389/fgene.2018.00083. eCollection 2018.
2
Simulating ComBat: how batch correction can lead to the systematic introduction of false positive results in DNA methylation microarray studies.模拟 ComBat:批次校正如何导致 DNA 甲基化微阵列研究中系统地引入假阳性结果。
BMC Bioinformatics. 2020 Jun 30;21(1):271. doi: 10.1186/s12859-020-03559-6.
3
Batch-effect detection, correction and characterisation in Illumina HumanMethylation450 and MethylationEPIC BeadChip array data.Illumina HumanMethylation450 和 MethylationEPIC BeadChip 阵列数据中的批次效应检测、校正和特征描述。
Clin Epigenetics. 2022 Apr 29;14(1):58. doi: 10.1186/s13148-022-01277-9.
4
Stratified randomization controls better for batch effects in 450K methylation analysis: a cautionary tale.在450K甲基化分析中,分层随机化能更好地控制批次效应:一个警示故事。
Front Genet. 2014 Oct 13;5:354. doi: 10.3389/fgene.2014.00354. eCollection 2014.
5
Considerations for normalization of DNA methylation data by Illumina 450K BeadChip assay in population studies.考虑使用 Illumina 450K BeadChip assay 在人群研究中对 DNA 甲基化数据进行标准化。
Epigenetics. 2013 Nov;8(11):1141-52. doi: 10.4161/epi.26037. Epub 2013 Aug 19.
6
An evaluation of analysis pipelines for DNA methylation profiling using the Illumina HumanMethylation450 BeadChip platform.基于 Illumina HumanMethylation450 BeadChip 平台的 DNA 甲基化分析流程评估。
Epigenetics. 2013 Mar;8(3):333-46. doi: 10.4161/epi.24008. Epub 2013 Feb 19.
7
Systematic evaluation of DNA methylation age estimation with common preprocessing methods and the Infinium MethylationEPIC BeadChip array.采用常见预处理方法和 Infinium MethylationEPIC BeadChip 阵列进行 DNA 甲基化年龄估算的系统评价。
Clin Epigenetics. 2018 Oct 16;10(1):123. doi: 10.1186/s13148-018-0556-2.
8
An evaluation of processing methods for HumanMethylation450 BeadChip data.人类甲基化450芯片数据处理方法的评估
BMC Genomics. 2016 Jun 22;17:469. doi: 10.1186/s12864-016-2819-7.
9
Identifying and correcting epigenetics measurements for systematic sources of variation.识别和校正系统变异来源的表观遗传学测量。
Clin Epigenetics. 2018 Mar 21;10:38. doi: 10.1186/s13148-018-0471-6. eCollection 2018.
10
Batch effect correction for genome-wide methylation data with Illumina Infinium platform.基于 Illumina Infinium 平台的全基因组甲基化数据的批次效应校正。
BMC Med Genomics. 2011 Dec 16;4:84. doi: 10.1186/1755-8794-4-84.

引用本文的文献

1
Placental epigenetic clocks derived from crowdsourcing: Implications for the study of accelerated aging in obstetrics.通过众包得出的胎盘表观遗传时钟:对产科加速衰老研究的启示。
iScience. 2025 Jul 23;28(8):113181. doi: 10.1016/j.isci.2025.113181. eCollection 2025 Aug 15.
2
Accounting for differences between Infinium MethylationEPIC v2 and v1 in DNA methylation-based tools.在基于DNA甲基化的工具中考虑Infinium甲基化EPIC v2和v1之间的差异。
Life Sci Alliance. 2025 Jul 8;8(9). doi: 10.26508/lsa.202403155. Print 2025 Sep.
3
ComBat-met: adjusting batch effects in DNA methylation data.

本文引用的文献

1
A manifesto for reproducible science.可重复科学宣言。
Nat Hum Behav. 2017 Jan 10;1(1):0021. doi: 10.1038/s41562-016-0021.
2
Accelerated placental aging in early onset preeclampsia pregnancies identified by DNA methylation.通过DNA甲基化鉴定早发型子痫前期妊娠中加速的胎盘老化。
Epigenomics. 2017 Mar;9(3):279-289. doi: 10.2217/epi-2016-0103. Epub 2016 Nov 29.
3
Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses.在保留组间差异的同时消除批次效应的方法可能会导致对下游分析的信心过度膨胀。
ComBat-met:调整DNA甲基化数据中的批次效应。
NAR Genom Bioinform. 2025 May 19;7(2):lqaf062. doi: 10.1093/nargab/lqaf062. eCollection 2025 Jun.
4
5G-exposed human skin cells do not respond with altered gene expression and methylation profiles.暴露于5G环境下的人体皮肤细胞在基因表达和甲基化谱方面没有出现变化。
PNAS Nexus. 2025 May 13;4(5):pgaf127. doi: 10.1093/pnasnexus/pgaf127. eCollection 2025 May.
5
Caregiver-child interaction and early childhood development among preschool children in rural China: the possible role of blood epigenome-wide DNA methylation.中国农村学龄前儿童中照顾者与儿童的互动及幼儿发展:血液全基因组DNA甲基化的潜在作用
BMC Genomics. 2025 Apr 1;26(1):329. doi: 10.1186/s12864-025-11406-2.
6
Breaking rules: the complex relationship between DNA methylation and X-chromosome inactivation in the human placenta.打破常规:人类胎盘中DNA甲基化与X染色体失活之间的复杂关系
Biol Sex Differ. 2025 Mar 4;16(1):18. doi: 10.1186/s13293-025-00696-6.
7
HarmonizR: blocking and singular feature data adjustment improve runtime efficiency and data preservation.HarmonizR:阻塞和奇异特征数据调整可提高运行时效率和数据保存能力。
BMC Bioinformatics. 2025 Feb 11;26(1):47. doi: 10.1186/s12859-025-06073-9.
8
Discrepancies in readouts between Infinium MethylationEPIC v2.0 and v1.0 reflected in DNA methylation-based tools: implications and considerations for human population epigenetic studies.基于DNA甲基化的工具所反映的Infinium甲基化EPIC v2.0和v1.0之间读数差异:对人类群体表观遗传学研究的影响与考量
bioRxiv. 2024 Sep 28:2024.07.02.600461. doi: 10.1101/2024.07.02.600461.
9
Transcriptome- and DNA methylation-based cell-type deconvolutions produce similar estimates of differential gene expression and differential methylation.基于转录组和DNA甲基化的细胞类型反卷积产生了相似的差异基因表达和差异甲基化估计值。
BioData Min. 2024 Jul 11;17(1):21. doi: 10.1186/s13040-024-00374-0.
10
The DNA methylation landscape across the TCR loci in patients with acute myeloid leukemia.急性髓系白血病患者 TCR 基因座中的 DNA 甲基化图谱。
Int Immunopharmacol. 2024 Sep 10;138:112376. doi: 10.1016/j.intimp.2024.112376. Epub 2024 Jun 24.
Biostatistics. 2016 Jan;17(1):29-39. doi: 10.1093/biostatistics/kxv027. Epub 2015 Aug 13.
4
A coherent approach for analysis of the Illumina HumanMethylation450 BeadChip improves data quality and performance in epigenome-wide association studies.一种用于分析Illumina HumanMethylation450 芯片的连贯方法可提高全表观基因组关联研究中的数据质量和性能。
Genome Biol. 2015 Feb 15;16(1):37. doi: 10.1186/s13059-015-0600-x.
5
Landscape of DNA methylation on the X chromosome reflects CpG density, functional chromatin state and X-chromosome inactivation.X染色体上的DNA甲基化格局反映了CpG密度、功能性染色质状态和X染色体失活。
Hum Mol Genet. 2015 Mar 15;24(6):1528-39. doi: 10.1093/hmg/ddu564. Epub 2014 Nov 7.
6
Stratified randomization controls better for batch effects in 450K methylation analysis: a cautionary tale.在450K甲基化分析中,分层随机化能更好地控制批次效应:一个警示故事。
Front Genet. 2014 Oct 13;5:354. doi: 10.3389/fgene.2014.00354. eCollection 2014.
7
Correlations of MTHFR 677C>T polymorphism with cardiovascular disease in patients with end-stage renal disease: a meta-analysis.终末期肾病患者中MTHFR 677C>T基因多态性与心血管疾病的相关性:一项荟萃分析
PLoS One. 2014 Jul 22;9(7):e102323. doi: 10.1371/journal.pone.0102323. eCollection 2014.
8
"Polymorphisms in folate metabolism genes as maternal risk factor for neural tube defects: an updated meta-analysis".叶酸代谢基因多态性作为神经管缺陷的母亲风险因素:一项更新的荟萃分析
Metab Brain Dis. 2015 Feb;30(1):7-24. doi: 10.1007/s11011-014-9575-7. Epub 2014 Jul 9.
9
Scientific method: statistical errors.科学方法:统计误差
Nature. 2014 Feb 13;506(7487):150-2. doi: 10.1038/506150a.
10
Recommendations for the design and analysis of epigenome-wide association studies.全基因组关联研究的设计和分析建议。
Nat Methods. 2013 Oct;10(10):949-55. doi: 10.1038/nmeth.2632.