• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生物医学数据有效批量校正的思考要点。

Thinking points for effective batch correction on biomedical data.

机构信息

Lee Kong Chian School of Medicine, Nanyang Technological University, 59 Nanyang Drive, Singapore 636921, Singapore.

School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore.

出版信息

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae515.

DOI:10.1093/bib/bbae515
PMID:39397427
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11471903/
Abstract

Batch effects introduce significant variability into high-dimensional data, complicating accurate analysis and leading to potentially misleading conclusions if not adequately addressed. Despite technological and algorithmic advancements in biomedical research, effectively managing batch effects remains a complex challenge requiring comprehensive considerations. This paper underscores the necessity of a flexible and holistic approach for selecting batch effect correction algorithms (BECAs), advocating for proper BECA evaluations and consideration of artificial intelligence-based strategies. We also discuss key challenges in batch effect correction, including the importance of uncovering hidden batch factors and understanding the impact of design imbalance, missing values, and aggressive correction. Our aim is to provide researchers with a robust framework for effective batch effects management and enhancing the reliability of high-dimensional data analyses.

摘要

批次效应会给高维数据带来显著的变异性,如果不加以充分处理,这将使数据分析变得复杂,并导致潜在的误导性结论。尽管在生物医学研究中技术和算法取得了进步,但有效地管理批次效应仍然是一个复杂的挑战,需要全面考虑。本文强调了为选择批次效应校正算法(BECAs)采用灵活和整体方法的必要性,提倡对适当的 BECA 进行评估,并考虑基于人工智能的策略。我们还讨论了批次效应校正中的关键挑战,包括揭示隐藏批次因素以及理解设计不平衡、缺失值和激进校正的影响的重要性。我们的目标是为研究人员提供一个有效的批次效应管理的强大框架,并提高高维数据分析的可靠性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/6d479191998d/bbae515f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/5600c98a4363/bbae515f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/f62c9cef8226/bbae515f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/abcf461e4b9e/bbae515f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/4e6e6a4f3acd/bbae515f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/838a606425b4/bbae515f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/898f194e6db8/bbae515f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/1660b40ecaca/bbae515f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/6d479191998d/bbae515f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/5600c98a4363/bbae515f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/f62c9cef8226/bbae515f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/abcf461e4b9e/bbae515f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/4e6e6a4f3acd/bbae515f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/838a606425b4/bbae515f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/898f194e6db8/bbae515f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/1660b40ecaca/bbae515f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfb/11471903/6d479191998d/bbae515f8.jpg

相似文献

1
Thinking points for effective batch correction on biomedical data.生物医学数据有效批量校正的思考要点。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae515.
2
Examining the practical limits of batch effect-correction algorithms: When should you care about batch effects?探究批量效应校正算法的实际限制:何时应关注批量效应?
J Genet Genomics. 2019 Sep 20;46(9):433-443. doi: 10.1016/j.jgg.2019.08.002.
3
MBECS: Microbiome Batch Effects Correction Suite.MBECS:微生物组批次效应校正套件。
BMC Bioinformatics. 2023 May 3;24(1):182. doi: 10.1186/s12859-023-05252-w.
4
Batch correction evaluation framework using a-priori gene-gene associations: applied to the GTEx dataset.基于先验基因-基因关联的批量校正评估框架:在 GTEx 数据集上的应用。
BMC Bioinformatics. 2019 May 28;20(1):268. doi: 10.1186/s12859-019-2855-9.
5
Why Batch Effects Matter in Omics Data, and How to Avoid Them.为什么组间效应在组学数据中很重要,以及如何避免它们。
Trends Biotechnol. 2017 Jun;35(6):498-507. doi: 10.1016/j.tibtech.2017.02.012. Epub 2017 Mar 25.
6
Preserving biological heterogeneity with a permuted surrogate variable analysis for genomics batch correction.通过置换替代变量分析进行基因组批次校正以保留生物异质性。
Bioinformatics. 2014 Oct;30(19):2757-63. doi: 10.1093/bioinformatics/btu375. Epub 2014 Jun 6.
7
Active Batch Selection via Convex Relaxations with Guaranteed Solution Bounds.通过凸松弛实现主动批量选择,并保证解的界。
IEEE Trans Pattern Anal Mach Intell. 2015 Oct;37(10):1945-58. doi: 10.1109/TPAMI.2015.2389848.
8
iSMNN: batch effect correction for single-cell RNA-seq data via iterative supervised mutual nearest neighbor refinement.iSMNN:通过迭代监督的互近邻修正对单细胞 RNA-seq 数据进行批次效应校正。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab122.
9
Recommended Requirements and Essential Elements for Proper Reporting of the Use of Artificial Intelligence Machine Learning Tools in Biomedical Research and Scientific Publications.推荐的人工智能机器学习工具在生物医学研究和科学出版物中使用的正确报告的要求和基本要素。
Arthroscopy. 2024 Apr;40(4):1033-1038. doi: 10.1016/j.arthro.2023.12.027. Epub 2024 Jan 12.
10
Diabetic retinopathy screening through artificial intelligence algorithms: A systematic review.基于人工智能算法的糖尿病视网膜病变筛查:系统综述。
Surv Ophthalmol. 2024 Sep-Oct;69(5):707-721. doi: 10.1016/j.survophthal.2024.05.008. Epub 2024 Jun 15.

引用本文的文献

1
Meta-analysis of genomic characteristics for antiviral influenza defective interfering particle prioritization.用于抗流感缺陷干扰颗粒优先级排序的基因组特征的荟萃分析。
NAR Genom Bioinform. 2025 Apr 4;7(2):lqaf031. doi: 10.1093/nargab/lqaf031. eCollection 2025 Jun.

本文引用的文献

1
scMaui: a widely applicable deep learning framework for single-cell multiomics integration in the presence of batch effects and missing data.scMaui:一种广泛适用的深度学习框架,可在存在批次效应和缺失数据的情况下进行单细胞多组学整合。
BMC Bioinformatics. 2024 Aug 6;25(1):257. doi: 10.1186/s12859-024-05880-w.
2
DeepComBat: A statistically motivated, hyperparameter-robust, deep learning approach to harmonization of neuroimaging data.DeepComBat:一种基于统计学的、超参数稳健的、深度学习方法,用于神经影像学数据的调和。
Hum Brain Mapp. 2024 Aug 1;45(11):e26708. doi: 10.1002/hbm.26708.
3
NDMNN: A novel deep residual network based MNN method to remove batch effects from scRNA-seq data.
NDMNN:一种基于深度残差网络的新型MNN方法,用于去除单细胞RNA测序数据中的批次效应。
J Bioinform Comput Biol. 2024 Jun;22(3):2450015. doi: 10.1142/S021972002450015X. Epub 2024 Jul 20.
4
spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics.spatiAlign:一种用于空间分辨转录组学数据集成的无监督对比学习模型。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae042.
5
scVIC: deep generative modeling of heterogeneity for scRNA-seq data.scVIC:用于scRNA-seq数据异质性的深度生成建模
Bioinform Adv. 2024 Jun 13;4(1):vbae086. doi: 10.1093/bioadv/vbae086. eCollection 2024.
6
Optimizing differential expression analysis for proteomics data via high-performing rules and ensemble inference.通过高性能规则和集成推理优化蛋白质组学数据的差异表达分析。
Nat Commun. 2024 May 9;15(1):3922. doi: 10.1038/s41467-024-47899-w.
7
scPROTEIN: a versatile deep graph contrastive learning framework for single-cell proteomics embedding.scPROTEIN:单细胞蛋白质组学嵌入的多功能深度图对比学习框架。
Nat Methods. 2024 Apr;21(4):623-634. doi: 10.1038/s41592-024-02214-9. Epub 2024 Mar 19.
8
BERMAD: batch effect removal for single-cell RNA-seq data using a multi-layer adaptation autoencoder with dual-channel framework.BERMAD:基于双通道框架的多层自适应自动编码器去除单细胞 RNA-seq 数据中的批次效应
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae127.
9
Semi-supervised integration of single-cell transcriptomics data.单细胞转录组学数据的半监督整合。
Nat Commun. 2024 Jan 29;15(1):872. doi: 10.1038/s41467-024-45240-z.
10
Batch correction of single-cell sequencing data via an autoencoder architecture.通过自动编码器架构对单细胞测序数据进行批量校正。
Bioinform Adv. 2023 Dec 28;4(1):vbad186. doi: 10.1093/bioadv/vbad186. eCollection 2024.