Are batch effects still relevant in the age of big data?

Authors

Goh Wilson Wen Bin, Yong Chern Han, Wong Limsoon

Affiliations

Lee Kong Chian School of Medicine, Nanyang Technological University, 636921, Singapore; School of Biological Science, Nanyang Technological University, 637551, Singapore.

Department of Computer Science, National University of Singapore, 117417, Singapore.

Publication details

Trends Biotechnol. 2022 Sep;40(9):1029-1040. doi: 10.1016/j.tibtech.2022.02.005. Epub 2022 Mar 10.

DOI: 10.1016/j.tibtech.2022.02.005
PMID: 35282901

Abstract

Batch effects (BEs) are technical biases that may confound analysis of high-throughput biotechnological data. BEs are complex and effective mitigation is highly context-dependent. In particular, the advent of high-resolution technologies such as single-cell RNA sequencing presents new challenges. We first cover how BE modeling differs between traditional datasets and the new data landscape. We also discuss new approaches for measuring and mitigating BEs, including whether a BE is significant enough to warrant correction. Even with the advent of machine learning and artificial intelligence, the increased complexity of next-generation biotechnological data means increased complexities in BE management. We forecast that BEs will not only remain relevant in the age of big data but will become even more important.

Similar articles

1. Are batch effects still relevant in the age of big data?
Trends Biotechnol. 2022 Sep;40(9):1029-1040. doi: 10.1016/j.tibtech.2022.02.005. Epub 2022 Mar 10.
2. Artificial intelligence and machine learning in precision medicine: A paradigm shift in big data analysis.
Prog Mol Biol Transl Sci. 2022;190(1):57-100. doi: 10.1016/bs.pmbts.2022.03.002. Epub 2022 Apr 8.
3. Protein-DNA/RNA interactions: Machine intelligence tools and approaches in the era of artificial intelligence and big data.
Proteomics. 2022 Apr;22(8):e2100197. doi: 10.1002/pmic.202100197. Epub 2022 Feb 13.
4. Big Data in Surgery.
Surg Clin North Am. 2023 Apr;103(2):219-232. doi: 10.1016/j.suc.2022.12.002.
5. Biotechnology, Big Data and Artificial Intelligence.
Biotechnol J. 2019 Aug;14(8):e1800613. doi: 10.1002/biot.201800613. Epub 2019 May 27.
6. Artificial Intelligence and Big Data in Diabetes Care: A Position Statement of the Italian Association of Medical Diabetologists.
J Med Internet Res. 2020 Jun 22;22(6):e16922. doi: 10.2196/16922.
7. m-Health 2.0: New perspectives on mobile health, machine learning and big data analytics.
Methods. 2018 Dec 1;151:34-40. doi: 10.1016/j.ymeth.2018.05.015. Epub 2018 Jun 8.
8. Revolutionizing enzyme engineering through artificial intelligence and machine learning.
Emerg Top Life Sci. 2021 May 14;5(1):113-125. doi: 10.1042/ETLS20200257.
9. Machine Learning and Artificial Intelligence: A Paradigm Shift in Big Data-Driven Drug Design and Discovery.
Curr Top Med Chem. 2022;22(20):1692-1727. doi: 10.2174/1568026622666220701091339.
10. Big Data and Artificial Intelligence Modeling for Drug Discovery.
Annu Rev Pharmacol Toxicol. 2020 Jan 6;60:573-589. doi: 10.1146/annurev-pharmtox-010919-023324. Epub 2019 Sep 13.

Cited by

1. High performance data integration for large-scale analyses of incomplete Omic profiles using Batch-Effect Reduction Trees (BERT).
Nat Commun. 2025 Aug 2;16(1):7104. doi: 10.1038/s41467-025-62237-4.
2. Advancing atmospheric solids analysis probe mass spectrometry applications: a multifaceted approach to optimising clinical data set generation.
Analyst. 2025 May 15. doi: 10.1039/d5an00166h.
3. Balancing ethical data sharing and open science for reproducible research in biomedical data science.
Cell Rep Med. 2025 Apr 15;6(4):102080. doi: 10.1016/j.xcrm.2025.102080.
4. Thinking points for effective batch correction on biomedical data.
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae515.
5. Assessing and mitigating batch effects in large-scale omics studies.
Genome Biol. 2024 Oct 3;25(1):254. doi: 10.1186/s13059-024-03401-9.
6. Ten quick tips for ensuring machine learning model validity.
PLoS Comput Biol. 2024 Sep 19;20(9):e1012402. doi: 10.1371/journal.pcbi.1012402. eCollection 2024 Sep.
7. Particle uptake in cancer cells can predict malignancy and drug resistance using machine learning.
Sci Adv. 2024 May 31;10(22):eadj4370. doi: 10.1126/sciadv.adj4370. Epub 2024 May 29.
8. Data pre-processing for analyzing microbiome data - A mini review.
Comput Struct Biotechnol J. 2023 Oct 4;21:4804-4815. doi: 10.1016/j.csbj.2023.10.001. eCollection 2023.
9. Correcting batch effects in large-scale multiomics studies using a reference-material-based ratio method.
Genome Biol. 2023 Sep 7;24(1):201. doi: 10.1186/s13059-023-03047-z.
10. Artificial intelligence-driven electrochemical immunosensing biochips in multi-component detection.
Biomicrofluidics. 2023 Aug 21;17(4):041301. doi: 10.1063/5.0160808. eCollection 2023 Jul.