• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习:在植物系统生物学中的挑战与机遇。

Machine learning: its challenges and opportunities in plant system biology.

机构信息

Department of Plant Agriculture, University of Guelph, Guelph, ON, N1G 2W1, Canada.

Department of Botany, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada.

出版信息

Appl Microbiol Biotechnol. 2022 May;106(9-10):3507-3530. doi: 10.1007/s00253-022-11963-6. Epub 2022 May 16.

DOI:10.1007/s00253-022-11963-6
PMID:35575915
Abstract

Sequencing technologies are evolving at a rapid pace, enabling the generation of massive amounts of data in multiple dimensions (e.g., genomics, epigenomics, transcriptomic, metabolomics, proteomics, and single-cell omics) in plants. To provide comprehensive insights into the complexity of plant biological systems, it is important to integrate different omics datasets. Although recent advances in computational analytical pipelines have enabled efficient and high-quality exploration and exploitation of single omics data, the integration of multidimensional, heterogenous, and large datasets (i.e., multi-omics) remains a challenge. In this regard, machine learning (ML) offers promising approaches to integrate large datasets and to recognize fine-grained patterns and relationships. Nevertheless, they require rigorous optimizations to process multi-omics-derived datasets. In this review, we discuss the main concepts of machine learning as well as the key challenges and solutions related to the big data derived from plant system biology. We also provide in-depth insight into the principles of data integration using ML, as well as challenges and opportunities in different contexts including multi-omics, single-cell omics, protein function, and protein-protein interaction. KEY POINTS: • The key challenges and solutions related to the big data derived from plant system biology have been highlighted. • Different methods of data integration have been discussed. • Challenges and opportunities of the application of machine learning in plant system biology have been highlighted and discussed.

摘要

测序技术发展迅速,能够在多个维度(如基因组学、表观基因组学、转录组学、代谢组学、蛋白质组学和单细胞组学)产生大量数据。为了全面了解植物生物系统的复杂性,整合不同的组学数据集非常重要。尽管计算分析管道的最新进展使得高效、高质量地探索和利用单一组学数据成为可能,但多维、异质和大型数据集(即多组学)的整合仍然是一个挑战。在这方面,机器学习 (ML) 提供了有前途的方法来整合大型数据集,并识别细粒度的模式和关系。然而,它们需要严格的优化来处理多组学衍生的数据集。在这篇综述中,我们讨论了机器学习的主要概念,以及与植物系统生物学相关的大数据相关的关键挑战和解决方案。我们还深入探讨了使用 ML 进行数据集成的原则,以及在不同背景下(包括多组学、单细胞组学、蛋白质功能和蛋白质-蛋白质相互作用)的挑战和机遇。

要点

  • 强调了与植物系统生物学相关的大数据的关键挑战和解决方案。

  • 讨论了不同的数据集成方法。

  • 强调和讨论了机器学习在植物系统生物学中的应用的挑战和机遇。

相似文献

1
Machine learning: its challenges and opportunities in plant system biology.机器学习:在植物系统生物学中的挑战与机遇。
Appl Microbiol Biotechnol. 2022 May;106(9-10):3507-3530. doi: 10.1007/s00253-022-11963-6. Epub 2022 May 16.
2
A comprehensive review of machine learning techniques for multi-omics data integration: challenges and applications in precision oncology.多组学数据整合的机器学习技术综合综述:精准肿瘤学中的挑战与应用
Brief Funct Genomics. 2024 Sep 27;23(5):549-560. doi: 10.1093/bfgp/elae013.
3
Machine learning meets omics: applications and perspectives.机器学习与组学的融合:应用与展望。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab460.
4
The application of multi-omics and systems biology to identify therapeutic targets in chronic kidney disease.多组学和系统生物学在慢性肾脏病治疗靶点识别中的应用。
Nephrol Dial Transplant. 2016 Dec;31(12):2003-2011. doi: 10.1093/ndt/gfv364. Epub 2015 Oct 20.
5
Multi -omics and metabolic modelling pipelines: challenges and tools for systems microbiology.多组学和代谢建模管道:系统微生物学的挑战和工具。
Microbiol Res. 2015 Feb;171:52-64. doi: 10.1016/j.micres.2015.01.003. Epub 2015 Jan 7.
6
Bioinformatics Analysis and Validation of Potential Markers Associated with Prediction and Prognosis of Gastric Cancer.生物信息学分析和验证与胃癌预测和预后相关的潜在标志物。
Int J Mol Sci. 2024 May 28;25(11):5880. doi: 10.3390/ijms25115880.
7
Omics-Based Strategies in Precision Medicine: Toward a Paradigm Shift in Inborn Errors of Metabolism Investigations.精准医学中基于组学的策略:代谢性遗传病研究范式的转变
Int J Mol Sci. 2016 Sep 14;17(9):1555. doi: 10.3390/ijms17091555.
8
Multi-omics based artificial intelligence for cancer research.基于多组学的人工智能在癌症研究中的应用。
Adv Cancer Res. 2024;163:303-356. doi: 10.1016/bs.acr.2024.06.005. Epub 2024 Jul 9.
9
Opportunities and Challenges in Advancing Plant Research with Single-cell Omics.单细胞组学推动植物研究的机遇与挑战。
Genomics Proteomics Bioinformatics. 2024 Jul 3;22(2). doi: 10.1093/gpbjnl/qzae026.
10
Omics-Based Investigations of Breast Cancer.基于组学的乳腺癌研究。
Molecules. 2023 Jun 14;28(12):4768. doi: 10.3390/molecules28124768.

引用本文的文献

1
Antioxidant Defense Systems in Plants: Mechanisms, Regulation, and Biotechnological Strategies for Enhanced Oxidative Stress Tolerance.植物中的抗氧化防御系统:增强氧化应激耐受性的机制、调控及生物技术策略
Life (Basel). 2025 Aug 14;15(8):1293. doi: 10.3390/life15081293.
2
Harnessing Multi-Omics and Predictive Modeling for Climate-Resilient Crop Breeding: From Genomes to Fields.利用多组学和预测模型实现气候适应性作物育种:从基因组到田间
Genes (Basel). 2025 Jul 10;16(7):809. doi: 10.3390/genes16070809.
3
The role of statistics in advancing nitric oxide research in plant biology: from data analysis to mechanistic insights.

本文引用的文献

1
SNP discovery and structural insights into OeFAD2 unravelling high oleic/linoleic ratio in olive oil.油橄榄FAD2基因单核苷酸多态性发现及结构解析:揭示橄榄油高油酸/亚油酸比例的原因
Comput Struct Biotechnol J. 2022 Mar 4;20:1229-1243. doi: 10.1016/j.csbj.2022.02.028. eCollection 2022.
2
Data and its (dis)contents: A survey of dataset development and use in machine learning research.数据及其(不)内容:机器学习研究中数据集开发与使用的调查
Patterns (N Y). 2021 Nov 12;2(11):100336. doi: 10.1016/j.patter.2021.100336.
3
Prediction of conserved and variable heat and cold stress response in maize using cis-regulatory information.
统计学在推动植物生物学中一氧化氮研究方面的作用:从数据分析到机理洞察。
Front Plant Sci. 2025 Jul 1;16:1597030. doi: 10.3389/fpls.2025.1597030. eCollection 2025.
4
Using supervised machine-learning approaches to understand abiotic stress tolerance and design resilient crops.利用监督式机器学习方法来理解非生物胁迫耐受性并设计抗逆作物。
Philos Trans R Soc Lond B Biol Sci. 2025 May 29;380(1927):20240252. doi: 10.1098/rstb.2024.0252.
5
The Effect of Naturally Acquired Immunity on Mortality Predictors: A Focus on Individuals with New Coronavirus.自然获得性免疫对死亡率预测指标的影响:聚焦新型冠状病毒感染者
Biomedicines. 2025 Mar 27;13(4):803. doi: 10.3390/biomedicines13040803.
6
Breaking the barrier of human-annotated training data for machine learning-aided plant research using aerial imagery.打破用于机器学习辅助植物研究的人工标注训练数据的壁垒——利用航空图像
Plant Physiol. 2025 Mar 28;197(4). doi: 10.1093/plphys/kiaf132.
7
Machine Learning-Aided Optimization of In Vitro Tetraploid Induction in Cannabis.机器学习辅助优化大麻体外四倍体诱导
Int J Mol Sci. 2025 Feb 18;26(4):1746. doi: 10.3390/ijms26041746.
8
Reliability of plastid and mitochondrial localisation prediction declines rapidly with the evolutionary distance to the training set increasing.质体和线粒体定位预测的可靠性随着与训练集的进化距离的增加而迅速下降。
PLoS Comput Biol. 2024 Nov 11;20(11):e1012575. doi: 10.1371/journal.pcbi.1012575. eCollection 2024 Nov.
9
Genotype-specific responses to drought stress in myrtle ( L.): integrating machine learning techniques.香桃木(L.)对干旱胁迫的基因型特异性响应:整合机器学习技术。
PeerJ. 2024 Oct 7;12:e18081. doi: 10.7717/peerj.18081. eCollection 2024.
10
A Reinforcement Learning approach to study climbing plant behaviour.强化学习方法在攀援植物行为研究中的应用。
Sci Rep. 2024 Aug 6;14(1):18222. doi: 10.1038/s41598-024-62147-3.
利用顺式调控信息预测玉米中保守和可变的热激和冷激响应。
Plant Cell. 2022 Jan 20;34(1):514-534. doi: 10.1093/plcell/koab267.
4
A survey on missing data in machine learning.关于机器学习中缺失数据的一项调查。
J Big Data. 2021;8(1):140. doi: 10.1186/s40537-021-00516-9. Epub 2021 Oct 27.
5
Deep learning in retrosynthesis planning: datasets, models and tools.深度学习在反合成规划中的应用:数据集、模型和工具。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab391.
6
Epigenome plasticity in plants.植物中的表观基因组可塑性。
Nat Rev Genet. 2022 Jan;23(1):55-68. doi: 10.1038/s41576-021-00407-y. Epub 2021 Sep 15.
7
A guide to machine learning for biologists.生物学机器学习指南。
Nat Rev Mol Cell Biol. 2022 Jan;23(1):40-55. doi: 10.1038/s41580-021-00407-0. Epub 2021 Sep 13.
8
Comprehensive Evaluation of Differential Methylation Analysis Methods for Bisulfite Sequencing Data.基于亚硫酸氢盐测序数据的差异甲基化分析方法的综合评估
Int J Environ Res Public Health. 2021 Jul 28;18(15):7975. doi: 10.3390/ijerph18157975.
9
Reinforcement-learning in fronto-striatal circuits.额叶-纹状体回路中的强化学习
Neuropsychopharmacology. 2022 Jan;47(1):147-162. doi: 10.1038/s41386-021-01108-0. Epub 2021 Aug 5.
10
Team effort: Combinatorial control of seed maturation by transcription factors.团队合作:转录因子对种子成熟的组合控制。
Curr Opin Plant Biol. 2021 Oct;63:102091. doi: 10.1016/j.pbi.2021.102091. Epub 2021 Jul 31.