• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无监督和半监督学习:植物系统生物学机器学习的下一个前沿。

Unsupervised and semi-supervised learning: the next frontier in machine learning for plant systems biology.

机构信息

Frontiers Science Center for Molecular Design Breeding, China Agricultural University, Beijing, 100094, China.

National Maize Improvement Center, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China.

出版信息

Plant J. 2022 Sep;111(6):1527-1538. doi: 10.1111/tpj.15905. Epub 2022 Jul 27.

DOI:10.1111/tpj.15905
PMID:35821601
Abstract

Advances in high-throughput omics technologies are leading plant biology research into the era of big data. Machine learning (ML) performs an important role in plant systems biology because of its excellent performance and wide application in the analysis of big data. However, to achieve ideal performance, supervised ML algorithms require large numbers of labeled samples as training data. In some cases, it is impossible or prohibitively expensive to obtain enough labeled training data; here, the paradigms of unsupervised learning (UL) and semi-supervised learning (SSL) play an indispensable role. In this review, we first introduce the basic concepts of ML techniques, as well as some representative UL and SSL algorithms, including clustering, dimensionality reduction, self-supervised learning (self-SL), positive-unlabeled (PU) learning and transfer learning. We then review recent advances and applications of UL and SSL paradigms in both plant systems biology and plant phenotyping research. Finally, we discuss the limitations and highlight the significance and challenges of UL and SSL strategies in plant systems biology.

摘要

高通量组学技术的进步使植物生物学研究进入了大数据时代。机器学习 (ML) 在植物系统生物学中发挥着重要作用,因为它在大数据分析中的出色性能和广泛应用。然而,为了达到理想的性能,监督式 ML 算法需要大量标记样本作为训练数据。在某些情况下,获得足够的标记训练数据是不可能的或代价高昂的;在这里,无监督学习 (UL) 和半监督学习 (SSL) 的范例发挥了不可或缺的作用。在这篇综述中,我们首先介绍了 ML 技术的基本概念,以及一些有代表性的 UL 和 SSL 算法,包括聚类、降维、自监督学习 (self-SL)、正无标记 (PU) 学习和迁移学习。然后,我们回顾了 UL 和 SSL 范例在植物系统生物学和植物表型研究中的最新进展和应用。最后,我们讨论了这些方法的局限性,并强调了 UL 和 SSL 策略在植物系统生物学中的重要性和挑战。

相似文献

1
Unsupervised and semi-supervised learning: the next frontier in machine learning for plant systems biology.无监督和半监督学习:植物系统生物学机器学习的下一个前沿。
Plant J. 2022 Sep;111(6):1527-1538. doi: 10.1111/tpj.15905. Epub 2022 Jul 27.
2
Semi-Supervised and Unsupervised Deep Visual Learning: A Survey.半监督与无监督深度视觉学习:一项综述。
IEEE Trans Pattern Anal Mach Intell. 2024 Mar;46(3):1327-1347. doi: 10.1109/TPAMI.2022.3201576. Epub 2024 Feb 6.
3
Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods.大数据时代的小数据挑战:无监督和半监督方法的最新进展综述。
IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):2168-2187. doi: 10.1109/TPAMI.2020.3031898. Epub 2022 Mar 4.
4
ℓ-norm based safe semi-supervised learning.基于 l-范数的安全半监督学习。
Math Biosci Eng. 2021 Sep 7;18(6):7727-7742. doi: 10.3934/mbe.2021383.
5
CPSS: Fusing consistency regularization and pseudo-labeling techniques for semi-supervised deep cardiovascular disease detection using all unlabeled electrocardiograms.CPSS:利用所有未标记的心电图进行半监督深度心血管疾病检测的一致性正则化和伪标记技术融合。
Comput Methods Programs Biomed. 2024 Sep;254:108315. doi: 10.1016/j.cmpb.2024.108315. Epub 2024 Jul 4.
6
Weakly Semi-supervised phenotyping using Electronic Health records.基于电子健康记录的弱监督表型研究
J Biomed Inform. 2022 Oct;134:104175. doi: 10.1016/j.jbi.2022.104175. Epub 2022 Sep 5.
7
Multi-class motor imagery EEG classification using collaborative representation-based semi-supervised extreme learning machine.基于协同表示的半监督极限学习机的多类运动想象 EEG 分类。
Med Biol Eng Comput. 2020 Sep;58(9):2119-2130. doi: 10.1007/s11517-020-02227-4. Epub 2020 Jul 16.
8
Comprehensive study of semi-supervised learning for DNA methylation-based supervised classification of central nervous system tumors.基于 DNA 甲基化的中枢神经系统肿瘤有监督分类的半监督学习综合研究。
BMC Bioinformatics. 2022 Jun 8;23(1):223. doi: 10.1186/s12859-022-04764-1.
9
Comparing supervised and semi-supervised Machine Learning Models on Diagnosing Breast Cancer.比较监督式和半监督式机器学习模型在乳腺癌诊断中的应用
Ann Med Surg (Lond). 2021 Jan 8;62:53-64. doi: 10.1016/j.amsu.2020.12.043. eCollection 2021 Feb.
10
A Survey on Self-Supervised Learning: Algorithms, Applications, and Future Trends.自监督学习综述:算法、应用及未来趋势
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):9052-9071. doi: 10.1109/TPAMI.2024.3415112. Epub 2024 Nov 6.

引用本文的文献

1
PhytoCluster: a generative deep learning model for clustering plant single-cell RNA-seq data.植物聚类:一种用于对植物单细胞RNA测序数据进行聚类的生成式深度学习模型。
aBIOTECH. 2025 Feb 20;6(2):189-201. doi: 10.1007/s42994-025-00196-6. eCollection 2025 Jun.
2
Advancements in AI for Computational Biology and Bioinformatics: A Comprehensive Review.用于计算生物学和生物信息学的人工智能进展:全面综述。
Methods Mol Biol. 2025;2952:87-105. doi: 10.1007/978-1-0716-4690-8_6.
3
Artificial intelligence: the human response to approach the complexity of big data in biology.
人工智能:人类应对生物学大数据复杂性的方式
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf057.
4
Using supervised machine-learning approaches to understand abiotic stress tolerance and design resilient crops.利用监督式机器学习方法来理解非生物胁迫耐受性并设计抗逆作物。
Philos Trans R Soc Lond B Biol Sci. 2025 May 29;380(1927):20240252. doi: 10.1098/rstb.2024.0252.
5
Machine-learning meta-analysis reveals ethylene as a central component of the molecular core in abiotic stress responses in Arabidopsis.机器学习荟萃分析揭示乙烯是拟南芥非生物胁迫响应中分子核心的核心组成部分。
Nat Commun. 2025 May 22;16(1):4778. doi: 10.1038/s41467-025-59542-3.
6
Application of machine learning and genomics for orphan crop improvement.机器学习与基因组学在小众作物改良中的应用。
Nat Commun. 2025 Jan 24;16(1):982. doi: 10.1038/s41467-025-56330-x.
7
Enhancing ransomware defense: deep learning-based detection and family-wise classification of evolving threats.增强勒索软件防御:基于深度学习的不断演变威胁的检测与家族式分类
PeerJ Comput Sci. 2024 Nov 29;10:e2546. doi: 10.7717/peerj-cs.2546. eCollection 2024.
8
Advancing plant biology through deep learning-powered natural language processing.通过深度学习赋能的自然语言处理推动植物生物学发展。
Plant Cell Rep. 2024 Aug 5;43(8):208. doi: 10.1007/s00299-024-03294-9.
9
Machine Learning for AI Breeding in Plants.用于植物人工智能育种的机器学习
Genomics Proteomics Bioinformatics. 2024 Sep 13;22(4). doi: 10.1093/gpbjnl/qzae051.
10
A review of artificial intelligence-assisted omics techniques in plant defense: current trends and future directions.植物防御中人工智能辅助组学技术综述:当前趋势与未来方向
Front Plant Sci. 2024 Mar 5;15:1292054. doi: 10.3389/fpls.2024.1292054. eCollection 2024.