• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

打破用于机器学习辅助植物研究的人工标注训练数据的壁垒——利用航空图像

Breaking the barrier of human-annotated training data for machine learning-aided plant research using aerial imagery.

作者信息

Varela Sebastian, Zheng Xuying, Njuguna Joyce, Sacks Erik, Allen Dylan, Ruhter Jeremy, Leakey Andrew D B

机构信息

Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana Champaign, Urbana, IL 61801, USA.

Independent Researcher, Canelones 15800, Uruguay.

出版信息

Plant Physiol. 2025 Mar 28;197(4). doi: 10.1093/plphys/kiaf132.

DOI:10.1093/plphys/kiaf132
PMID:40265604
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12015685/
Abstract

Machine learning (ML) can accelerate biological research. However, the adoption of such tools to facilitate phenotyping based on sensor data has been limited by (i) the need for a large amount of human-annotated training data for each context in which the tool is used and (ii) phenotypes varying across contexts defined in terms of genetics and environment. This is a major bottleneck because acquiring training data is generally costly and time-consuming. This study demonstrates how a ML approach can address these challenges by minimizing the amount of human supervision needed for tool building. A case study was performed to compare ML approaches that examine images collected by an uncrewed aerial vehicle to determine the presence/absence of panicles (i.e. "heading") across thousands of field plots containing genetically diverse breeding populations of 2 Miscanthus species. Automated analysis of aerial imagery enabled the identification of heading approximately 9 times faster than in-field visual inspection by humans. Leveraging an Efficiently Supervised Generative Adversarial Network (ESGAN) learning strategy reduced the requirement for human-annotated data by 1 to 2 orders of magnitude compared to traditional, fully supervised learning approaches. The ESGAN model learned the salient features of the data set by using thousands of unlabeled images to inform the discriminative ability of a classifier so that it required minimal human-labeled training data. This method can accelerate the phenotyping of heading date as a measure of flowering time in Miscanthus across diverse contexts (e.g. in multistate trials) and opens avenues to promote the broad adoption of ML tools.

摘要

机器学习(ML)可以加速生物学研究。然而,基于传感器数据采用此类工具来促进表型分析受到了以下因素的限制:(i)在工具使用的每个背景下都需要大量人工标注的训练数据;(ii)表型会因遗传学和环境所定义的背景不同而有所变化。这是一个主要瓶颈,因为获取训练数据通常既昂贵又耗时。本研究展示了一种机器学习方法如何通过尽量减少工具构建所需的人工监督量来应对这些挑战。进行了一项案例研究,比较了多种机器学习方法,这些方法通过检查无人驾驶飞行器收集的图像,来确定数千个包含两种芒属植物遗传多样的育种群体的田间地块中是否存在圆锥花序(即“抽穗”)。航空图像的自动分析能够比人工实地目视检查快约9倍地识别抽穗情况。与传统的完全监督学习方法相比,利用高效监督生成对抗网络(ESGAN)学习策略将人工标注数据的需求减少了1至2个数量级。ESGAN模型通过使用数千张未标记图像来告知分类器的判别能力,从而学习数据集的显著特征,因此它只需要极少的人工标记训练数据。这种方法可以加速将抽穗日期作为芒属植物开花时间衡量指标的表型分析,适用于各种背景(例如在多州试验中),并为推动机器学习工具的广泛应用开辟了道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/13e2d3cffe9e/kiaf132f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/956ca4e52adf/kiaf132f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/ef6812af31dc/kiaf132f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/ad4f1b6b96e4/kiaf132f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/96422d0a534e/kiaf132f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/2f1a8c0f3902/kiaf132f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/c1fb5216169b/kiaf132f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/13e2d3cffe9e/kiaf132f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/956ca4e52adf/kiaf132f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/ef6812af31dc/kiaf132f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/ad4f1b6b96e4/kiaf132f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/96422d0a534e/kiaf132f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/2f1a8c0f3902/kiaf132f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/c1fb5216169b/kiaf132f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5306/12015685/13e2d3cffe9e/kiaf132f7.jpg

相似文献

1
Breaking the barrier of human-annotated training data for machine learning-aided plant research using aerial imagery.打破用于机器学习辅助植物研究的人工标注训练数据的壁垒——利用航空图像
Plant Physiol. 2025 Mar 28;197(4). doi: 10.1093/plphys/kiaf132.
2
Semi-supervised training using cooperative labeling of weakly annotated data for nodule detection in chest CT.基于弱标注数据的协同标注的半监督训练在胸部 CT 结节检测中的应用。
Med Phys. 2023 Jul;50(7):4255-4268. doi: 10.1002/mp.16219. Epub 2023 Jan 27.
3
A real-time phenotyping framework using machine learning for plant stress severity rating in soybean.一种使用机器学习进行大豆植株胁迫严重程度评级的实时表型分析框架。
Plant Methods. 2017 Apr 8;13:23. doi: 10.1186/s13007-017-0173-7. eCollection 2017.
4
Semi-supervised abdominal multi-organ segmentation by object-redrawing.通过对象重绘实现半监督腹部多器官分割
Med Phys. 2024 Nov;51(11):8334-8347. doi: 10.1002/mp.17364. Epub 2024 Aug 21.
5
Erratum: High-Throughput Identification of Resistance to Pseudomonas syringae pv. Tomato in Tomato using Seedling Flood Assay.勘误:利用幼苗浸没法高通量鉴定番茄对丁香假单胞菌 pv.番茄的抗性。
J Vis Exp. 2023 Oct 18(200). doi: 10.3791/6576.
6
A medical image classification method based on self-regularized adversarial learning.基于自正则化对抗学习的医学图像分类方法。
Med Phys. 2024 Nov;51(11):8232-8246. doi: 10.1002/mp.17320. Epub 2024 Jul 30.
7
SeLa-MIL: Developing an instance-level classifier via weakly-supervised self-training for whole slide image classification.SeLa-MIL:通过弱监督自训练开发用于全幻灯片图像分类的实例级分类器。
Comput Methods Programs Biomed. 2025 Apr;261:108614. doi: 10.1016/j.cmpb.2025.108614. Epub 2025 Jan 27.
8
Automatic Scoring of Rhizoctonia Crown and Root Rot Affected Sugar Beet Fields from Orthorectified UAV Images Using Machine Learning.利用机器学习从正射校正无人机图像自动评估受丝核菌冠根腐病影响的甜菜田
Plant Dis. 2024 Mar;108(3):711-724. doi: 10.1094/PDIS-04-23-0779-RE. Epub 2024 Mar 18.
9
Automated segmentation of lesions and organs at risk on [Ga]Ga-PSMA-11 PET/CT images using self-supervised learning with Swin UNETR.使用基于 Swin UNETR 的自监督学习对 [Ga]Ga-PSMA-11 PET/CT 图像上的病变和危险器官进行自动分割。
Cancer Imaging. 2024 Feb 29;24(1):30. doi: 10.1186/s40644-024-00675-x.
10
Automated Identification of Northern Leaf Blight-Infected Maize Plants from Field Imagery Using Deep Learning.利用深度学习从田间图像自动识别感染北方叶斑病的玉米植株
Phytopathology. 2017 Nov;107(11):1426-1432. doi: 10.1094/PHYTO-11-16-0417-R. Epub 2017 Aug 24.

引用本文的文献

1
Harder, better, faster, stronger, and with less annotated data: ESGAN and plant sciences.更努力、更出色、更快、更强,且使用更少的标注数据:ESGAN与植物科学。
Plant Physiol. 2025 Apr 30;198(1). doi: 10.1093/plphys/kiaf171.

本文引用的文献

1
Methodological evolution of potato yield prediction: a comprehensive review.马铃薯产量预测的方法学演变:全面综述
Front Plant Sci. 2023 Jul 26;14:1214006. doi: 10.3389/fpls.2023.1214006. eCollection 2023.
2
Scientific discovery in the age of artificial intelligence.人工智能时代的科学发现。
Nature. 2023 Aug;620(7972):47-60. doi: 10.1038/s41586-023-06221-2. Epub 2023 Aug 2.
3
Advancing artificial intelligence to help feed the world.推进人工智能以助力养活世界。
Nat Biotechnol. 2023 Sep;41(9):1188-1189. doi: 10.1038/s41587-023-01898-2.
4
Extending the breeder's equation to take aim at the target population of environments.扩展育种者方程以针对目标环境群体。
Front Plant Sci. 2023 Feb 21;14:1129591. doi: 10.3389/fpls.2023.1129591. eCollection 2023.
5
Climate change challenges, plant science solutions.气候变化挑战,植物科学解决方案。
Plant Cell. 2023 Jan 2;35(1):24-66. doi: 10.1093/plcell/koac303.
6
A new generative adversarial network for medical images super resolution.一种用于医学图像超分辨率的新型生成对抗网络。
Sci Rep. 2022 Jun 9;12(1):9533. doi: 10.1038/s41598-022-13658-4.
7
Machine learning: its challenges and opportunities in plant system biology.机器学习:在植物系统生物学中的挑战与机遇。
Appl Microbiol Biotechnol. 2022 May;106(9-10):3507-3530. doi: 10.1007/s00253-022-11963-6. Epub 2022 May 16.
8
Current progress and open challenges for applying deep learning across the biosciences.深度学习在整个生命科学中的应用现状及面临的开放性挑战。
Nat Commun. 2022 Apr 1;13(1):1728. doi: 10.1038/s41467-022-29268-7.
9
The Challenge of Data Annotation in Deep Learning-A Case Study on Whole Plant Corn Silage.深度学习中的数据标注挑战——以全株玉米青贮为例
Sensors (Basel). 2022 Feb 18;22(4):1596. doi: 10.3390/s22041596.
10
Data and its (dis)contents: A survey of dataset development and use in machine learning research.数据及其(不)内容:机器学习研究中数据集开发与使用的调查
Patterns (N Y). 2021 Nov 12;2(11):100336. doi: 10.1016/j.patter.2021.100336.