• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习集成预测玉米产量

Forecasting Corn Yield With Machine Learning Ensembles.

作者信息

Shahhosseini Mohsen, Hu Guiping, Archontoulis Sotirios V

机构信息

Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, United States.

Department of Agronomy, Iowa State University, Ames, IA, United States.

出版信息

Front Plant Sci. 2020 Jul 31;11:1120. doi: 10.3389/fpls.2020.01120. eCollection 2020.

DOI:10.3389/fpls.2020.01120
PMID:32849688
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7411227/
Abstract

The emergence of new technologies to synthesize and analyze big data with high-performance computing has increased our capacity to more accurately predict crop yields. Recent research has shown that machine learning (ML) can provide reasonable predictions faster and with higher flexibility compared to simulation crop modeling. However, a single machine learning model can be outperformed by a "committee" of models (machine learning ensembles) that can reduce prediction bias, variance, or both and is able to better capture the underlying distribution of the data. Yet, there are many aspects to be investigated with regard to prediction accuracy, time of the prediction, and scale. The earlier the prediction during the growing season the better, but this has not been thoroughly investigated as previous studies considered all data available to predict yields. This paper provides a machine leaning based framework to forecast corn yields in three US Corn Belt states (Illinois, Indiana, and Iowa) considering complete and partial in-season weather knowledge. Several ensemble models are designed using blocked sequential procedure to generate out-of-bag predictions. The forecasts are made in county-level scale and aggregated for agricultural district and state level scales. Results show that the proposed optimized weighted ensemble and the average ensemble are the most precise models with RRMSE of 9.5%. Stacked LASSO makes the least biased predictions (MBE of 53 kg/ha), while other ensemble models also outperformed the base learners in terms of bias. On the contrary, although random k-fold cross-validation is replaced by blocked sequential procedure, it is shown that stacked ensembles perform not as good as weighted ensemble models for time series data sets as they require the data to be non-IID to perform favorably. Comparing our proposed model forecasts with the literature demonstrates the acceptable performance of forecasts made by our proposed ensemble model. Results from the scenario of having partial in-season weather knowledge reveals that decent yield forecasts with RRMSE of 9.2% can be made as early as June 1. Moreover, it was shown that the proposed model performed better than individual models and benchmark ensembles at agricultural district and state-level scales as well as county-level scale. To find the marginal effect of each input feature on the forecasts made by the proposed ensemble model, a methodology is suggested that is the basis for finding feature importance for the ensemble model. The findings suggest that weather features corresponding to weather in weeks 18-24 (May 1 to June 1) are the most important input features.

摘要

利用高性能计算来合成和分析大数据的新技术的出现,提高了我们更准确预测作物产量的能力。最近的研究表明,与模拟作物建模相比,机器学习(ML)能够更快且更灵活地提供合理预测。然而,单个机器学习模型可能会被一个“模型委员会”(机器学习集成)超越,该集成能够减少预测偏差、方差或两者兼而有之,并且能够更好地捕捉数据的潜在分布。然而,在预测准确性、预测时间和规模方面仍有许多方面有待研究。在生长季节越早进行预测越好,但由于先前的研究考虑了所有可用数据来预测产量,这一点尚未得到充分研究。本文提供了一个基于机器学习的框架,用于预测美国玉米带三个州(伊利诺伊州、印第安纳州和爱荷华州)的玉米产量,同时考虑完整和部分季中天气信息。使用分块顺序程序设计了几个集成模型,以生成袋外预测。预测是在县级尺度上进行的,并汇总到农业区和州级尺度。结果表明,所提出的优化加权集成模型和平均集成模型是最精确的模型,相对均方根误差(RRMSE)为9.5%。堆叠套索回归做出的预测偏差最小(平均偏差误差为53公斤/公顷),而其他集成模型在偏差方面也优于基础学习器。相反,尽管随机k折交叉验证被分块顺序程序所取代,但结果表明,对于时间序列数据集,堆叠集成模型的表现不如加权集成模型,因为它们需要数据是非独立同分布的才能表现良好。将我们提出的模型预测与文献进行比较,证明了我们提出的集成模型所做预测的可接受性能。部分季中天气信息情景的结果表明,早在6月1日就可以做出相对均方根误差为9.2%的良好产量预测。此外,结果表明,所提出的模型在农业区、州级尺度以及县级尺度上的表现均优于单个模型和基准集成模型。为了找出每个输入特征对所提出的集成模型预测的边际效应,提出了一种方法,该方法是确定集成模型特征重要性的基础。研究结果表明,与第18 - 24周(5月1日至6月1日)天气相对应的天气特征是最重要的输入特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/49efa277f222/fpls-11-01120-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/5a5eee8a8ea9/fpls-11-01120-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/c17eea8f7cdf/fpls-11-01120-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/87472217fce0/fpls-11-01120-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/260f31eb33cf/fpls-11-01120-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/c9ec027410b7/fpls-11-01120-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/254419b71ead/fpls-11-01120-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/49efa277f222/fpls-11-01120-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/5a5eee8a8ea9/fpls-11-01120-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/c17eea8f7cdf/fpls-11-01120-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/87472217fce0/fpls-11-01120-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/260f31eb33cf/fpls-11-01120-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/c9ec027410b7/fpls-11-01120-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/254419b71ead/fpls-11-01120-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03a4/7411227/49efa277f222/fpls-11-01120-g007.jpg

相似文献

1
Forecasting Corn Yield With Machine Learning Ensembles.利用机器学习集成预测玉米产量
Front Plant Sci. 2020 Jul 31;11:1120. doi: 10.3389/fpls.2020.01120. eCollection 2020.
2
Corn Yield Prediction With Ensemble CNN-DNN.基于卷积神经网络与深度神经网络集成的玉米产量预测
Front Plant Sci. 2021 Aug 2;12:709008. doi: 10.3389/fpls.2021.709008. eCollection 2021.
3
Machine learning ensembles, neural network, hybrid and sparse regression approaches for weather based rainfed cotton yield forecast.基于机器学习集成、神经网络、混合和稀疏回归方法的天气预测雨养棉花产量。
Int J Biometeorol. 2024 Jun;68(6):1179-1197. doi: 10.1007/s00484-024-02661-1. Epub 2024 Apr 27.
4
Predictive performance of multi-model ensemble forecasts of COVID-19 across European nations.多模型集成预测对欧洲各国 COVID-19 疫情的预测性能。
Elife. 2023 Apr 21;12:e81916. doi: 10.7554/eLife.81916.
5
County-scale crop yield prediction by integrating crop simulation with machine learning models.通过整合作物模拟与机器学习模型进行县域尺度作物产量预测。
Front Plant Sci. 2022 Nov 28;13:1000224. doi: 10.3389/fpls.2022.1000224. eCollection 2022.
6
A Systems Modeling Approach to Forecast Corn Economic Optimum Nitrogen Rate.一种预测玉米经济最佳施氮量的系统建模方法。
Front Plant Sci. 2018 Apr 13;9:436. doi: 10.3389/fpls.2018.00436. eCollection 2018.
7
Coupling machine learning and crop modeling improves crop yield prediction in the US Corn Belt.机器学习与作物模型结合可提高美国玉米带的作物产量预测精度。
Sci Rep. 2021 Jan 15;11(1):1606. doi: 10.1038/s41598-020-80820-1.
8
In-season weather data provide reliable yield estimates of maize and soybean in the US central Corn Belt.季节内天气数据为美国中部玉米带的玉米和大豆提供了可靠的产量预估。
Int J Biometeorol. 2021 Apr;65(4):489-502. doi: 10.1007/s00484-020-02039-z. Epub 2020 Nov 21.
9
A deep learning approach to conflating heterogeneous geospatial data for corn yield estimation: A case study of the US Corn Belt at the county level.深度学习融合异质地理空间数据估算玉米产量的方法:以美国玉米带县级为例。
Glob Chang Biol. 2020 Mar;26(3):1754-1766. doi: 10.1111/gcb.14885. Epub 2019 Dec 2.
10
Seasonal forecasting of green water components and crop yields of winter wheat in Serbia and Austria.塞尔维亚和奥地利冬小麦绿水组分及作物产量的季节预测。
J Agric Sci. 2018 Jul;156(5):645-657. doi: 10.1017/S0021859617000788. Epub 2017 Dec 11.

引用本文的文献

1
Analyses of crop yield dynamics and the development of a multimodal neural network prediction model with G×E×M interactions.作物产量动态分析以及具有基因型×环境×管理相互作用的多模态神经网络预测模型的开发。
Front Plant Sci. 2025 Jul 31;16:1537990. doi: 10.3389/fpls.2025.1537990. eCollection 2025.
2
Machine Learning Analysis of Maize Seedling Traits Under Drought Stress.干旱胁迫下玉米幼苗性状的机器学习分析
Biology (Basel). 2025 Jun 29;14(7):787. doi: 10.3390/biology14070787.
3
Predicting wheat yield using deep learning and multi-source environmental data.

本文引用的文献

1
A CNN-RNN Framework for Crop Yield Prediction.一种用于作物产量预测的卷积神经网络-循环神经网络框架。
Front Plant Sci. 2020 Jan 24;10:1750. doi: 10.3389/fpls.2019.01750. eCollection 2019.
2
Development of a 10-km resolution global soil profile dataset for crop modeling applications.开发用于作物建模应用的10公里分辨率全球土壤剖面数据集。
Environ Model Softw. 2019 Sep;119:70-83. doi: 10.1016/j.envsoft.2019.05.012.
3
Crop Yield Prediction Using Deep Neural Networks.使用深度神经网络进行作物产量预测。
利用深度学习和多源环境数据预测小麦产量。
Sci Rep. 2025 Jul 21;15(1):26446. doi: 10.1038/s41598-025-11780-7.
4
Prediction model of in-hospital mortality risk in intensive care unit patients with cardiac arrest: a multicenter retrospective cohort study based on an ensemble model.重症监护病房心脏骤停患者院内死亡风险预测模型:一项基于集成模型的多中心回顾性队列研究
Front Cardiovasc Med. 2025 May 20;12:1582636. doi: 10.3389/fcvm.2025.1582636. eCollection 2025.
5
Mapping 1-km soybean yield across China from 2001 to 2020 based on ensemble learning.基于集成学习绘制2001年至2020年中国1公里分辨率的大豆产量图。
Sci Data. 2025 Mar 8;12(1):408. doi: 10.1038/s41597-025-04738-x.
6
Integrating Remote Sensing and Soil Features for Enhanced Machine Learning-Based Corn Yield Prediction in the Southern US.整合遥感与土壤特征以增强美国南部基于机器学习的玉米产量预测
Sensors (Basel). 2025 Jan 18;25(2):543. doi: 10.3390/s25020543.
7
Nitrogen monitoring and inversion algorithms of fruit trees based on spectral remote sensing: a deep review.基于光谱遥感的果树氮素监测与反演算法:深度综述
Front Plant Sci. 2024 Nov 22;15:1489151. doi: 10.3389/fpls.2024.1489151. eCollection 2024.
8
Global genotype by environment prediction competition reveals that diverse modeling strategies can deliver satisfactory maize yield estimates.全球基因型与环境互作预测竞赛表明,多种建模策略均可提供令人满意的玉米产量估计。
Genetics. 2025 Feb 5;229(2). doi: 10.1093/genetics/iyae195.
9
Analyzing the distribution patterns and dynamic niche of L. in the United States and China in response to climate change.分析在美国和中国,L. 针对气候变化的分布模式和动态生态位。
Front Plant Sci. 2024 Oct 22;15:1440610. doi: 10.3389/fpls.2024.1440610. eCollection 2024.
10
Global Genotype by Environment Prediction Competition Reveals That Diverse Modeling Strategies Can Deliver Satisfactory Maize Yield Estimates.全球基因型与环境预测竞赛表明,多种建模策略可提供令人满意的玉米产量估计。
bioRxiv. 2024 Sep 19:2024.09.13.612969. doi: 10.1101/2024.09.13.612969.
Front Plant Sci. 2019 May 22;10:621. doi: 10.3389/fpls.2019.00621. eCollection 2019.
4
New algorithms for detecting multi-effect and multi-way epistatic interactions.用于检测多效和多向上位性相互作用的新算法。
Bioinformatics. 2019 Dec 15;35(24):5078-5085. doi: 10.1093/bioinformatics/btz463.
5
Optimizing Selection and Mating in Genomic Selection with a Look-Ahead Approach: An Operations Research Framework.基于前瞻方法的基因组选择中选择和交配的优化:一个运筹学框架。
G3 (Bethesda). 2019 Jul 9;9(7):2123-2133. doi: 10.1534/g3.118.200842.
6
A novel machine learning-based approach for the risk assessment of nitrate groundwater contamination.一种基于新型机器学习的地下水硝酸盐污染风险评估方法。
Sci Total Environ. 2018 Dec 10;644:954-962. doi: 10.1016/j.scitotenv.2018.07.054. Epub 2018 Jul 11.
7
Modeling Long-Term Corn Yield Response to Nitrogen Rate and Crop Rotation.模拟玉米长期产量对施氮量和作物轮作的响应
Front Plant Sci. 2016 Nov 11;7:1630. doi: 10.3389/fpls.2016.01630. eCollection 2016.
8
Random Forests for Global and Regional Crop Yield Predictions.用于全球和区域作物产量预测的随机森林算法
PLoS One. 2016 Jun 3;11(6):e0156571. doi: 10.1371/journal.pone.0156571. eCollection 2016.
9
SoilGrids1km--global soil information based on automated mapping.SoilGrids1km——基于自动制图的全球土壤信息。
PLoS One. 2014 Aug 29;9(8):e105992. doi: 10.1371/journal.pone.0105992. eCollection 2014.
10
Determining the most important physiological and agronomic traits contributing to maize grain yield through machine learning algorithms: a new avenue in intelligent agriculture.通过机器学习算法确定对玉米籽粒产量有重要贡献的最重要生理和农艺性状:智能农业的新途径。
PLoS One. 2014 May 15;9(5):e97288. doi: 10.1371/journal.pone.0097288. eCollection 2014.