School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamilnadu, India.
PLoS One. 2024 Aug 26;19(8):e0291928. doi: 10.1371/journal.pone.0291928. eCollection 2024.
A timely and consistent assessment of crop yield will assist the farmers in improving their income, minimizing losses, and deriving strategic plans in agricultural commodities to adopt import-export policies. Crop yield predictions are one of the various challenges faced in the agriculture sector and play a significant role in planning and decision-making. Machine learning algorithms provided enough belief and proved their ability to predict crop yield. The selection of the most suitable crop is influenced by various environmental factors such as temperature, soil fertility, water availability, quality, and seasonal variations, as well as economic considerations such as stock availability, preservation capabilities, market demand, purchasing power, and crop prices. The paper outlines a framework used to evaluate the performance of various machine-learning algorithms for forecasting crop yields. The models were based on a range of prime parameters including pesticides, rainfall and average temperature. The Results of three machine learning algorithms, Categorical Boosting (CatBoost), Light Gradient-Boosting Machine (LightGBM), and eXtreme Gradient Boosting (XGBoost) are compared and found more accurate than other algorithms in predicting crop yields. The RMSE and R2 values were calculated to compare the predicted and observed rice yields, resulting in the following values: CatBoost with 800 (0.24), LightGBM with 737 (0.33), and XGBoost with 744 (0.31). Among these three machine learning algorithms, CatBoost demonstrated the highest precision in predicting yields, achieving an accuracy rate of 99.123%.
及时且一致的作物产量评估将有助于农民提高收入、减少损失,并制定农业商品的战略计划以采取进出口政策。作物产量预测是农业部门面临的各种挑战之一,在规划和决策中起着重要作用。机器学习算法提供了足够的可信度,并证明了其预测作物产量的能力。最合适的作物选择受到各种环境因素的影响,如温度、土壤肥力、水的可用性、质量和季节性变化,以及库存可用性、保存能力、市场需求、购买力和作物价格等经济因素。本文概述了用于评估各种用于预测作物产量的机器学习算法性能的框架。该模型基于一系列主要参数,包括杀虫剂、降雨和平均温度。比较了三种机器学习算法(分类提升机(CatBoost)、轻梯度提升机(LightGBM)和极端梯度提升机(XGBoost))的结果,发现它们在预测作物产量方面比其他算法更准确。计算了均方根误差(RMSE)和 R2 值以比较预测和观察到的水稻产量,得出以下值:CatBoost 为 800(0.24),LightGBM 为 737(0.33),XGBoost 为 744(0.31)。在这三种机器学习算法中,CatBoost 在预测产量方面表现出最高的精度,准确率达到 99.123%。