ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India.
ICAR-Indian Agricultural Research Institute, New Delhi, India.
PLoS One. 2022 Jul 6;17(7):e0270553. doi: 10.1371/journal.pone.0270553. eCollection 2022.
Price forecasting of perishable crop like vegetables has importance implications to the farmers, traders as well as consumers. Timely and accurate forecast of the price helps the farmers switch between the alternative nearby markets to sale their produce and getting good prices. The farmers can use the information to make choices around the timing of marketing. For forecasting price of agricultural commodities, several statistical models have been applied in past but those models have their own limitations in terms of assumptions.
In recent times, Machine Learning (ML) techniques have been much successful in modeling time series data. Though, numerous empirical studies have shown that ML approaches outperform time series models in forecasting time series, but their application in forecasting vegetables prices in India is scared. In the present investigation, an attempt has been made to explore efficient ML algorithms e.g. Generalized Neural Network (GRNN), Support Vector Regression (SVR), Random Forest (RF) and Gradient Boosting Machine (GBM) for forecasting wholesale price of Brinjal in seventeen major markets of Odisha, India.
An empirical comparison of the predictive accuracies of different models with that of the usual stochastic model i.e. Autoregressive integrated moving average (ARIMA) model is carried out and it is observed that ML techniques particularly GRNN performs better in most of the cases. The superiority of the models is established by means of Model Confidence Set (MCS), and other accuracy measures such as Mean Error (ME), Root Mean Square Error (RMSE), Mean Absolute Error (MAE) and Mean Absolute Prediction Error (MAPE). To this end, Diebold-Mariano test is performed to test for the significant differences in predictive accuracy of different models.
Among the machine learning techniques, GRNN performs better in all the seventeen markets as compared to other techniques. RF performs at par with GRNN in four markets. The accuracies of other techniques such as SVR, GBM and ARIMA are not up to the mark.
易腐农产品(如蔬菜)的价格预测对农民、交易商和消费者都具有重要意义。及时、准确的价格预测有助于农民在附近的替代市场之间切换,销售他们的产品并获得好价格。农民可以利用这些信息来选择销售的时机。过去,已经应用了几种统计模型来预测农产品价格,但这些模型在假设方面存在局限性。
最近,机器学习(ML)技术在时间序列数据建模方面取得了很大的成功。尽管如此,许多实证研究表明,ML 方法在时间序列预测方面优于时间序列模型,但它们在预测印度蔬菜价格方面的应用却很少。在本研究中,尝试探索了高效的 ML 算法,例如广义神经网络(GRNN)、支持向量回归(SVR)、随机森林(RF)和梯度提升机(GBM),用于预测印度奥里萨邦 17 个主要市场的茄子批发价格。
对不同模型的预测精度与常用的随机模型(即自回归综合移动平均(ARIMA)模型)进行了实证比较,结果表明,ML 技术特别是 GRNN 在大多数情况下表现更好。通过模型置信集(MCS)和其他精度指标(如平均误差(ME)、均方根误差(RMSE)、平均绝对误差(MAE)和平均绝对预测误差(MAPE)来确定模型的优越性。为此,进行了 Diebold-Mariano 检验,以检验不同模型的预测精度是否存在显著差异。
在机器学习技术中,GRNN 在所有 17 个市场中的表现都优于其他技术。RF 在四个市场中的表现与 GRNN 相当。其他技术(如 SVR、GBM 和 ARIMA)的精度则不尽如人意。