• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于短周期农产品销售预测的分层随机森林-极端梯度提升模型

A Hierarchical RF-XGBoost Model for Short-Cycle Agricultural Product Sales Forecasting.

作者信息

Li Jiawen, Lin Binfan, Wang Peixian, Chen Yanmei, Zeng Xianxian, Liu Xin, Chen Rongjun

机构信息

School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou 510665, China.

Guangxi Key Lab of Multi-Source Information Mining & Security, Guangxi Normal University, Guilin 541004, China.

出版信息

Foods. 2024 Sep 17;13(18):2936. doi: 10.3390/foods13182936.

DOI:10.3390/foods13182936
PMID:39335865
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11431005/
Abstract

Short-cycle agricultural product sales forecasting significantly reduces food waste by accurately predicting demand, ensuring producers match supply with consumer needs. However, the forecasting is often subject to uncertain factors, resulting in highly volatile and discontinuous data. To address this, a hierarchical prediction model that combines RF-XGBoost is proposed in this work. It adopts the Random Forest (RF) in the first layer to extract residuals and achieve initial prediction results based on correlation features from Grey Relation Analysis (GRA). Then, a new feature set based on residual clustering features is generated after the hierarchical clustering is applied to classify the characteristics of the residuals. Subsequently, Extreme Gradient Boosting (XGBoost) acts as the second layer that utilizes those residual clustering features to yield the prediction results. The final prediction is by incorporating the results from the first layer and second layer correspondingly. As for the performance evaluation, using agricultural product sales data from a supermarket in China from 1 July 2020 to 30 June 2023, the results demonstrate superiority over standalone RF and XGBoost, with a Mean Absolute Percentage Error (MAPE) reduction of 10% and 12%, respectively, and a coefficient of determination (R) increase of 22% and 24%, respectively. Additionally, its generalization is validated across 42 types of agricultural products from six vegetable categories, showing its extensive practical ability. Such performances reveal that the proposed model beneficially enhances the precision of short-term agricultural product sales forecasting, with the advantages of optimizing the supply chain from producers to consumers and minimizing food waste accordingly.

摘要

短周期农产品销售预测通过准确预测需求显著减少了食物浪费,确保生产者使供应与消费者需求相匹配。然而,该预测常常受到不确定因素的影响,导致数据高度波动且不连续。为了解决这个问题,本文提出了一种结合随机森林(RF)和极端梯度提升(XGBoost)的分层预测模型。它在第一层采用随机森林(RF)来提取残差,并基于灰色关联分析(GRA)的相关特征获得初始预测结果。然后,在对残差特征进行分层聚类后,生成基于残差聚类特征的新特征集。随后,极端梯度提升(XGBoost)作为第二层利用这些残差聚类特征得出预测结果。最终预测是通过相应地合并第一层和第二层的结果得到的。在性能评估方面,使用中国一家超市2020年7月1日至2023年6月30日的农产品销售数据,结果表明该模型优于单独的随机森林(RF)和极端梯度提升(XGBoost),平均绝对百分比误差(MAPE)分别降低了10%和12%,决定系数(R)分别提高了22%和24%。此外,其泛化能力在六个蔬菜类别的42种农产品上得到了验证,显示出其广泛的实际应用能力。这些性能表明,所提出的模型有益地提高了短期农产品销售预测的精度,具有优化从生产者到消费者的供应链并相应减少食物浪费的优点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/348f/11431005/7fcb20bd5e70/foods-13-02936-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/348f/11431005/e5bc51685f4c/foods-13-02936-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/348f/11431005/7fcb20bd5e70/foods-13-02936-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/348f/11431005/e5bc51685f4c/foods-13-02936-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/348f/11431005/7fcb20bd5e70/foods-13-02936-g002.jpg

相似文献

1
A Hierarchical RF-XGBoost Model for Short-Cycle Agricultural Product Sales Forecasting.一种用于短周期农产品销售预测的分层随机森林-极端梯度提升模型
Foods. 2024 Sep 17;13(18):2936. doi: 10.3390/foods13182936.
2
Decision tree models for the estimation of geo-polymer concrete compressive strength.用于估算地质聚合物混凝土抗压强度的决策树模型。
Math Biosci Eng. 2024 Jan;21(1):1413-1444. doi: 10.3934/mbe.2024061. Epub 2022 Dec 27.
3
Time series analysis of hemorrhagic fever with renal syndrome in mainland China by using an XGBoost forecasting model.中国大陆肾综合征出血热的时间序列分析:基于 XGBoost 预测模型。
BMC Infect Dis. 2021 Aug 19;21(1):839. doi: 10.1186/s12879-021-06503-y.
4
Carbon price prediction based on decomposition technique and extreme gradient boosting optimized by the grey wolf optimizer algorithm.基于分解技术和灰狼优化算法优化的极端梯度提升的碳价格预测
Sci Rep. 2023 Oct 27;13(1):18447. doi: 10.1038/s41598-023-45524-2.
5
Forecasting and analyzing influenza activity in Hebei Province, China, using a CNN-LSTM hybrid model.利用 CNN-LSTM 混合模型预测和分析中国河北省的流感活动。
BMC Public Health. 2024 Aug 12;24(1):2171. doi: 10.1186/s12889-024-19590-8.
6
Multi-step ahead forecasting of electrical conductivity in rivers by using a hybrid Convolutional Neural Network-Long Short-Term Memory (CNN-LSTM) model enhanced by Boruta-XGBoost feature selection algorithm.利用由Boruta-XGBoost特征选择算法增强的混合卷积神经网络-长短期记忆(CNN-LSTM)模型对河流电导率进行多步提前预测。
Sci Rep. 2024 Jul 1;14(1):15051. doi: 10.1038/s41598-024-65837-0.
7
A hybrid sales forecasting scheme by combining independent component analysis with K-means clustering and support vector regression.一种通过将独立成分分析与K均值聚类和支持向量回归相结合的混合销售预测方案。
ScientificWorldJournal. 2014;2014:624017. doi: 10.1155/2014/624017. Epub 2014 Jun 17.
8
Prediction of Rock Unloading Strength Based on PSO-XGBoost Hybrid Models.基于粒子群优化-极限梯度提升混合模型的岩石卸荷强度预测
Materials (Basel). 2024 Aug 26;17(17):4214. doi: 10.3390/ma17174214.
9
An optimized XGBoost-based machine learning method for predicting wave run-up on a sloping beach.一种基于优化极限梯度提升的机器学习方法用于预测斜坡海滩上的波浪爬高。
MethodsX. 2023 Mar 10;10:102119. doi: 10.1016/j.mex.2023.102119. eCollection 2023.
10
Deep learning infused SIRVD model for COVID-19 prediction: XGBoost-SIRVD-LSTM approach.用于COVID-19预测的深度学习注入SIRVD模型:XGBoost-SIRVD-LSTM方法。
Front Med (Lausanne). 2024 Sep 3;11:1427239. doi: 10.3389/fmed.2024.1427239. eCollection 2024.

本文引用的文献

1
Price Forecasting of Marine Fish Based on Weight Allocation Intelligent Combinatorial Modelling.基于权重分配智能组合建模的海水鱼类价格预测
Foods. 2024 Apr 15;13(8):1202. doi: 10.3390/foods13081202.
2
New Approaches for Improving the Quality of Processed Fruits and Vegetables and Their By-Products.提高加工水果、蔬菜及其副产品质量的新方法
Foods. 2023 Mar 23;12(7):1353. doi: 10.3390/foods12071353.
3
A tree based eXtreme Gradient Boosting (XGBoost) machine learning model to forecast the annual rice production in Bangladesh.
基于树的极端梯度提升(XGBoost)机器学习模型,用于预测孟加拉国的年度水稻产量。
PLoS One. 2023 Mar 27;18(3):e0283452. doi: 10.1371/journal.pone.0283452. eCollection 2023.
4
Building resilient food system amidst COVID-19: Responses and lessons from China.在新冠疫情期间构建具有韧性的粮食系统:来自中国的应对措施与经验教训
Agric Syst. 2021 May;190:103102. doi: 10.1016/j.agsy.2021.103102. Epub 2021 Feb 10.
5
Machine learning techniques for forecasting agricultural prices: A case of brinjal in Odisha, India.机器学习技术在农产品价格预测中的应用:以印度奥里萨邦茄子为例。
PLoS One. 2022 Jul 6;17(7):e0270553. doi: 10.1371/journal.pone.0270553. eCollection 2022.
6
Development of a Predictive Model for Agave Prices Employing Environmental, Economic, and Social Factors: Towards a Planned Supply Chain for Agave-Tequila Industry.利用环境、经济和社会因素开发龙舌兰价格预测模型:迈向龙舌兰-龙舌兰酒产业的规划供应链。
Foods. 2022 Apr 14;11(8):1138. doi: 10.3390/foods11081138.
7
State of the art in total pulse production in major states of India using ARIMA techniques.印度主要邦运用自回归整合移动平均(ARIMA)技术进行总脉冲生产的技术现状。
Curr Res Food Sci. 2021 Oct 28;4:800-806. doi: 10.1016/j.crfs.2021.10.009. eCollection 2021.
8
Application of a hybrid ARIMA-LSTM model based on the SPEI for drought forecasting.基于SPEI 的混合 ARIMA-LSTM 模型在干旱预测中的应用。
Environ Sci Pollut Res Int. 2022 Jan;29(3):4128-4144. doi: 10.1007/s11356-021-15325-z. Epub 2021 Aug 17.