• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的预测建模以提升葡萄酒品质。

Machine learning-based predictive modelling for the enhancement of wine quality.

作者信息

Jain Khushboo, Kaushik Keshav, Gupta Sachin Kumar, Mahajan Shubham, Kadry Seifedine

机构信息

School of Computer Science, University of Petroleum and Energy Studies, Dehradun, India.

Department of Electronics and Communication Engineering, Central University of Jammu, Samba, Jammu, Jammu and Kashmir, 181143, India.

出版信息

Sci Rep. 2023 Oct 9;13(1):17042. doi: 10.1038/s41598-023-44111-9.

DOI:10.1038/s41598-023-44111-9
PMID:37814043
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10562461/
Abstract

The certification of wine quality is essential to the wine industry. The main goal of this work is to develop a machine learning model to forecast wine quality using the dataset. We utilised samples from the red wine dataset (RWD) with eleven distinct physiochemical properties. With the initial RWD, five machine learning (ML) models were trained and put to the test. The most accurate algorithms are Random Forest (RF) and Extreme Gradient Boosting (XGBoost). Using these two ML approaches, the top three features from a total of eleven features are chosen, and ML analysis is performed on the remaining features. Several graphs are employed to demonstrate the feature importance based on the XGBoost model and RF. Wine quality was predicted using relevant characteristics, often referred to as fundamental elements, that were shown to be essential during the feature selection procedure. When trained and tested without feature selection, with feature selection (RF), and with key attributes, the XGBoost classifier displayed 100% accuracy. In the presence of essential variables, the RF classifier performed better. Finally, to assess the precision of their predictions, the authors trained an RF classifier, validated it, and changed its hyperparameters. To address collinearity and decrease the quantity of predictors without sacrificing model accuracy, we have also used cluster analysis.

摘要

葡萄酒质量认证对葡萄酒行业至关重要。这项工作的主要目标是开发一种机器学习模型,使用该数据集预测葡萄酒质量。我们使用了来自红葡萄酒数据集(RWD)的样本,这些样本具有11种不同的理化特性。利用初始的RWD,训练并测试了五个机器学习(ML)模型。最准确的算法是随机森林(RF)和极端梯度提升(XGBoost)。使用这两种ML方法,从总共11个特征中选择前三个特征,并对其余特征进行ML分析。使用几个图表来展示基于XGBoost模型和RF的特征重要性。使用在特征选择过程中显示为必不可少的相关特征(通常称为基本要素)来预测葡萄酒质量。当不进行特征选择、进行特征选择(RF)以及使用关键属性进行训练和测试时,XGBoost分类器的准确率均为100%。在存在基本变量的情况下,RF分类器表现更好。最后,为了评估预测的精度,作者训练了一个RF分类器,对其进行验证并更改其超参数。为了解决共线性问题并在不牺牲模型准确性的情况下减少预测变量的数量,我们还使用了聚类分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/ae88cbd6baaf/41598_2023_44111_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/83934deebac7/41598_2023_44111_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/8e18c6a1e8d4/41598_2023_44111_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/9f335a000155/41598_2023_44111_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/c3f6ea5c3c21/41598_2023_44111_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/410edc5528d0/41598_2023_44111_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/c91e2516e327/41598_2023_44111_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/126151fd0b07/41598_2023_44111_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/e087645a7901/41598_2023_44111_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/3556541b267f/41598_2023_44111_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/ae88cbd6baaf/41598_2023_44111_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/83934deebac7/41598_2023_44111_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/8e18c6a1e8d4/41598_2023_44111_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/9f335a000155/41598_2023_44111_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/c3f6ea5c3c21/41598_2023_44111_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/410edc5528d0/41598_2023_44111_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/c91e2516e327/41598_2023_44111_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/126151fd0b07/41598_2023_44111_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/e087645a7901/41598_2023_44111_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/3556541b267f/41598_2023_44111_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e69e/10562461/ae88cbd6baaf/41598_2023_44111_Fig11_HTML.jpg

相似文献

1
Machine learning-based predictive modelling for the enhancement of wine quality.基于机器学习的预测建模以提升葡萄酒品质。
Sci Rep. 2023 Oct 9;13(1):17042. doi: 10.1038/s41598-023-44111-9.
2
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
3
Using machine learning to realize genetic site screening and genomic prediction of productive traits in pigs.利用机器学习实现猪生产性状的遗传位点筛选和基因组预测。
FASEB J. 2023 Jun;37(6):e22961. doi: 10.1096/fj.202300245R.
4
Construction of a Diagnostic Algorithm for Diagnosis of Adult Asthma Using Machine Learning with Random Forest and XGBoost.使用随机森林和XGBoost机器学习构建成人哮喘诊断算法
Diagnostics (Basel). 2023 Sep 27;13(19):3069. doi: 10.3390/diagnostics13193069.
5
Machine learning methods to predict 30-day hospital readmission outcome among US adults with pneumonia: analysis of the national readmission database.机器学习方法预测美国成年人肺炎 30 天内住院再入院结局:国家再入院数据库分析。
BMC Med Inform Decis Mak. 2022 Nov 9;22(1):288. doi: 10.1186/s12911-022-01995-3.
6
Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules.将机器学习病理诊断算法纳入甲状腺超声成像数据可提高甲状腺恶性结节的诊断风险。
Front Oncol. 2022 Dec 8;12:968784. doi: 10.3389/fonc.2022.968784. eCollection 2022.
7
Ultrafast pulse wave velocity and ensemble learning to predict atherosclerosis risk.超快脉搏波速度与集成学习预测动脉粥样硬化风险。
Int J Cardiovasc Imaging. 2022 Sep;38(9):1885-1893. doi: 10.1007/s10554-022-02574-3. Epub 2022 Feb 27.
8
Predictive modeling of blood pressure during hemodialysis: a comparison of linear model, random forest, support vector regression, XGBoost, LASSO regression and ensemble method.血液透析期间血压的预测建模:线性模型、随机森林、支持向量回归、XGBoost、LASSO回归及集成方法的比较
Comput Methods Programs Biomed. 2020 Oct;195:105536. doi: 10.1016/j.cmpb.2020.105536. Epub 2020 May 22.
9
Developing machine learning-based models to predict intrauterine insemination (IUI) success by address modeling challenges in imbalanced data and providing modification solutions for them.开发基于机器学习的模型,通过解决不平衡数据中的建模挑战来预测宫腔内人工授精(IUI)的成功率,并提供修改解决方案。
BMC Med Inform Decis Mak. 2022 Sep 1;22(1):228. doi: 10.1186/s12911-022-01974-8.
10
Prediction of acute organophosphate poisoning severity using machine learning techniques.利用机器学习技术预测急性有机磷中毒的严重程度。
Toxicology. 2023 Mar 1;486:153431. doi: 10.1016/j.tox.2023.153431. Epub 2023 Jan 19.

引用本文的文献

1
Machine learning-based prediction of volatile compounds profiles in Saccharomyces cerevisiae fermentation simulating canned meat.基于机器学习对模拟罐装肉类的酿酒酵母发酵过程中挥发性化合物谱的预测。
NPJ Sci Food. 2025 Jun 2;9(1):92. doi: 10.1038/s41538-025-00435-6.

本文引用的文献

1
Understanding Quality of Pinot Noir Wine: Can Modelling and Machine Learning Pave the Way?理解黑皮诺葡萄酒的品质:建模与机器学习能否引领方向?
Foods. 2022 Oct 3;11(19):3072. doi: 10.3390/foods11193072.
2
Risk management in sustainable supply chain: a knowledge map towards intellectual structure, logic diagram, and conceptual model.可持续供应链中的风险管理:知识图谱、逻辑图和概念模型的研究方向。
Environ Sci Pollut Res Int. 2022 Sep;29(44):66041-66067. doi: 10.1007/s11356-022-22255-x. Epub 2022 Aug 1.
3
Prediction of quality parameters in straw wine by means of FT-IR spectroscopy combined with multivariate data processing.
利用傅里叶变换红外光谱结合多元数据分析预测秸秆酒的质量参数。
Food Chem. 2020 Feb 1;305:125512. doi: 10.1016/j.foodchem.2019.125512. Epub 2019 Sep 10.
4
Prediction Models to Control Aging Time in Red Wine.预测红葡萄酒陈酿时间的模型。
Molecules. 2019 Feb 26;24(5):826. doi: 10.3390/molecules24050826.