• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于Cubist回归模型的煤的总热值估算

Estimation of gross calorific value of coal based on the cubist regression model.

作者信息

Chen Junlin, He Yuli, Liang Yuexia, Wang Wenjia, Duan Xiong

机构信息

College of Earth Sciences and Resources, China University of Geosciences (Beijing), Beijing, 100083, China.

School of Geographical Sciences, China West Normal University, Nanchong, 637009, Sichuan, China.

出版信息

Sci Rep. 2024 Oct 5;14(1):23176. doi: 10.1038/s41598-024-74469-3.

DOI:10.1038/s41598-024-74469-3
PMID:39369086
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11455942/
Abstract

The gross calorific value (GCV) of coal is an important parameter for evaluating coal quality, and regression analysis methods can be used to predict GCV. In this study, we proposed a GCV prediction model based on cubist regression. To develop a good regression model, feature selection of input variables was performed using a correlation analysis and a recursive feature elimination algorithm. Thus, in this study, we determined three sets of variables as the optimal combination for regression models: proximate analysis variables (Set 1: moisture, standard ash, and volatile matter), element analysis variables (Set 2: carbon, sulfur, and oxygen), and comprehensive index variables (Set 3: carbon, volatile matter, standard ash, sulfur, moisture, and hydrogen). Results for comparison with multiple linear regression, random forest regression, and numerous previous prediction models, such as gradient boosting regression tree, support vector regression (SVR), backpropagation neural networks, and particle swarm optimization-artificial neural network (PSO-ANN), indicate that these seven regression models have the best fitting effect on the comprehensive index variables among the three sets of input variables. The cubist model showed higher prediction accuracy and lower error than most other models (R, mean absolute error, root mean square error, and average absolute relative deviation percentage values are 0.990, 0.476, 0.668, and 0.086% for the proximate analysis variables; 0.992, 0.381, 0.596, and 0.140% for element analysis variables; and 0.999, 0.161, 0.219, and 0.087% for comprehensive index variables, respectively). The cubist model combines the advantages of decision tree and linear regression, which not only enables it to perform well in terms of accuracy but also makes the model highly interpretable because it is based on multiple sublinear equations. In addition, the cubist model shows obvious advantages in terms of running speed, especially compared with SVR and PSO-ANN, which require complex parameter optimization. In summary, the cubist model considers the prediction accuracy, model interpretability, and computational efficiency as well as provides a new and effective method for GCV prediction.

摘要

煤的高位发热量(GCV)是评估煤质的一个重要参数,回归分析方法可用于预测GCV。在本研究中,我们提出了一种基于Cubist回归的GCV预测模型。为了建立一个良好的回归模型,使用相关分析和递归特征消除算法对输入变量进行特征选择。因此,在本研究中,我们确定了三组变量作为回归模型的最优组合:工业分析变量(第1组:水分、标准灰分和挥发分)、元素分析变量(第2组:碳、硫和氧)以及综合指标变量(第3组:碳、挥发分、标准灰分、硫、水分和氢)。与多元线性回归、随机森林回归以及众多先前的预测模型(如梯度提升回归树、支持向量回归(SVR)、反向传播神经网络和粒子群优化 - 人工神经网络(PSO - ANN))的比较结果表明,这七个回归模型在三组输入变量中对综合指标变量具有最佳拟合效果。Cubist模型比大多数其他模型显示出更高的预测精度和更低的误差(对于工业分析变量,R、平均绝对误差、均方根误差和平均绝对相对偏差百分比值分别为0.990、0.476、0.668和0.086%;对于元素分析变量,分别为0.992、0.381、0.596和0.140%;对于综合指标变量,分别为0.999、0.161、0.219和0.087%)。Cubist模型结合了决策树和线性回归的优点,这不仅使其在准确性方面表现良好,而且由于它基于多个子线性方程,使得模型具有高度可解释性。此外,Cubist模型在运行速度方面显示出明显优势,特别是与需要复杂参数优化的SVR和PSO - ANN相比。总之,Cubist模型兼顾了预测精度、模型可解释性和计算效率,为GCV预测提供了一种新的有效方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/dcacc43fe00c/41598_2024_74469_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/a753d1a93821/41598_2024_74469_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/03f2fcecd1c9/41598_2024_74469_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/46e07cbb89e4/41598_2024_74469_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/e156366c7c75/41598_2024_74469_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/8b6fa6deeb77/41598_2024_74469_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/4744d5a4f4b1/41598_2024_74469_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/9f8603864086/41598_2024_74469_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/dcacc43fe00c/41598_2024_74469_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/a753d1a93821/41598_2024_74469_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/03f2fcecd1c9/41598_2024_74469_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/46e07cbb89e4/41598_2024_74469_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/e156366c7c75/41598_2024_74469_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/8b6fa6deeb77/41598_2024_74469_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/4744d5a4f4b1/41598_2024_74469_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/9f8603864086/41598_2024_74469_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbf5/11455942/dcacc43fe00c/41598_2024_74469_Fig8_HTML.jpg

相似文献

1
Estimation of gross calorific value of coal based on the cubist regression model.基于Cubist回归模型的煤的总热值估算
Sci Rep. 2024 Oct 5;14(1):23176. doi: 10.1038/s41598-024-74469-3.
2
Prediction of gross calorific value from coal analysis using decision tree-based bagging and boosting techniques.使用基于决策树的装袋和提升技术从煤炭分析中预测总发热量。
Heliyon. 2023 Dec 7;10(1):e23395. doi: 10.1016/j.heliyon.2023.e23395. eCollection 2024 Jan 15.
3
Fuzzy-based prediction of solar PV and wind power generation for microgrid modeling using particle swarm optimization.基于模糊理论的太阳能光伏和风力发电预测,用于采用粒子群优化算法的微电网建模
Heliyon. 2023 Jan 5;9(1):e12802. doi: 10.1016/j.heliyon.2023.e12802. eCollection 2023 Jan.
4
Prediction of Individual Gas Yields of Supercritical Water Gasification of Lignocellulosic Biomass by Machine Learning Models.基于机器学习模型预测木质纤维素生物质超临界水气化的个体产气率。
Molecules. 2024 May 16;29(10):2337. doi: 10.3390/molecules29102337.
5
Quantitative analysis of coal quality by laser-induced breakdown spectroscopy assisted with different chemometric methods.采用不同化学计量学方法辅助的激光诱导击穿光谱定量分析煤质。
Anal Methods. 2020 Jul 16;12(27):3530-3536. doi: 10.1039/d0ay00905a.
6
Improved ANN-Based Approach Using Relative Impact for the Prediction of Thermal Coal Elemental Composition Using Proximate Analysis.基于人工神经网络的改进方法:利用相对影响通过近似分析预测动力煤元素组成
ACS Omega. 2022 Aug 16;7(34):29734-29746. doi: 10.1021/acsomega.2c02324. eCollection 2022 Aug 30.
7
Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。
Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.
8
Development of a Rapid Coal Analyzer Using Laser-Induced Breakdown Spectroscopy (LIBS).利用激光诱导击穿光谱(LIBS)快速煤质分析仪的研制。
Appl Spectrosc. 2018 Aug;72(8):1225-1233. doi: 10.1177/0003702818772856. Epub 2018 Jun 1.
9
An extensive experimental survey of regression methods.回归方法的广泛实验调查。
Neural Netw. 2019 Mar;111:11-34. doi: 10.1016/j.neunet.2018.12.010. Epub 2018 Dec 21.
10
Seasonal prediction of daily PM concentrations with interpretable machine learning: a case study of Beijing, China.基于可解释机器学习的日 PM 浓度季节性预测:以中国北京为例。
Environ Sci Pollut Res Int. 2022 Jun;29(30):45821-45836. doi: 10.1007/s11356-022-18913-9. Epub 2022 Feb 12.

本文引用的文献

1
Potential application of machine learning for exploring adsorption mechanisms of pharmaceuticals onto biochars.机器学习在探索药物吸附到生物炭上的吸附机制中的潜在应用。
Chemosphere. 2022 Jan;287(Pt 2):132203. doi: 10.1016/j.chemosphere.2021.132203. Epub 2021 Sep 8.
2
Gradient boosting machines, a tutorial.梯度提升机,教程。
Front Neurorobot. 2013 Dec 4;7:21. doi: 10.3389/fnbot.2013.00021. eCollection 2013.
3
A comparison of random forest regression and multiple linear regression for prediction in neuroscience.随机森林回归与多元线性回归在神经科学预测中的比较。
J Neurosci Methods. 2013 Oct 30;220(1):85-91. doi: 10.1016/j.jneumeth.2013.08.024. Epub 2013 Sep 6.
4
A working guide to boosted regression trees.提升回归树实用指南。
J Anim Ecol. 2008 Jul;77(4):802-13. doi: 10.1111/j.1365-2656.2008.01390.x. Epub 2008 Apr 8.
5
Prediction error estimation: a comparison of resampling methods.预测误差估计:重采样方法的比较
Bioinformatics. 2005 Aug 1;21(15):3301-7. doi: 10.1093/bioinformatics/bti499. Epub 2005 May 19.
6
New support vector algorithms.新的支持向量算法。
Neural Comput. 2000 May;12(5):1207-45. doi: 10.1162/089976600300015565.