Prediction of hydrogen and methane yields from gasification of leather waste using machine learning and explainable AI: An original dataset.

Authors

Cihan Pınar, Alfarra Fatma, Kurtulus Ozcan H, Ciner Mirac Nur, Ongen Atakan

Affiliations

Corlu Engineering Faculty, Department of Computer Engineering, Tekirdag Namık Kemal University, 59860, Çorlu, Tekirdag, Turkey.

Engineering Faculty, Department of Environmental Engineering, Istanbul University-Cerrahpaşa, 34320, Avcilar, Istanbul, Turkey.

Publication information

J Environ Manage. 2025 Sep;391:126521. doi: 10.1016/j.jenvman.2025.126521. Epub 2025 Jul 19.

DOI: 10.1016/j.jenvman.2025.126521
PMID: 40684595

Abstract

Accurately predicting syngas composition is essential for optimizing energy production and ensuring environmental sustainability. Despite the growing use of machine learning techniques in this field, publicly available datasets remain limited, and existing datasets contain relatively few samples. To bridge this gap, we generated a comprehensive dataset of 3748 samples under controlled laboratory conditions and publicly shared it on Kaggle (https://www.kaggle.com/datasets/miracnurciner/gasification-dataset). This study aims to identify the most successful machine learning model for predicting H₂ and CH₄ gas concentrations by evaluating nine models: Random Forest (RF), Linear Regression (LR), Decision Tree (DT), Support Vector Regression (Linear and RBF), K-Nearest Neighbors (KNN), Gradient Boosting Regressor (GBR), XGBoost, CatBoost, and LightGBM. Model performance was assessed using multiple metrics, including the coefficient of determination (R²), root mean squared error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE), and explained variance score (EVS). The Friedman test was applied to evaluate the statistical significance of performance differences among the models. The results show that the KNN model achieved the highest predictive performance for both H₂ (R² = 0.987, RMSE = 1.253) and CH₄ (R² = 0.979, RMSE = 0.920). The Friedman test shows that the performance differences between the models are statistically significant (p < 0.001). By integrating Shapley Additive Explanations (SHAP) into the model, the contribution of each feature to the prediction results is clarified. SHAP analysis highlights that temperature and time are the main features affecting H₂ and CH₄ gas. This study highlights the potential of machine learning techniques for biomass gas prediction and advocates for integrating Explainable AI (XAI) methods, establishing a robust foundation for future research. Furthermore, by providing a large, publicly available dataset, this research significantly advances studies in syngas composition prediction.
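
The five evaluation metrics named in the abstract can be sketched in a few lines of plain Python. This is only an illustration of the standard definitions, not the authors' code: the function name `regression_metrics` and the sample values are hypothetical, and MAPE is returned here as a fraction (the abstract does not say whether the study reports it as a fraction or a percentage).

```python
import math


def regression_metrics(y_true, y_pred):
    """Return R^2, RMSE, MAE, MAPE and EVS for two equal-length sequences.

    Hypothetical helper using the textbook definitions; it assumes
    y_true contains no zeros (needed for MAPE) and is not constant.
    """
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]

    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)                  # residual sum of squares
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)   # total sum of squares

    mean_err = sum(errors) / n
    var_err = sum((e - mean_err) ** 2 for e in errors) / n
    var_true = ss_tot / n

    return {
        "r2": 1.0 - ss_res / ss_tot,
        "rmse": math.sqrt(ss_res / n),
        "mae": sum(abs(e) for e in errors) / n,
        "mape": sum(abs(e / t) for e, t in zip(errors, y_true)) / n,
        # EVS differs from R^2 only when the mean error is non-zero.
        "evs": 1.0 - var_err / var_true,
    }


# Made-up concentrations for illustration only, not values from the dataset.
example = regression_metrics([10, 20, 30, 40], [12, 18, 33, 39])
for name, value in example.items():
    print(f"{name}: {value:.4f}")
```

R² and EVS coincide when the prediction errors average to zero, so a gap between the two points to a systematic bias in the model.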


Similar articles

1. Prediction of hydrogen and methane yields from gasification of leather waste using machine learning and explainable AI: An original dataset.
   J Environ Manage. 2025 Sep;391:126521. doi: 10.1016/j.jenvman.2025.126521. Epub 2025 Jul 19.
2. Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.
   J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
3. Explainable AI-driven prediction of APE1 inhibitors: enhancing cancer therapy with machine learning models and feature importance analysis.
   Mol Divers. 2025 Feb 21. doi: 10.1007/s11030-025-11133-6.
4. Approaches for predicting dairy cattle methane emissions: from traditional methods to machine learning.
   J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae219.
5. Development of an explainable machine learning model for predicting device-related pressure injuries in clinical settings.
   BMC Med Inform Decis Mak. 2025 Jul 9;25(1):256. doi: 10.1186/s12911-025-03090-9.
6. Machine learning based prediction of geotechnical parameters affecting slope stability in open-pit iron ore mines in high precipitation zone.
   Sci Rep. 2025 Jul 1;15(1):21868. doi: 10.1038/s41598-025-99026-4.
7. Optimized feature selection and advanced machine learning for stroke risk prediction in revascularized coronary artery disease patients.
   BMC Med Inform Decis Mak. 2025 Jul 24;25(1):276. doi: 10.1186/s12911-025-03116-2.
8. Machine Learning Model for Predicting Coronary Heart Disease Risk: Development and Validation Using Insights From a Japanese Population-Based Study.
   JMIR Cardio. 2025 May 12;9:e68066. doi: 10.2196/68066.
9. Application of multimodal machine learning-based analysis for the biomethane yields of NaOH-pretreated biomass.
   Sci Rep. 2025 Jul 8;15(1):24372. doi: 10.1038/s41598-025-09527-5.
10. Machine learning framework for oxytetracycline removal using nanostructured cupric oxide supported on magnetic chitosan alginate biocomposite.
    Sci Rep. 2025 Jul 18;15(1):26124. doi: 10.1038/s41598-025-11424-w.