• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用堆叠集成学习对脑血管病的每日住院人数进行可解释预测。

Explainable prediction of daily hospitalizations for cerebrovascular disease using stacked ensemble learning.

机构信息

School of Computer Science and Engineering, University of Electronic Science and Technology of China, No.2006, Xiyuan Ave, West Hi-Tech Zone, 611731, Chengdu, Sichuan, People's Republic of China.

Big Data Research Center, University of Electronic Science and Technology of China, Chengdu, China.

出版信息

BMC Med Inform Decis Mak. 2023 Apr 6;23(1):59. doi: 10.1186/s12911-023-02159-7.

DOI:10.1186/s12911-023-02159-7
PMID:37024922
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10080841/
Abstract

BACKGROUND

With the prevalence of cerebrovascular disease (CD) and the increasing strain on healthcare resources, forecasting the healthcare demands of cerebrovascular patients has significant implications for optimizing medical resources.

METHODS

In this study, a stacking ensemble model comprised of four base learners (ridge regression, random forest, gradient boosting decision tree, and artificial neural network) and a meta learner (elastic net) was proposed for predicting the daily number of hospital admissions (HAs) for CD using the historical HAs data, air quality data, and meteorological data in Chengdu, China from 2015 to 2018. To solve the label imbalance problem, a re-weighting method based on label distribution smoothing was integrated into the meta learner. We trained the model using the data from 2015 to 2017 and evaluated its predictive ability using the data in 2018 based on four metrics, including mean absolute error (MAE), root mean square error (RMSE), mean absolute percentage error (MAPE), and coefficient of determination (R). In addition, the SHapley Additive exPlanations (SHAP) framework was applied to provide explanation for the prediction of our stacking model.

RESULTS

Our proposed model outperformed all the base learners and long short-term memory (LSTM) on two datasets. Particularly, compared with the optimal results obtained by individual models, the MAE, RMSE, and MAPE of the stacking model decreased by 13.9%, 12.7%, and 5.8%, respectively, and the R improved by 6.8% on CD dataset. The model explanation demonstrated that environmental features played a role in further improving the model performance and identified that high temperature and high concentrations of gaseous air pollutants might strongly associate with an increased risk of CD.

CONCLUSIONS

Our stacking model considering environmental exposure is efficient in predicting daily HAs for CD and has practical value in early warning and healthcare resource allocation.

摘要

背景

随着脑血管疾病(CD)的流行和医疗资源压力的增加,预测脑血管病患者的医疗需求对优化医疗资源具有重要意义。

方法

本研究提出了一种堆叠集成模型,该模型由四个基础学习者(岭回归、随机森林、梯度提升决策树和人工神经网络)和一个元学习者(弹性网络)组成,用于根据中国成都 2015 年至 2018 年的历史住院人数(HA)数据、空气质量数据和气象数据预测 CD 的每日 HA 数。为了解决标签不平衡问题,在元学习者中集成了一种基于标签分布平滑的重加权方法。我们使用 2015 年至 2017 年的数据训练模型,并根据 2018 年的数据评估其预测能力,使用的指标包括平均绝对误差(MAE)、均方根误差(RMSE)、平均绝对百分比误差(MAPE)和决定系数(R)。此外,还应用了 SHapley Additive exPlanations(SHAP)框架来为我们的堆叠模型的预测提供解释。

结果

我们提出的模型在两个数据集上的表现均优于所有基础学习者和长短期记忆(LSTM)。特别是,与单个模型的最佳结果相比,堆叠模型的 MAE、RMSE 和 MAPE 分别降低了 13.9%、12.7%和 5.8%,R 提高了 6.8%,在 CD 数据集上。模型解释表明,环境特征在进一步提高模型性能方面发挥了作用,并确定高温和高浓度气态空气污染物可能与 CD 风险增加密切相关。

结论

我们考虑环境暴露的堆叠模型在预测 CD 的每日 HA 方面是有效的,在预警和医疗资源分配方面具有实际价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/5aadad176800/12911_2023_2159_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/acfb7db39a36/12911_2023_2159_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/87d4b73781d5/12911_2023_2159_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/b4c434a662b8/12911_2023_2159_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/6f8f4b6653c4/12911_2023_2159_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/6a1b83ee8589/12911_2023_2159_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/cf936c646e09/12911_2023_2159_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/5aadad176800/12911_2023_2159_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/acfb7db39a36/12911_2023_2159_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/87d4b73781d5/12911_2023_2159_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/b4c434a662b8/12911_2023_2159_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/6f8f4b6653c4/12911_2023_2159_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/6a1b83ee8589/12911_2023_2159_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/cf936c646e09/12911_2023_2159_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ad1/10080841/5aadad176800/12911_2023_2159_Fig7_HTML.jpg

相似文献

1
Explainable prediction of daily hospitalizations for cerebrovascular disease using stacked ensemble learning.使用堆叠集成学习对脑血管病的每日住院人数进行可解释预测。
BMC Med Inform Decis Mak. 2023 Apr 6;23(1):59. doi: 10.1186/s12911-023-02159-7.
2
Predicting the incidence of infectious diarrhea with symptom surveillance data using a stacking-based ensembled model.基于堆叠集成模型的症状监测数据预测感染性腹泻发病率。
BMC Infect Dis. 2024 Feb 26;24(1):265. doi: 10.1186/s12879-024-09138-x.
3
Seasonal prediction of daily PM concentrations with interpretable machine learning: a case study of Beijing, China.基于可解释机器学习的日 PM 浓度季节性预测:以中国北京为例。
Environ Sci Pollut Res Int. 2022 Jun;29(30):45821-45836. doi: 10.1007/s11356-022-18913-9. Epub 2022 Feb 12.
4
Improving the precision of modeling the incidence of hemorrhagic fever with renal syndrome in mainland China with an ensemble machine learning approach.采用集成机器学习方法提高中国大陆肾综合征出血热发病率建模的精度。
PLoS One. 2021 Mar 16;16(3):e0248597. doi: 10.1371/journal.pone.0248597. eCollection 2021.
5
A comparative study of statistical and machine learning models on carbon dioxide emissions prediction of China.中国二氧化碳排放预测的统计和机器学习模型比较研究。
Environ Sci Pollut Res Int. 2023 Nov;30(55):117485-117502. doi: 10.1007/s11356-023-30428-5. Epub 2023 Oct 23.
6
A GA-stacking ensemble approach for forecasting energy consumption in a smart household: A comparative study of ensemble methods.基于 GA 堆叠的智能家居能耗预测集成方法研究:集成方法比较
J Environ Manage. 2024 Jul;364:121264. doi: 10.1016/j.jenvman.2024.121264. Epub 2024 Jun 12.
7
Water quality assessment of a river using deep learning Bi-LSTM methodology: forecasting and validation.基于深度学习 Bi-LSTM 方法的河流水质评估:预测与验证。
Environ Sci Pollut Res Int. 2022 Feb;29(9):12875-12889. doi: 10.1007/s11356-021-13875-w. Epub 2021 May 14.
8
Predicting machine's performance record using the stacked long short-term memory (LSTM) neural networks.利用堆叠长短期记忆(LSTM)神经网络预测机器的性能记录。
J Appl Clin Med Phys. 2022 Mar;23(3):e13558. doi: 10.1002/acm2.13558. Epub 2022 Feb 16.
9
Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure.考虑环境暴露因素的机器学习方法预测心血管病入院高峰日
BMC Med Inform Decis Mak. 2020 May 1;20(1):83. doi: 10.1186/s12911-020-1101-8.
10
Research on prediction of daily admissions of respiratory diseases with comorbid diabetes in Beijing based on long short-term memory recurrent neural network.基于长短时记忆递归神经网络的北京市伴发糖尿病的呼吸系统疾病日住院人数预测研究。
Zhejiang Da Xue Xue Bao Yi Xue Ban. 2022 Feb 25;51(1):1-9. doi: 10.3724/zdxbyxb-2021-0227.

引用本文的文献

1
Predicting hospital outpatient volume using XGBoost: a machine learning approach.使用XGBoost预测医院门诊量:一种机器学习方法。
Sci Rep. 2025 May 16;15(1):17028. doi: 10.1038/s41598-025-01265-y.
2
Estimating the volume of penumbra in rodents using DTI and stack-based ensemble machine learning framework.使用 DTI 和基于堆叠的集成机器学习框架估计啮齿动物的半影体积。
Eur Radiol Exp. 2024 May 15;8(1):59. doi: 10.1186/s41747-024-00455-z.

本文引用的文献

1
Report on Cardiovascular Health and Diseases in China 2021: An Updated Summary.中国心血管健康与疾病报告 2021:概要更新
Biomed Environ Sci. 2022 Jul 20;35(7):573-603. doi: 10.3967/bes2022.079.
2
Association of ambient air pollution with risk of hemorrhagic stroke: A time-stratified case crossover analysis of the Singapore stroke registry.环境空气污染与出血性中风风险的关联:基于新加坡中风登记处的时间分层病例交叉分析
Int J Hyg Environ Health. 2022 Mar;240:113908. doi: 10.1016/j.ijheh.2021.113908. Epub 2021 Dec 30.
3
Interpretability Analysis of One-Year Mortality Prediction for Stroke Patients Based on Deep Neural Network.
基于深度神经网络的中风患者一年死亡率预测的可解释性分析
IEEE J Biomed Health Inform. 2022 Apr;26(4):1903-1910. doi: 10.1109/JBHI.2021.3123657. Epub 2022 Apr 14.
4
Immediate and delayed effects of climatic factors on hospital admissions for schizophrenia in Queensland Australia: A time series analysis.澳大利亚昆士兰州气候因素对精神分裂症住院的即时和延迟影响:时间序列分析。
Environ Res. 2021 Jun;197:111003. doi: 10.1016/j.envres.2021.111003. Epub 2021 Mar 11.
5
Prediction of Daily Blood Sampling Room Visits Based on ARIMA and SES Model.基于 ARIMA 和 SES 模型的每日血样采集室访问量预测。
Comput Math Methods Med. 2020 Sep 3;2020:1720134. doi: 10.1155/2020/1720134. eCollection 2020.
6
Deep learning for predicting the occurrence of cardiopulmonary diseases in Nanjing, China.深度学习预测中国南京心肺疾病的发生。
Chemosphere. 2020 Oct;257:127176. doi: 10.1016/j.chemosphere.2020.127176. Epub 2020 May 27.
7
High ambient temperature in summer and risk of stroke or transient ischemic attack: A national study in Israel.夏季高环境温度与中风或短暂性脑缺血发作风险:以色列的一项全国性研究。
Environ Res. 2020 Aug;187:109678. doi: 10.1016/j.envres.2020.109678. Epub 2020 May 20.
8
Air pollution and hospitalization: an autoregressive distributed lag (ARDL) approach.空气污染与住院治疗:自回归分布滞后(ARDL)方法。
Environ Sci Pollut Res Int. 2020 Aug;27(24):30673-30680. doi: 10.1007/s11356-020-09152-x. Epub 2020 May 29.
9
Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure.考虑环境暴露因素的机器学习方法预测心血管病入院高峰日
BMC Med Inform Decis Mak. 2020 May 1;20(1):83. doi: 10.1186/s12911-020-1101-8.
10
Short-term effects of extreme temperatures on cause specific cardiovascular admissions in Beijing, China.中国北京极端温度对特定病因心血管疾病入院的短期影响。
Environ Res. 2020 Jul;186:109455. doi: 10.1016/j.envres.2020.109455. Epub 2020 Apr 3.