• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

随机森林和特征重要性度量在预测心血管和呼吸疾病中用于区分最有影响的环境因素。

Random Forest and Feature Importance Measures for Discriminating the Most Influential Environmental Factors in Predicting Cardiovascular and Respiratory Diseases.

机构信息

DIBAF Department, University of Tuscia, 01100 Viterbo, Italy.

School of Engineering, University of Basilicata, Viale dell'Ateneo Lucano 10, 85100 Potenza, Italy.

出版信息

Int J Environ Res Public Health. 2024 Jul 2;21(7):867. doi: 10.3390/ijerph21070867.

DOI:10.3390/ijerph21070867
PMID:39063444
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11276884/
Abstract

BACKGROUND

Several studies suggest that environmental and climatic factors are linked to the risk of mortality due to cardiovascular and respiratory diseases; however, it is still unclear which are the most influential ones. This study sheds light on the potentiality of a data-driven statistical approach by providing a case study analysis.

METHODS

Daily admissions to the emergency room for cardiovascular and respiratory diseases are jointly analyzed with daily environmental and climatic parameter values (temperature, atmospheric pressure, relative humidity, carbon monoxide, ozone, particulate matter, and nitrogen dioxide). The Random Forest (RF) model and feature importance measure (FMI) techniques (permutation feature importance (PFI), Shapley Additive exPlanations (SHAP) feature importance, and the derivative-based importance measure (κALE)) are applied for discriminating the role of each environmental and climatic parameter. Data are pre-processed to remove trend and seasonal behavior using the Seasonal Trend Decomposition (STL) method and preliminary analyzed to avoid redundancy of information.

RESULTS

The RF performance is encouraging, being able to predict cardiovascular and respiratory disease admissions with a mean absolute relative error of 0.04 and 0.05 cases per day, respectively. Feature importance measures discriminate parameter behaviors providing importance rankings. Indeed, only three parameters (temperature, atmospheric pressure, and carbon monoxide) were responsible for most of the total prediction accuracy.

CONCLUSIONS

Data-driven and statistical tools, like the feature importance measure, are promising for discriminating the role of environmental and climatic factors in predicting the risk related to cardiovascular and respiratory diseases. Our results reveal the potential of employing these tools in public health policy applications for the development of early warning systems that address health risks associated with climate change, and improving disease prevention strategies.

摘要

背景

多项研究表明,环境和气候因素与心血管和呼吸系统疾病导致的死亡率有关;然而,哪些因素影响最大仍不清楚。本研究通过案例分析,为数据驱动的统计方法提供了一些启示。

方法

对心血管和呼吸系统疾病的急诊每日入院人数与每日环境和气候参数值(温度、大气压、相对湿度、一氧化碳、臭氧、颗粒物和二氧化氮)进行联合分析。随机森林(RF)模型和特征重要性度量(FMI)技术(排列特征重要性(PFI)、Shapley 可加解释(SHAP)特征重要性和基于导数的重要性度量(κALE))用于区分每个环境和气候参数的作用。使用季节性趋势分解(STL)方法对数据进行预处理以去除趋势和季节性行为,并进行初步分析以避免信息冗余。

结果

RF 的性能令人鼓舞,能够分别以 0.04 和 0.05 例/天的平均绝对相对误差预测心血管和呼吸系统疾病的入院人数。特征重要性度量可区分参数行为,提供重要性排名。事实上,只有三个参数(温度、大气压和一氧化碳)对总预测精度的贡献最大。

结论

数据驱动和统计工具,如特征重要性度量,在区分环境和气候因素在预测与心血管和呼吸系统疾病相关的风险方面具有很大的潜力。我们的结果表明,这些工具可用于公共卫生政策应用,以开发应对气候变化相关健康风险的早期预警系统,并改进疾病预防策略。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/3d6beb9dc358/ijerph-21-00867-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/18aaf57932a8/ijerph-21-00867-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/5442a44bf652/ijerph-21-00867-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/5cb55ead188e/ijerph-21-00867-g003a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/ee27664a2b3e/ijerph-21-00867-g004a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/2ed467425ab3/ijerph-21-00867-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/bbc64f460dd4/ijerph-21-00867-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/3d6beb9dc358/ijerph-21-00867-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/18aaf57932a8/ijerph-21-00867-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/5442a44bf652/ijerph-21-00867-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/5cb55ead188e/ijerph-21-00867-g003a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/ee27664a2b3e/ijerph-21-00867-g004a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/2ed467425ab3/ijerph-21-00867-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/bbc64f460dd4/ijerph-21-00867-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea4d/11276884/3d6beb9dc358/ijerph-21-00867-g007.jpg

相似文献

1
Random Forest and Feature Importance Measures for Discriminating the Most Influential Environmental Factors in Predicting Cardiovascular and Respiratory Diseases.随机森林和特征重要性度量在预测心血管和呼吸疾病中用于区分最有影响的环境因素。
Int J Environ Res Public Health. 2024 Jul 2;21(7):867. doi: 10.3390/ijerph21070867.
2
Analyzing the Interactions between Environmental Parameters and Cardiovascular Diseases Using Random Forest and SHAP Algorithms.使用随机森林和SHAP算法分析环境参数与心血管疾病之间的相互作用
Rev Cardiovasc Med. 2023 Nov 24;24(11):330. doi: 10.31083/j.rcm2411330. eCollection 2023 Nov.
3
Machine Learning Models for Predicting the Occurrence of Respiratory Diseases Using Climatic and Air-Pollution Factors.利用气候和空气污染因素预测呼吸系统疾病发生情况的机器学习模型
Clin Exp Otorhinolaryngol. 2022 May;15(2):168-176. doi: 10.21053/ceo.2021.01536. Epub 2022 Jan 7.
4
Extreme ambient temperatures and cardiorespiratory emergency room visits: assessing risk by comorbid health conditions in a time series study.极端环境温度与急诊室心肺疾病就诊情况:在一项时间序列研究中通过共病健康状况评估风险
Environ Health. 2014 Feb 3;13(1):5. doi: 10.1186/1476-069X-13-5.
5
Multifactorial airborne exposures and respiratory hospital admissions--the example of Santiago de Chile.多因素空气传播暴露与呼吸科住院治疗——以圣地亚哥智利为例。
Sci Total Environ. 2015 Jan 1;502:114-21. doi: 10.1016/j.scitotenv.2014.08.093. Epub 2014 Sep 20.
6
Short-term associations between emergency hospital admissions for respiratory and cardiovascular disease and outdoor air pollution in London.伦敦呼吸系统和心血管疾病急诊入院与室外空气污染之间的短期关联。
Arch Environ Health. 1999 Nov-Dec;54(6):398-411. doi: 10.1080/00039899909603371.
7
Multicity study of air pollution and mortality in Latin America (the ESCALA study).拉丁美洲空气污染与死亡率的多城市研究(ESCALA研究)。
Res Rep Health Eff Inst. 2012 Oct(171):5-86.
8
Effects of long-term exposure to traffic-related air pollution on respiratory and cardiovascular mortality in the Netherlands: the NLCS-AIR study.长期暴露于交通相关空气污染对荷兰呼吸道和心血管疾病死亡率的影响:荷兰长期队列空气污染研究(NLCS-AIR研究)
Res Rep Health Eff Inst. 2009 Mar(139):5-71; discussion 73-89.
9
Part 4. Interaction between air pollution and respiratory viruses: time-series study of daily mortality and hospital admissions in Hong Kong.第4部分. 空气污染与呼吸道病毒之间的相互作用:香港每日死亡率和住院人数的时间序列研究。
Res Rep Health Eff Inst. 2010 Nov(154):283-362.
10
Seasonal prediction of daily PM concentrations with interpretable machine learning: a case study of Beijing, China.基于可解释机器学习的日 PM 浓度季节性预测:以中国北京为例。
Environ Sci Pollut Res Int. 2022 Jun;29(30):45821-45836. doi: 10.1007/s11356-022-18913-9. Epub 2022 Feb 12.

引用本文的文献

1
Machine Learning-Based Random Forest to Predict 3-Year Survival after Endovascular Aneurysm Repair.基于机器学习的随机森林用于预测血管内动脉瘤修复术后3年生存率
Ann Thorac Cardiovasc Surg. 2025;31(1). doi: 10.5761/atcs.oa.25-00036.
2
Machine learning optimization of environmental factors influencing biomass and nutritional composition in local algal species.影响本地藻类物种生物量和营养成分的环境因素的机器学习优化
R Soc Open Sci. 2025 Apr 30;12(4):241336. doi: 10.1098/rsos.241336. eCollection 2025 Apr.
3
Prediction of respiratory diseases based on random forest model.

本文引用的文献

1
Effects of Meteo-Climatic Factors on Hospital Admissions for Cardiovascular Diseases in the City of Bari, Southern Italy.气象气候因素对意大利南部巴里市心血管疾病住院率的影响
Healthcare (Basel). 2023 Feb 26;11(5):690. doi: 10.3390/healthcare11050690.
2
Application of explainable artificial intelligence for healthcare: A systematic review of the last decade (2011-2022).可解释人工智能在医疗保健中的应用:过去十年(2011-2022 年)的系统回顾。
Comput Methods Programs Biomed. 2022 Nov;226:107161. doi: 10.1016/j.cmpb.2022.107161. Epub 2022 Sep 27.
3
Prediction and Diagnosis of Respiratory Disease by Combining Convolutional Neural Network and Bi-directional Long Short-Term Memory Methods.
基于随机森林模型的呼吸系统疾病预测
Front Public Health. 2025 Feb 14;13:1537238. doi: 10.3389/fpubh.2025.1537238. eCollection 2025.
4
Environmental Pollutants as Emerging Concerns for Cardiac Diseases: A Review on Their Impacts on Cardiac Health.环境污染物作为心血管疾病新出现的关注点:关于其对心脏健康影响的综述
Biomedicines. 2025 Jan 20;13(1):241. doi: 10.3390/biomedicines13010241.
卷积神经网络和双向长短时记忆方法相结合对呼吸疾病的预测和诊断。
Front Public Health. 2022 May 4;10:881234. doi: 10.3389/fpubh.2022.881234. eCollection 2022.
4
Social Determinants in Machine Learning Cardiovascular Disease Prediction Models: A Systematic Review.机器学习在心血管疾病预测模型中的社会决定因素:系统评价。
Am J Prev Med. 2021 Oct;61(4):596-605. doi: 10.1016/j.amepre.2021.04.016. Epub 2021 Jul 27.
5
Less is more: Detecting clinical deterioration in the hospital with machine learning using only age, heart rate, and respiratory rate.少即是多:仅使用年龄、心率和呼吸频率通过机器学习检测医院中的临床病情恶化情况。
Resuscitation. 2021 Nov;168:6-10. doi: 10.1016/j.resuscitation.2021.08.024. Epub 2021 Aug 23.
6
Pre-existing and machine learning-based models for cardiovascular risk prediction.基于既有数据和机器学习的心血管风险预测模型。
Sci Rep. 2021 Apr 26;11(1):8886. doi: 10.1038/s41598-021-88257-w.
7
Machine learning prediction in cardiovascular diseases: a meta-analysis.机器学习在心血管疾病中的预测:一项荟萃分析。
Sci Rep. 2020 Sep 29;10(1):16057. doi: 10.1038/s41598-020-72685-1.
8
Peak Outpatient and Emergency Department Visit Forecasting for Patients With Chronic Respiratory Diseases Using Machine Learning Methods: Retrospective Cohort Study.使用机器学习方法对慢性呼吸道疾病患者的门诊和急诊科就诊高峰进行预测:回顾性队列研究。
JMIR Med Inform. 2020 Mar 30;8(3):e13075. doi: 10.2196/13075.
9
The 2019 report of The Lancet Countdown on health and climate change: ensuring that the health of a child born today is not defined by a changing climate.柳叶刀倒计时报告 2019:健康与气候变化——确保今天出生的孩子不因气候变化而失去健康。
Lancet. 2019 Nov 16;394(10211):1836-1878. doi: 10.1016/S0140-6736(19)32596-6.
10
Association between rapid weather changes and incidence of chiefly cardiovascular complaints in the emergency department.快速天气变化与急诊科主要心血管投诉发生率之间的关联。
Am J Emerg Med. 2020 Aug;38(8):1604-1610. doi: 10.1016/j.ajem.2019.158440. Epub 2019 Sep 11.