Suppr超能文献

使用随机森林模型预测内罗毕县因PM2.5空气污染导致的呼吸系统疾病

Predicting Respiratory Diseases Attributed to PM2.5 Air Pollution in Nairobi County Using Random Forest Model.

作者信息

Okeyo Valine Atieno, Orowe Idah, Oguge Nicholas Otienoh

机构信息

University of Nairobi, Department of Mathematics, Kenya.

University of Nairobi, Center for Advanced Studies in Environmental Law and Policy, Kenya.

出版信息

Int J Innov Sci Res Technol. 2024 Jul;9(7):3489-3492. doi: 10.38124/ijisrt/ijisrt24jul1521.

Abstract

This study investigates the predictive capability of a Random Forest model in identifying respiratory diseases attributed to PM2.5 exposure in Nairobi County. Leveraging a comprehensive dataset encompassing demographic and air quality variables, the model demonstrated robust performance metrics, achieving an accuracy of 79.97% and an area under the curve (AUC) of 0.872. These results highlight the model's effectiveness in distinguishing between respiratory and cardiovascular conditions. The model's sensitivity and specificity were 81.88% and 73.27%, respectively, indicating a strong ability to correctly identify both true positives and true negatives. Analysis of feature importance revealed that age and PM2.5 concentrations were the most influential factors in predicting health outcomes, emphasizing the significant impact of air pollution and demographic factors on respiratory and cardiovascular health. Furthermore, the consistent train and test error rates across varying training set sizes suggest the model's stability and generalizability. This study underscores the importance of addressing air quality issues to mitigate the health impacts of PM2.5 exposure in urban settings.

摘要

本研究调查了随机森林模型在识别内罗毕县因接触细颗粒物(PM2.5)而导致的呼吸道疾病方面的预测能力。该模型利用包含人口统计和空气质量变量的综合数据集,展现出稳健的性能指标,准确率达到79.97%,曲线下面积(AUC)为0.872。这些结果凸显了该模型在区分呼吸道疾病和心血管疾病方面的有效性。该模型的灵敏度和特异度分别为81.88%和73.27%,表明其在正确识别真阳性和真阴性方面能力较强。特征重要性分析显示,年龄和PM2.5浓度是预测健康结果的最具影响力因素,强调了空气污染和人口因素对呼吸道和心血管健康的重大影响。此外,不同训练集规模下一致的训练误差率和测试误差率表明该模型具有稳定性和通用性。本研究强调了解决空气质量问题以减轻城市环境中PM2.5暴露对健康影响的重要性。

相似文献

1
Predicting Respiratory Diseases Attributed to PM2.5 Air Pollution in Nairobi County Using Random Forest Model.
Int J Innov Sci Res Technol. 2024 Jul;9(7):3489-3492. doi: 10.38124/ijisrt/ijisrt24jul1521.
3
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
4
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

本文引用的文献

1
Application of machine learning approaches to predict the impact of ambient air pollution on outpatient visits for acute respiratory infections.
Sci Total Environ. 2023 Feb 1;858(Pt 1):159509. doi: 10.1016/j.scitotenv.2022.159509. Epub 2022 Oct 17.
2
Identification of pediatric respiratory diseases using a fine-grained diagnosis system.
J Biomed Inform. 2021 May;117:103754. doi: 10.1016/j.jbi.2021.103754. Epub 2021 Apr 6.
3
When Are Sexist Attitudes Risk Factors for Dating Aggression? The Role of Moral Disengagement in Spanish Adolescents.
Int J Environ Res Public Health. 2021 Feb 17;18(4):1947. doi: 10.3390/ijerph18041947.
4
Air Quality in Africa: Public Health Implications.
Annu Rev Public Health. 2021 Apr 1;42:193-210. doi: 10.1146/annurev-publhealth-100119-113802. Epub 2021 Dec 21.
5
A study of deep learning methods for de-identification of clinical notes in cross-institute settings.
BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):232. doi: 10.1186/s12911-019-0935-4.
6
Using Resistin, glucose, age and BMI to predict the presence of breast cancer.
BMC Cancer. 2018 Jan 4;18(1):29. doi: 10.1186/s12885-017-3877-1.
7
Urban air pollution in Sub-Saharan Africa: Time for action.
Environ Pollut. 2017 Jan;220(Pt A):738-743. doi: 10.1016/j.envpol.2016.09.042. Epub 2016 Sep 16.
8
Machine Learning in Medicine.
Circulation. 2015 Nov 17;132(20):1920-30. doi: 10.1161/CIRCULATIONAHA.115.001593.
9
Big data analytics in healthcare: promise and potential.
Health Inf Sci Syst. 2014 Feb 7;2:3. doi: 10.1186/2047-2501-2-3. eCollection 2014.
10
Indoor air pollution and the lung in low- and medium-income countries.
Eur Respir J. 2012 Jul;40(1):239-54. doi: 10.1183/09031936.00190211. Epub 2012 Feb 23.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验