• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于埃塞俄比亚6至23个月儿童肺炎预测及决定因素分层的数据驱动型机器学习算法模型

Data-driven machine learning algorithm model for pneumonia prediction and determinant factor stratification among children aged 6-23 months in Ethiopia.

作者信息

Demsash Addisalem Workie, Abebe Rediet, Gezimu Wubishet, Kitil Gemeda Wakgari, Tizazu Michael Amera, Lembebo Abera, Bekele Firomsa, Alemu Solomon Seyife, Jarso Mohammedamin Hajure, Dube Geleta, Wedajo Lema Fikadu, Purohit Sanju, Kalayou Mulugeta Hayelom

机构信息

Debre Berhan University, Asrat Woldeyes Health Science Campus, Public Health Department, Debre Berihan, Ethiopia.

Mattu University, Health Science College, Mettu, Ethiopia.

出版信息

BMC Infect Dis. 2025 May 2;25(1):647. doi: 10.1186/s12879-025-10916-4.

DOI:10.1186/s12879-025-10916-4
PMID:40316929
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12048943/
Abstract

INTRODUCTION

Pneumonia is the leading cause of child morbidity and mortality and accounts for 5.6 million under-five child deaths. Pneumonia has a significant impact on the quality of life, the country's economy, and the survival of children. Therefore, this study aimed to develop data-driven predictive model using machine learning algorithms to predict pneumonia and stratify the determinant factors among children aged 6-23 months in Ethiopia.

METHODS

A total of 2035 samples of children were used from the 2016 Ethiopian Demographic and Health Survey dataset. Jupyter Notebook from Anaconda Navigators was used for data management and analysis. Important libraries such as Pandas, Seaborn, and Numpy were imported from Python. The data was pre-processed into a training and testing dataset with a 4:1 ratio, and tenfold cross-validation was used to reduce bias and enhance the models' performance. Six machine learning algorithms were used for model building and comparison, and confusion matrix elements were used to evaluate the performance of each algorithm. Principal component analysis and heatmap function were used for correlation detection between features. Feature importance score was used to identify and stratify the most important predictors of pneumonia.

RESULTS

From 2035 total samples, 16.6%, 20.1%, and 24.2% of children had short rapid breath, fever, and cough respectively. The overall magnitude of pneumonia among children aged 6-23 months was 31.3% based on the 2016 EDHS report. A random forest algorithm is the relatively best performance model to predict pneumonia and stratify its determinates with 91.3% accuracy. The health facility visits, child sex, initiation of breastfeeding, birth interval, birth weight, husbands' education, women's age, and region, are the top eight important predictors of pneumonia among children with important scores of more than 5% to 20% respectively.

CONCLUSIONS

Random forest is the best model to predict pneumonia and stratify its determinant factors. The implications of this study are profound for advanced research methodology, tailored to promote effective health interventions such as lifestyle modification and behavioral intervention, based on individuals' unique features, specifically for stakeholders to take proactive childcare interventions. The study would serve as pioneering evidence for future research, and researchers are recommended to use deep learning algorithms to enhance prediction accuracy.

摘要

引言

肺炎是儿童发病和死亡的主要原因,占560万五岁以下儿童死亡病例。肺炎对生活质量、国家经济和儿童生存有着重大影响。因此,本研究旨在使用机器学习算法开发数据驱动的预测模型,以预测埃塞俄比亚6至23个月大儿童的肺炎情况并对决定因素进行分层。

方法

从2016年埃塞俄比亚人口与健康调查数据集选取了总共2035个儿童样本。使用Anaconda Navigators中的Jupyter Notebook进行数据管理和分析。从Python导入了诸如Pandas、Seaborn和Numpy等重要库。数据以4:1的比例预处理为训练和测试数据集,并使用十折交叉验证来减少偏差并提高模型性能。使用六种机器学习算法进行模型构建和比较,并使用混淆矩阵元素来评估每种算法的性能。主成分分析和热图函数用于检测特征之间的相关性。特征重要性得分用于识别和分层肺炎最重要的预测因素。

结果

在总共2035个样本中,分别有16.6%、20.1%和24.2%的儿童出现呼吸急促、发热和咳嗽症状。根据2016年埃塞俄比亚人口与健康调查(EDHS)报告,6至23个月大儿童的肺炎总体发病率为31.3%。随机森林算法是预测肺炎及其决定因素的相对最佳性能模型,准确率达91.3%。医疗机构就诊、儿童性别、母乳喂养开始情况、生育间隔、出生体重、丈夫教育程度、妇女年龄和地区,是肺炎最重要的八个预测因素,重要性得分分别超过5%至20%。

结论

随机森林是预测肺炎及其决定因素的最佳模型。本研究对于先进研究方法具有深远意义,这些方法旨在根据个体独特特征促进有效的健康干预措施,如生活方式改变和行为干预,特别是为利益相关者采取积极的儿童保育干预措施提供依据。该研究将为未来研究提供开创性证据,建议研究人员使用深度学习算法提高预测准确性。

相似文献

1
Data-driven machine learning algorithm model for pneumonia prediction and determinant factor stratification among children aged 6-23 months in Ethiopia.用于埃塞俄比亚6至23个月儿童肺炎预测及决定因素分层的数据驱动型机器学习算法模型
BMC Infect Dis. 2025 May 2;25(1):647. doi: 10.1186/s12879-025-10916-4.
2
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
3
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
4
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
5
Incentives for preventing smoking in children and adolescents.预防儿童和青少年吸烟的激励措施。
Cochrane Database Syst Rev. 2017 Jun 6;6(6):CD008645. doi: 10.1002/14651858.CD008645.pub3.
6
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
7
Education support services for improving school engagement and academic performance of children and adolescents with a chronic health condition.改善患有慢性病的儿童和青少年的学校参与度和学业成绩的教育支持服务。
Cochrane Database Syst Rev. 2023 Feb 8;2(2):CD011538. doi: 10.1002/14651858.CD011538.pub2.
8
Development of machine learning model for predicting prolonged operation time in lumbar stenosis undergoing posterior lumbar interbody fusion: a multicenter study.用于预测接受后路腰椎椎间融合术的腰椎管狭窄症患者手术时间延长的机器学习模型的开发:一项多中心研究。
Spine J. 2025 Mar;25(3):460-473. doi: 10.1016/j.spinee.2024.10.001. Epub 2024 Oct 19.
9
Prediction of Insulin Resistance in Nondiabetic Population Using LightGBM and Cohort Validation of Its Clinical Value: Cross-Sectional and Retrospective Cohort Study.使用LightGBM预测非糖尿病人群的胰岛素抵抗及其临床价值的队列验证:横断面和回顾性队列研究
JMIR Med Inform. 2025 Jun 13;13:e72238. doi: 10.2196/72238.
10
Systematic review and validation of prediction rules for identifying children with serious infections in emergency departments and urgent-access primary care.系统评价和验证预测规则,以识别急诊科和紧急初级保健中严重感染的儿童。
Health Technol Assess. 2012;16(15):1-100. doi: 10.3310/hta16150.

引用本文的文献

1
Correction: Data-driven machine learning algorithm model for pneumonia prediction and determinant factor stratification among children aged 6-23 months in Ethiopia.更正:埃塞俄比亚6至23个月儿童肺炎预测及决定因素分层的数据驱动机器学习算法模型
BMC Infect Dis. 2025 Jun 26;25(1):808. doi: 10.1186/s12879-025-11137-5.

本文引用的文献

1
Factors associated with severe pneumonia among children <5 years, Kasese District, Uganda: a case-control study, January-April 2023.乌干达卡塞塞区5岁以下儿童重症肺炎相关因素:一项病例对照研究,2023年1月至4月
Pneumonia (Nathan). 2024 Jul 25;16(1):13. doi: 10.1186/s41479-024-00134-y.
2
Factors of acute respiratory infection among under-five children across sub-Saharan African countries using machine learning approaches.机器学习方法在撒哈拉以南非洲国家五岁以下儿童急性呼吸道感染因素分析中的应用
Sci Rep. 2024 Jul 9;14(1):15801. doi: 10.1038/s41598-024-65620-1.
3
Mothers' experience of losing infants by death and its predictors in Ethiopia.
母亲在埃塞俄比亚失去婴儿死亡的经历及其预测因素。
PLoS One. 2024 Jun 28;19(6):e0303358. doi: 10.1371/journal.pone.0303358. eCollection 2024.
4
Mental health analysis of international students using machine learning techniques.留学生心理健康的机器学习分析。
PLoS One. 2024 Jun 6;19(6):e0304132. doi: 10.1371/journal.pone.0304132. eCollection 2024.
5
Nutritional supplementation in children with severe pneumonia in Uganda and Kenya (COAST-Nutrition): a phase 2 randomised controlled trial.乌干达和肯尼亚重症肺炎患儿的营养补充(COAST-营养研究):一项2期随机对照试验。
EClinicalMedicine. 2024 May 14;72:102640. doi: 10.1016/j.eclinm.2024.102640. eCollection 2024 Jun.
6
Birth preparedness and pregnancy complication readiness and associated factors among pregnant women in Ethiopia: A multilevel analysis.埃塞俄比亚孕妇的分娩准备和妊娠并发症应对能力及其相关因素:一项多层次分析。
PLOS Glob Public Health. 2024 May 15;4(5):e0003127. doi: 10.1371/journal.pgph.0003127. eCollection 2024.
7
Health System Considerations for Community-Based Implementation of Automated Respiratory Counters to Identify Childhood Pneumonia in 5 Regions of Ethiopia: A Qualitative Study.埃塞俄比亚5个地区基于社区实施自动呼吸计数器以识别儿童肺炎的卫生系统考量:一项定性研究
Int J Health Policy Manag. 2023;12:7385. doi: 10.34172/ijhpm.2023.7385. Epub 2023 Oct 18.
8
Empowering child health: Harnessing machine learning to predict acute respiratory infections in Ethiopian under-fives using demographic and health survey insights.赋能儿童健康:利用机器学习,根据埃塞俄比亚五岁以下儿童人口与健康调查数据预测急性呼吸道感染。
BMC Infect Dis. 2024 Mar 21;24(1):338. doi: 10.1186/s12879-024-09195-2.
9
Spatial variation and determinants of delayed breastfeeding initiation in Ethiopia: spatial and multilevel analysis of recent evidence from EDHS 2019.埃塞俄比亚延迟母乳喂养开始的空间变化及其决定因素:来自 EDHS 2019 的最新证据的空间和多层次分析。
Int Breastfeed J. 2024 Feb 7;19(1):10. doi: 10.1186/s13006-024-00616-1.
10
The effects of reproductive variables on child mortality in Ethiopia: evidence from demographic and health surveys from 2000 to 2016.生殖变量对埃塞俄比亚儿童死亡率的影响:来自 2000 年至 2016 年人口与健康调查的证据。
Reprod Health. 2024 Jan 10;21(1):4. doi: 10.1186/s12978-023-01734-5.