• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在孟加拉国,利用机器学习技术探究与五岁以下儿童死亡相关的因素,并评估儿童死亡风险。

Explore the factors related to the death of offspring under age five and appraise the hazard of child mortality using machine learning techniques in Bangladesh.

作者信息

Rahman Ashikur, Rahman Md Habibur

机构信息

Department of Statistics and Data Science, Jahangirnagar University, Dhaka, 1342, Bangladesh.

出版信息

BMC Public Health. 2025 Jan 29;25(1):360. doi: 10.1186/s12889-025-21460-w.

DOI:10.1186/s12889-025-21460-w
PMID:39881228
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11776272/
Abstract

BACKGROUND

Child mortality is a reliable and significant indicator of a nation's health. Although the child mortality rate in Bangladesh is declining over time, it still needs to drop even more in order to meet the Sustainable Development Goals (SDGs). Machine Learning models are one of the best tools for making more accurate and efficient forecasts and gaining in-depth knowledge. A deeper understanding is crucial for significantly reducing child mortality rates. Accurate predictions using machine learning models can empower authorities to implement timely interventions and raise awareness. So, the study aimed to explore the factors related to child mortality and assess the efficacy of various machine-learning models in predicting child mortality in Bangladesh.

METHODS AND MATERIALS

About Forty-two thousand observations, except the missing observations, were extracted for this study from the Bangladesh Demographic and Health Survey (BDHS) data conducted in 2017-18. The survey utilized a two-stage stratified sampling method, selecting 675 enumeration areas-250 in urban settings and 425 in rural areas-resulting in effective data collection from 672 clusters and 20160 households. The Chi-square test and recursive feature elimination (RFE) are used to find the relevant risk factors of child mortality among the number of factors. Six ML-based algorithms were implemented for predicting child mortality, such as Naïve Bayes, Classification and Regression Trees, Random Forest, C5.0 Classification, Gradient Boosting Machine, and Logistic Regression. Model evaluation metrics like accuracy, specificity, sensitivity, negative predictive value, score, positive predictive value, k-fold cross-validation, and area under the curve (AUC) techniques were used to evaluate the performance of the models.

RESULTS AND DISCUSSION

The child mortality rate is 8.2%, according to the data. The bivariate analysis showed that the child mortality rate was higher among the children whose mothers were uneducated, impoverished, underweight, aged 35-49, and gave birth before age 20. Families' water sources and religious connections had no statistically significant impact on child mortality. The prediction of child mortality using machine learning models is the main objective of this study. None of the machine learning models correctly classified dead occurrences. Therefore, this study conducted over-sampling and under-sampling analysis. Approximately 76727 and 6910 observations were sampled for over-sampling and under-sampling techniques, respectively. According to the findings of the over-sampling data, the Random Forest outperformed all the other models in terms of total performance based on training and testing sets, with an accuracy of seventy percent. The k-fold cross-validation approach demonstrated the Random Forest model's superior performance, and achieved the highest AUC (0.701). On the other hand, the Gradient Boosting Machine has the highest assessment for predicting child mortality in under-sampling analysis. The k-fold cross-validation also illustrated the better performance of the Gradient Boosting Machine.

CONCLUSION

The Gradient Boosting Machine and Random Forest produce the best predictive power for classifying child mortality and may help to ameliorate policy decision-making in this regard.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d08/11776272/4f2aa3006a93/12889_2025_21460_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d08/11776272/862b9870bd05/12889_2025_21460_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d08/11776272/d76b893839c2/12889_2025_21460_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d08/11776272/4f2aa3006a93/12889_2025_21460_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d08/11776272/862b9870bd05/12889_2025_21460_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d08/11776272/d76b893839c2/12889_2025_21460_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d08/11776272/4f2aa3006a93/12889_2025_21460_Fig3_HTML.jpg
摘要

背景

儿童死亡率是一个国家健康状况的可靠且重要指标。尽管孟加拉国的儿童死亡率随时间推移在下降,但为实现可持续发展目标(SDGs),仍需进一步降低。机器学习模型是做出更准确高效预测并获取深入知识的最佳工具之一。深入理解对于大幅降低儿童死亡率至关重要。使用机器学习模型进行准确预测可使当局能够及时采取干预措施并提高认识。因此,本研究旨在探索与儿童死亡率相关的因素,并评估各种机器学习模型在预测孟加拉国儿童死亡率方面的效果。

方法和材料

本研究从2017 - 18年进行的孟加拉国人口与健康调查(BDHS)数据中提取了约4.2万个观测值(不包括缺失观测值)。该调查采用两阶段分层抽样方法,选择了675个枚举区——250个城市地区和425个农村地区——从而从672个集群和20160户家庭有效收集了数据。卡方检验和递归特征消除(RFE)用于在众多因素中找出儿童死亡率的相关风险因素。实施了六种基于机器学习的算法来预测儿童死亡率,如朴素贝叶斯、分类与回归树、随机森林、C5.0分类、梯度提升机和逻辑回归。使用诸如准确率、特异性、敏感性、阴性预测值、得分、阳性预测值、k折交叉验证和曲线下面积(AUC)技术等模型评估指标来评估模型的性能。

结果与讨论

根据数据,儿童死亡率为8.2%。双变量分析表明,母亲未受过教育、贫困、体重过轻、年龄在35 - 49岁以及在20岁之前生育的儿童中,儿童死亡率较高。家庭的水源和宗教联系对儿童死亡率没有统计学上的显著影响。使用机器学习模型预测儿童死亡率是本研究的主要目标。没有一个机器学习模型能正确分类死亡情况。因此,本研究进行了过采样和欠采样分析。分别对过采样和欠采样技术抽取了约76727个和6910个观测值。根据过采样数据的结果,基于训练集和测试集的总体性能,随机森林在所有其他模型中表现最佳,准确率为70%。k折交叉验证方法证明了随机森林模型的卓越性能,并实现了最高的AUC(0.701)。另一方面,在欠采样分析中,梯度提升机在预测儿童死亡率方面的评估最高。k折交叉验证也说明了梯度提升机的更好性能。

结论

梯度提升机和随机森林在分类儿童死亡率方面具有最佳预测能力,可能有助于改善这方面的政策决策。

相似文献

1
Explore the factors related to the death of offspring under age five and appraise the hazard of child mortality using machine learning techniques in Bangladesh.在孟加拉国,利用机器学习技术探究与五岁以下儿童死亡相关的因素,并评估儿童死亡风险。
BMC Public Health. 2025 Jan 29;25(1):360. doi: 10.1186/s12889-025-21460-w.
2
Identifying determinants of malnutrition in under-five children in Bangladesh: insights from the BDHS-2022 cross-sectional study.确定孟加拉国五岁以下儿童营养不良的决定因素:来自2022年孟加拉国人口与健康调查横断面研究的见解
Sci Rep. 2025 Apr 24;15(1):14336. doi: 10.1038/s41598-025-99288-y.
3
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
4
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
5
Assessing risk factors for malnutrition among women in Bangladesh and forecasting malnutrition using machine learning approaches.评估孟加拉国女性营养不良的风险因素,并使用机器学习方法预测营养不良情况。
BMC Nutr. 2024 Feb 1;10(1):22. doi: 10.1186/s40795-023-00808-8.
6
Analyzing and forecasting under-5 mortality trends in Bangladesh using machine learning techniques.使用机器学习技术分析和预测孟加拉国5岁以下儿童死亡率趋势。
PLoS One. 2025 Feb 7;20(2):e0317715. doi: 10.1371/journal.pone.0317715. eCollection 2025.
7
Machine learning algorithms to predict healthcare-seeking behaviors of mothers for acute respiratory infections and their determinants among children under five in sub-Saharan Africa.机器学习算法预测撒哈拉以南非洲五岁以下儿童急性呼吸道感染母亲的就医行为及其决定因素。
Front Public Health. 2024 Jun 19;12:1362392. doi: 10.3389/fpubh.2024.1362392. eCollection 2024.
8
Development and validation of inpatient mortality prediction models for patients with hyperglycemic crisis using machine learning approaches.使用机器学习方法开发和验证高血糖危象患者的住院死亡率预测模型
BMC Endocr Disord. 2025 Mar 27;25(1):86. doi: 10.1186/s12902-025-01873-9.
9
On the interpretability of the SVM model for predicting infant mortality in Bangladesh.关于 SVM 模型预测孟加拉国婴儿死亡率的可解释性。
J Health Popul Nutr. 2024 Oct 27;43(1):170. doi: 10.1186/s41043-024-00646-9.
10
Prediction of undernutrition and identification of its influencing predictors among under-five children in Bangladesh using explainable machine learning algorithms.使用可解释机器学习算法预测孟加拉国五岁以下儿童的营养不良情况并识别其影响因素
PLoS One. 2024 Dec 6;19(12):e0315393. doi: 10.1371/journal.pone.0315393. eCollection 2024.

本文引用的文献

1
Predicting perinatal mortality based on maternal health status and health insurance service using homogeneous ensemble machine learning methods.基于产妇健康状况和医疗保险服务利用同质集成机器学习方法预测围产儿死亡率。
BMC Med Inform Decis Mak. 2022 Dec 28;22(1):341. doi: 10.1186/s12911-022-02084-1.
2
On usage of artificial intelligence for predicting mortality during and post-pregnancy: a systematic review of literature.利用人工智能预测妊娠期间和产后死亡率的研究:文献系统评价。
BMC Med Inform Decis Mak. 2022 Dec 19;22(1):334. doi: 10.1186/s12911-022-02082-3.
3
Comparing machine learning algorithms for predicting COVID-19 mortality.
比较用于预测 COVID-19 死亡率的机器学习算法。
BMC Med Inform Decis Mak. 2022 Jan 4;22(1):2. doi: 10.1186/s12911-021-01742-0.
4
Machine Learning Algorithms to Predict Mortality of Neonates on Mechanical Intubation for Respiratory Failure.用于预测呼吸衰竭机械通气新生儿死亡率的机器学习算法
Biomedicines. 2021 Oct 2;9(10):1377. doi: 10.3390/biomedicines9101377.
5
Predictive model and determinants of under-five child mortality: evidence from the 2014 Ghana demographic and health survey.预测五岁以下儿童死亡率的模型和决定因素:来自 2014 年加纳人口与健康调查的证据。
BMC Public Health. 2019 Jan 14;19(1):64. doi: 10.1186/s12889-019-6390-4.
6
A machine learning approach to estimating preterm infants survival: development of the Preterm Infants Survival Assessment (PISA) predictor.一种用于估计早产儿存活率的机器学习方法:早产儿存活率评估(PISA)预测器的开发。
Sci Rep. 2018 Sep 13;8(1):13743. doi: 10.1038/s41598-018-31920-6.
7
Risk factors of neonatal mortality and child mortality in Bangladesh.孟加拉国新生儿和儿童死亡的风险因素。
J Glob Health. 2018 Jun;8(1):010417. doi: 10.7189/jogh.08.010421.
8
Machine learning, natural language programming, and electronic health records: The next step in the artificial intelligence journey?机器学习、自然语言编程与电子健康记录:人工智能征程的下一步?
J Allergy Clin Immunol. 2018 Jun;141(6):2019-2021.e1. doi: 10.1016/j.jaci.2018.02.025. Epub 2018 Mar 5.
9
Predicting diabetes mellitus using SMOTE and ensemble machine learning approach: The Henry Ford ExercIse Testing (FIT) project.使用SMOTE和集成机器学习方法预测糖尿病:亨利·福特运动测试(FIT)项目。
PLoS One. 2017 Jul 24;12(7):e0179805. doi: 10.1371/journal.pone.0179805. eCollection 2017.
10
Survival analysis of under-five mortality using Cox and frailty models in Ethiopia.在埃塞俄比亚使用Cox模型和脆弱模型对五岁以下儿童死亡率进行生存分析。
J Health Popul Nutr. 2017 Jun 2;36(1):25. doi: 10.1186/s41043-017-0103-3.