• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测东非五价疫苗3接种率下降的堆叠集成机器学习模型。

A stacked ensemble machine learning model for the prediction of pentavalent 3 vaccination dropout in East Africa.

作者信息

Alemayehu Meron Asmamaw, Kebede Shimels Derso, Walle Agmasie Damtew, Mamo Daniel Niguse, Enyew Ermias Bekele, Adem Jibril Bashir

机构信息

Department of Epidemiology and Biostatistics, College of Medicine and Health Sciences, Institute of Public Health, University of Gondar, Gondar, Ethiopia.

Department of Health Informatics, School of Public Health, College of Medicine and Health Science, Wollo University, Dessie, Ethiopia.

出版信息

Front Big Data. 2025 Apr 7;8:1522578. doi: 10.3389/fdata.2025.1522578. eCollection 2025.

DOI:10.3389/fdata.2025.1522578
PMID:40260052
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12009798/
Abstract

INTRODUCTION

Vaccination is critical for reducing childhood mortality, yet completion rates for the third dose of the pentavalent vaccine (Penta 3) in East Africa remain inadequate. This study aims to predict Penta 3 vaccination dropout using a stacking ensemble machine learning model with Demographic and Health Survey (DHS) data. The objective is to identify predictors of dropout and enhance intervention strategies.

METHODS

The study utilized seven base machine learning algorithms to create a stacked ensemble model with three meta-learners: Random Forest (RF), Generalized Linear Model (GLM), and Extreme Gradient Boosting (XGBoost). The H2O package facilitated the development of base learners and the stacking of super learners. Feature selection (FS) and comparisons were performed using the LASSO and Boruta algorithms. The selected features were one-hot encoded, and ordinal encoding was applied where appropriate. Hyperparameter optimization (HPO) and comparisons were conducted using grid search and random search. Model performance was assessed using five key metrics, including accuracy and the area under the curve (AUC). SHAP (Shapley Additive Explanations) values were used to interpret the model outputs and identify influential predictors. The experimental design was employed to present the results.

RESULTS

Four experiments were conducted to evaluate feature selection and HPO methods. All stacked ensemble models outperformed individual learners, with the XGBoost meta-learner optimized with grid search and LASSO FS achieving the highest performance: 93.9% accuracy and 99.4% AUC. While RF and GLM meta-learners were also evaluated, they were outperformed by the XGBoost meta-learner. SHAP analysis revealed key features influencing Penta 3 dropout, including the place of delivery, decision-making autonomy, the mother's level of earning, and healthcare access. Home delivery increased the risk of dropout, while postnatal care by midwives and health insurance coverage lowered dropout likelihood.

CONCLUSION AND RECOMMENDATION

This study provides insights into the factors influencing Penta 3 vaccination dropout in East Africa. To reduce dropout rates, interventions should focus on enhancing maternal livelihood opportunities, improving healthcare access in rural areas, and promoting institutional deliveries.

摘要

引言

疫苗接种对于降低儿童死亡率至关重要,但东非五价疫苗第三剂(Penta 3)的接种完成率仍然不足。本研究旨在使用包含人口与健康调查(DHS)数据的堆叠集成机器学习模型预测Penta 3疫苗接种率下降情况。目的是识别接种率下降的预测因素并加强干预策略。

方法

该研究利用七种基础机器学习算法创建了一个堆叠集成模型,该模型有三个元学习器:随机森林(RF)、广义线性模型(GLM)和极端梯度提升(XGBoost)。H2O软件包促进了基础学习器的开发和超级学习器的堆叠。使用LASSO和Boruta算法进行特征选择(FS)和比较。对选定的特征进行独热编码,并在适当的地方应用序数编码。使用网格搜索和随机搜索进行超参数优化(HPO)和比较。使用包括准确率和曲线下面积(AUC)在内的五个关键指标评估模型性能。SHAP(Shapley加性解释)值用于解释模型输出并识别有影响力的预测因素。采用实验设计展示结果。

结果

进行了四项实验来评估特征选择和HPO方法。所有堆叠集成模型的表现均优于单个学习器,使用网格搜索和LASSO FS优化的XGBoost元学习器性能最高:准确率为93.9%,AUC为99.4%。虽然也对RF和GLM元学习器进行了评估,但它们的表现不如XGBoost元学习器。SHAP分析揭示了影响Penta 3疫苗接种率下降的关键特征,包括分娩地点、决策自主权、母亲的收入水平和医疗保健可及性。在家分娩增加了接种率下降的风险,而助产士的产后护理和医疗保险覆盖则降低了接种率下降的可能性。

结论与建议

本研究深入探讨了影响东非Penta 3疫苗接种率下降的因素。为降低接种率下降,干预措施应侧重于增加孕产妇生计机会、改善农村地区的医疗保健可及性以及促进机构分娩。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/44565c64ba80/fdata-08-1522578-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/4d6ea9555b7a/fdata-08-1522578-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/b9f1f1d810f6/fdata-08-1522578-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/25e0ee3d2370/fdata-08-1522578-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/39b8be4f7de8/fdata-08-1522578-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/10da38171e6c/fdata-08-1522578-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/dbed0d7dc582/fdata-08-1522578-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/19ea736bc533/fdata-08-1522578-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/44565c64ba80/fdata-08-1522578-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/4d6ea9555b7a/fdata-08-1522578-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/b9f1f1d810f6/fdata-08-1522578-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/25e0ee3d2370/fdata-08-1522578-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/39b8be4f7de8/fdata-08-1522578-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/10da38171e6c/fdata-08-1522578-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/dbed0d7dc582/fdata-08-1522578-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/19ea736bc533/fdata-08-1522578-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8be/12009798/44565c64ba80/fdata-08-1522578-g0008.jpg

相似文献

1
A stacked ensemble machine learning model for the prediction of pentavalent 3 vaccination dropout in East Africa.用于预测东非五价疫苗3接种率下降的堆叠集成机器学习模型。
Front Big Data. 2025 Apr 7;8:1522578. doi: 10.3389/fdata.2025.1522578. eCollection 2025.
2
Prediction of gully erosion susceptibility through the lens of the SHapley Additive exPlanations (SHAP) method using a stacking ensemble model.利用堆叠集成模型,通过SHapley加性解释(SHAP)方法预测沟壑侵蚀敏感性。
J Environ Manage. 2025 May;383:125478. doi: 10.1016/j.jenvman.2025.125478. Epub 2025 Apr 25.
3
Machine learning algorithms for prediction of measles one vaccination dropout among 12-23 months children in Ethiopia.用于预测埃塞俄比亚 12-23 个月儿童麻疹一剂疫苗接种流失的机器学习算法。
BMJ Open. 2024 Nov 14;14(11):e089764. doi: 10.1136/bmjopen-2024-089764.
4
Identifying determinants of malnutrition in under-five children in Bangladesh: insights from the BDHS-2022 cross-sectional study.确定孟加拉国五岁以下儿童营养不良的决定因素:来自2022年孟加拉国人口与健康调查横断面研究的见解
Sci Rep. 2025 Apr 24;15(1):14336. doi: 10.1038/s41598-025-99288-y.
5
Enhancing predictive accuracy for urinary tract infections post-pediatric pyeloplasty with explainable AI: an ensemble TabNet approach.使用可解释人工智能提高小儿肾盂成形术后尿路感染的预测准确性:一种集成TabNet方法。
Sci Rep. 2025 Jan 19;15(1):2455. doi: 10.1038/s41598-024-82282-1.
6
Exploring Ensemble Learning Techniques for Infant Mortality Prediction: A Technical Analysis of XGBoost Stacking AdaBoost and Bagging Models.探索用于婴儿死亡率预测的集成学习技术:XGBoost、堆叠、AdaBoost和装袋模型的技术分析
Birth Defects Res. 2025 Feb;117(2):e2443. doi: 10.1002/bdr2.2443.
7
A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.长期护理机构中老年人身体约束的风险预测模型:机器学习研究。
J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.
8
Prediction of mortality in sepsis patients using stacked ensemble machine learning algorithm.使用堆叠集成机器学习算法预测脓毒症患者的死亡率。
J Postgrad Med. 2024 Oct 1;70(4):209-216. doi: 10.4103/jpgm.jpgm_357_24. Epub 2024 Dec 6.
9
Optimizing hypertension prediction using ensemble learning approaches.使用集成学习方法优化高血压预测。
PLoS One. 2024 Dec 23;19(12):e0315865. doi: 10.1371/journal.pone.0315865. eCollection 2024.
10
An innovative machine learning approach for slope stability prediction by combining shap interpretability and stacking ensemble learning.一种结合SHAP可解释性和堆叠集成学习的用于边坡稳定性预测的创新机器学习方法。
Environ Sci Pollut Res Int. 2025 May;32(21):12827-12843. doi: 10.1007/s11356-025-36406-3. Epub 2025 May 7.

本文引用的文献

1
The impact of economic growth and recessions on maternal and child health outcomes in sub-Saharan African countries: a systematic literature review.经济增长与衰退对撒哈拉以南非洲国家母婴健康结果的影响:一项系统文献综述
Reprod Health. 2025 Mar 7;22(1):35. doi: 10.1186/s12978-025-01973-8.
2
Incomplete immunization uptake and associated factors among children aged 12-23 months in sub-Saharan African countries; multilevel analysis evidenced from latest demography and health survey data, 2023.撒哈拉以南非洲国家 12-23 个月龄儿童不完全免疫接种率及其相关因素;来自最新人口与健康调查数据的多水平分析,2023 年。
Ital J Pediatr. 2024 May 12;50(1):96. doi: 10.1186/s13052-024-01642-9.
3
Evaluation metrics and statistical tests for machine learning.
机器学习的评估指标和统计检验。
Sci Rep. 2024 Mar 13;14(1):6086. doi: 10.1038/s41598-024-56706-x.
4
Vaccination dropout and wealth related inequality among children aged 12-35 months in remote and underserved settings of Ethiopia: a cross-sectional evaluation survey.埃塞俄比亚偏远及服务欠缺地区12至35个月儿童的疫苗接种中断情况及与财富相关的不平等:一项横断面评估调查
Front Pediatr. 2023 Oct 24;11:1280746. doi: 10.3389/fped.2023.1280746. eCollection 2023.
5
Machine learning algorithms' application to predict childhood vaccination among children aged 12-23 months in Ethiopia: Evidence 2016 Ethiopian Demographic and Health Survey dataset.机器学习算法在预测埃塞俄比亚 12-23 个月儿童疫苗接种率中的应用:基于 2016 年埃塞俄比亚人口与健康调查数据集的证据。
PLoS One. 2023 Oct 18;18(10):e0288867. doi: 10.1371/journal.pone.0288867. eCollection 2023.
6
Factors associated with pentavalent vaccine coverage among 12-23-month-old children in Afghanistan: A cross-sectional study.与阿富汗 12-23 个月龄儿童五联疫苗接种覆盖率相关的因素:一项横断面研究。
PLoS One. 2023 Aug 8;18(8):e0289744. doi: 10.1371/journal.pone.0289744. eCollection 2023.
7
Imbalanced data preprocessing techniques for machine learning: a systematic mapping study.用于机器学习的不平衡数据预处理技术:一项系统映射研究
Knowl Inf Syst. 2023;65(1):31-57. doi: 10.1007/s10115-022-01772-8. Epub 2022 Nov 9.
8
Prevalence and determinants of maternal healthcare utilisation among young women in sub-Saharan Africa: cross-sectional analyses of demographic and health survey data.撒哈拉以南非洲年轻女性中孕产妇医疗保健利用的流行情况及其决定因素:来自人口与健康调查数据的横断面分析。
BMC Public Health. 2022 Apr 5;22(1):647. doi: 10.1186/s12889-022-13037-8.
9
Determinants of pentavalent and measles vaccination dropouts among children aged 12-23 months in The Gambia.冈比亚 12-23 月龄儿童五联疫苗和麻疹疫苗接种失败的决定因素。
BMC Public Health. 2022 Mar 17;22(1):520. doi: 10.1186/s12889-022-12914-6.
10
Vaccination dropout rates among children aged 12-23 months in Democratic Republic of the Congo: a cross-sectional study.刚果民主共和国12至23个月大儿童的疫苗接种辍学率:一项横断面研究。
Arch Public Health. 2022 Jan 5;80(1):18. doi: 10.1186/s13690-021-00782-2.