Kassie Sisay Yitayih, Abuhay Abebe Solomon, Wondirad Mekdes, Fantaw Muket Samrawit, Melke Ayantu, Chereka Alex Ayenew, Ambachew Shibabaw Adamu, Dubale Abiy Tasew, Damtie Yitayish, Ngusie Habtamu Setegn, Walle Agmasie Damtew
Department of Health Informatics, School of Public Health, College of Medicine and Health Science, Hawassa University, Hawassa, Ethiopia.
School of Public Health, College of Medicine and Health Sciences, Hawassa University, Hawassa, Ethiopia.
Front Public Health. 2025 Mar 20;13:1448055. doi: 10.3389/fpubh.2025.1448055. eCollection 2025.
Out-of-pocket payments for health services can lead to health catastrophes and decreased service utilization. To address this issue, community-based health insurance has emerged as a strategy to provide financial protection against the costs of poor health. Despite the efforts made by the government of Ethiopia, enrollment rates have not reached the potential beneficiaries. Therefore, this study aimed to predict and identify the factors influencing community-based health insurance enrollment among reproductive-age women using SHapley Additive exPlanations (SHAP) analysis techniques.
The study was conducted using the recent Demographic Health Survey 2019 dataset. Eight machine learning algorithm classifiers were applied to a total weighted sample of 9,013 reproductive-age women and evaluated using performance metrics to predict community-based health insurance enrollment with Python 3.12.2 software, utilizing the Anaconda extension. Additionally, SHAP analysis was used to identify the key predictors of community-based health insurance enrollment and the disproportionate impact of certain variables on others.
The random forest was the most effective predictive model, achieving an accuracy of 91.64% and an area under the curve of 0.885. The SHAP analysis, based on this superior random forest model, indicated that residence, wealth, the age of the household head, the husband's education level, media exposure, family size, and the number of children under five were the most influential factors affecting enrollment in community-based health insurance.
This study highlights the significance of machine learning in predicting community-based health insurance enrollment and identifying the factors that hinder it. Residence, wealth status, and the age of the household head were identified as the primary predictors. The findings of this research indicate that sociodemographic, sociocultural, and economic factors should be considered when developing and implementing health policies aimed at increasing enrollment among reproductive-age women in Ethiopia, particularly in rural areas, as these factors significantly impact low enrollment levels.
医疗服务的自付费用可能导致健康灾难并降低服务利用率。为解决这一问题,社区医疗保险已成为一种为健康状况不佳的费用提供经济保护的策略。尽管埃塞俄比亚政府做出了努力,但参保率尚未达到潜在受益人群。因此,本研究旨在使用夏普利值附加解释(SHAP)分析技术预测并识别影响育龄妇女参加社区医疗保险的因素。
本研究使用了2019年最新的人口与健康调查数据集。将八种机器学习算法分类器应用于9013名育龄妇女的总加权样本,并使用性能指标进行评估,以使用Python 3.12.2软件(利用Anaconda扩展)预测社区医疗保险参保情况。此外,还使用SHAP分析来确定社区医疗保险参保的关键预测因素以及某些变量对其他变量的不成比例影响。
随机森林是最有效的预测模型,准确率达到91.64%,曲线下面积为0.885。基于这个卓越的随机森林模型的SHAP分析表明,居住地、财富状况、户主年龄、丈夫的教育水平、媒体曝光度、家庭规模以及五岁以下儿童数量是影响社区医疗保险参保的最具影响力因素。
本研究强调了机器学习在预测社区医疗保险参保情况以及识别阻碍参保的因素方面的重要性。居住地、财富状况和户主年龄被确定为主要预测因素。本研究结果表明,在制定和实施旨在提高埃塞俄比亚育龄妇女(特别是农村地区)参保率的卫生政策时,应考虑社会人口、社会文化和经济因素,因为这些因素对低参保率有重大影响。