Tesfa Getanew Aschalew, Demeke Abel Desalegn, Seboka Binyam Tariku, Tebeje Tsion Mulat, Kasaye Mulugeta Desalegn, Gebremeskele Behailu Taye, Hailegebreal Samuel, Ngusie Habtamu Setegn
School of Public Health, College of Medicine and Health Science, Dilla University, Dīla, Ethiopia.
Nursing department, college of Medicine and Health Science, Dilla University, Dīla, Ethiopia.
Sci Rep. 2024 Dec 3;14(1):30047. doi: 10.1038/s41598-024-81197-1.
Pregnancy termination is still a sensitive and continuing public health issue due to several political, economic, religious, and social concerns. This study assesses the applications of machine learning models in the prediction of pregnancy termination using data from eleven national datasets in East Africa. Nine machine learning models, namely: Random Forests (RF), Decision Tree, Logistic Regression, Support Vector Machine, eXtreme Gradient Boosting (XGB), AdaBoost, CatBoost, K-nearest neighbor, and feedforward neural network models were used to predict pregnancy termination, with six evaluation criteria utilized to compare their performance. The pooled prevalence of pregnancy termination in East Africa was found to be 4.56%. All machine learning models had an accuracy of at least 71.8% on average. The RF model provided accuracy, specificity, precision, and AUC of 92.9%, 0.87, 0.91, and 0.93, respectively. The most important variables for predicting pregnancy termination were marital status, age, parity, country of residence, age at first sexual activity, exposure to mass media, and educational attainment. These findings underscore the need for a tailored approach that considers socioeconomic and regional disparities in designing policy initiatives aimed at reducing the rate of pregnancy terminations among younger women in the region.
由于一些政治、经济、宗教和社会问题,终止妊娠仍然是一个敏感且持续存在的公共卫生问题。本研究利用东非11个国家数据集的数据,评估机器学习模型在预测终止妊娠方面的应用。使用了9种机器学习模型,即随机森林(RF)、决策树、逻辑回归、支持向量机、极端梯度提升(XGB)、AdaBoost、CatBoost、K近邻和前馈神经网络模型来预测终止妊娠,并使用6种评估标准来比较它们的性能。结果发现,东非终止妊娠的合并患病率为4.56%。所有机器学习模型的平均准确率至少为71.8%。RF模型的准确率、特异性、精确率和AUC分别为92.9%、0.87、0.91和0.93。预测终止妊娠最重要的变量是婚姻状况、年龄、产次、居住国家、首次性行为年龄、接触大众媒体情况和教育程度。这些发现强调,在设计旨在降低该地区年轻女性终止妊娠率的政策举措时,需要采取一种考虑社会经济和地区差异的针对性方法。