• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 2020 年 SDHS 数据进行机器学习研究,以确定索马里的贫困决定因素。

Machine learning study using 2020 SDHS data to determine poverty determinants in Somalia.

机构信息

School of Postgraduate Studies and Research, Amoud University, Amoud Valley, Borama, Awdal, 25263, Somalia.

Department of Mathematics, LMNO, CNRS-Université de Caen, Campus II, Science 3, 14032, Caen, France.

出版信息

Sci Rep. 2024 Mar 12;14(1):5956. doi: 10.1038/s41598-024-56466-8.

DOI:10.1038/s41598-024-56466-8
PMID:38472298
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10933372/
Abstract

Extensive research has been conducted on poverty in developing countries using conventional regression analysis, which has limited prediction capability. This study aims to address this gap by applying advanced machine learning (ML) methods to predict poverty in Somalia. Utilizing data from the first-ever 2020 Somalia Demographic and Health Survey (SDHS), a cross-sectional study design is considered. ML methods, including random forest (RF), decision tree (DT), support vector machine (SVM), and logistic regression, are tested and applied using R software version 4.1.2, while conventional methods are analyzed using STATA version 17. Evaluation metrics, such as confusion matrix, accuracy, precision, sensitivity, specificity, recall, F1 score, and area under the receiver operating characteristic (AUROC), are employed to assess the performance of predictive models. The prevalence of poverty in Somalia is notable, with approximately seven out of ten Somalis living in poverty, making it one of the highest rates in the region. Among nomadic pastoralists, agro-pastoralists, and internally displaced persons (IDPs), the poverty average stands at 69%, while urban areas have a lower poverty rate of 60%. The accuracy of prediction ranged between 67.21% and 98.36% for the advanced ML methods, with the RF model demonstrating the best performance. The results reveal geographical region, household size, respondent age group, husband employment status, age of household head, and place of residence as the top six predictors of poverty in Somalia. The findings highlight the potential of ML methods to predict poverty and uncover hidden information that traditional statistical methods cannot detect, with the RF model identified as the best classifier for predicting poverty in Somalia.

摘要

针对发展中国家的贫困问题,已经开展了广泛的研究,其中使用了传统的回归分析方法,但这种方法的预测能力有限。本研究旨在通过应用先进的机器学习(ML)方法来解决这一差距,以预测索马里的贫困情况。本研究利用了首次进行的 2020 年索马里人口与健康调查(SDHS)的数据,采用了横断面研究设计。在 R 软件版本 4.1.2 中测试和应用了 ML 方法,包括随机森林(RF)、决策树(DT)、支持向量机(SVM)和逻辑回归,而传统方法则使用 STATA 版本 17 进行分析。采用混淆矩阵、准确性、精度、敏感性、特异性、召回率、F1 分数和接收器操作特征(ROC)曲线下面积(AUROC)等评估指标来评估预测模型的性能。索马里的贫困发生率很高,大约十分之七的索马里人生活贫困,这一比例在该地区是最高的。在游牧牧民、农牧民和国内流离失所者(IDP)中,贫困平均水平为 69%,而城市地区的贫困率较低,为 60%。先进的 ML 方法的预测准确率在 67.21%至 98.36%之间,其中 RF 模型表现最佳。研究结果表明,地理区域、家庭规模、受访者年龄组、丈夫就业状况、户主年龄和居住地是索马里贫困的前六个预测因素。研究结果强调了 ML 方法在预测贫困和揭示传统统计方法无法检测到的隐藏信息方面的潜力,其中 RF 模型被确定为预测索马里贫困的最佳分类器。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/1594583312ab/41598_2024_56466_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/1bcb18434d89/41598_2024_56466_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/5739adbaa4e9/41598_2024_56466_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/d68c2c9f816e/41598_2024_56466_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/fed2f6b07db7/41598_2024_56466_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/e4a22107615d/41598_2024_56466_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/da773f82af71/41598_2024_56466_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/e05d4cf40eef/41598_2024_56466_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/1594583312ab/41598_2024_56466_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/1bcb18434d89/41598_2024_56466_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/5739adbaa4e9/41598_2024_56466_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/d68c2c9f816e/41598_2024_56466_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/fed2f6b07db7/41598_2024_56466_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/e4a22107615d/41598_2024_56466_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/da773f82af71/41598_2024_56466_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/e05d4cf40eef/41598_2024_56466_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfeb/10933372/1594583312ab/41598_2024_56466_Fig8_HTML.jpg

相似文献

1
Machine learning study using 2020 SDHS data to determine poverty determinants in Somalia.利用 2020 年 SDHS 数据进行机器学习研究,以确定索马里的贫困决定因素。
Sci Rep. 2024 Mar 12;14(1):5956. doi: 10.1038/s41598-024-56466-8.
2
Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014-15 dataset.机器学习方法在预测卢旺达婴儿死亡率中的应用:对 2014-15 年卢旺达人口健康调查数据集的分析。
BMC Pregnancy Childbirth. 2022 May 4;22(1):388. doi: 10.1186/s12884-022-04699-8.
3
Identifying determinants of malnutrition in under-five children in Bangladesh: insights from the BDHS-2022 cross-sectional study.确定孟加拉国五岁以下儿童营养不良的决定因素:来自2022年孟加拉国人口与健康调查横断面研究的见解
Sci Rep. 2025 Apr 24;15(1):14336. doi: 10.1038/s41598-025-99288-y.
4
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
5
Magnitude and determinants of intimate partner violence against women in Somalia: evidence from the SDHS survey 2020 dataset.索马里针对妇女的亲密伴侣暴力行为的程度及决定因素:来自2020年索马里多指标类集调查数据集的证据
BMC Womens Health. 2025 Jan 15;25(1):22. doi: 10.1186/s12905-024-03539-5.
6
Machine Learning Algorithms for understanding the determinants of under-five Mortality.用于理解五岁以下儿童死亡率决定因素的机器学习算法
BioData Min. 2022 Sep 24;15(1):20. doi: 10.1186/s13040-022-00308-8.
7
Predicting the drop out from the maternal, newborn and child healthcare continuum in three East African Community countries: application of machine learning models.预测三个东非共同体国家母婴和儿童保健连续服务中断的可能性:机器学习模型的应用。
BMC Med Inform Decis Mak. 2023 Sep 25;23(1):191. doi: 10.1186/s12911-023-02305-1.
8
Machine Learning Analysis of Nutrient Associations with Peripheral Arterial Disease: Insights from NHANES 1999-2004.营养物质与外周动脉疾病关联的机器学习分析:来自1999 - 2004年美国国家健康与营养检查调查(NHANES)的见解
Ann Vasc Surg. 2025 May;114:154-162. doi: 10.1016/j.avsg.2024.12.077. Epub 2025 Jan 30.
9
Prevalance and determinants of hypertension among adults in Somalia using Somalia demographic health survey data, SDHS 2020.利用索马里人口健康调查数据(SDHS 2020)评估索马里成年人高血压的患病率及其决定因素。
Curr Probl Cardiol. 2024 Nov;49(11):102783. doi: 10.1016/j.cpcardiol.2024.102783. Epub 2024 Aug 8.
10
A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.长期护理机构中老年人身体约束的风险预测模型:机器学习研究。
J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.

引用本文的文献

1
Application of machine learning algorithms and SHAP explanations to predict fertility preference among reproductive women in Somalia.机器学习算法和SHAP解释在预测索马里育龄妇女生育偏好中的应用。
Sci Rep. 2025 Jul 20;15(1):26301. doi: 10.1038/s41598-025-04704-y.
2
Multilevel analysis of prevalence and determinants of diarrhea among under-five children in Somalia: insights from the Somalia demographic and health survey 2020.索马里五岁以下儿童腹泻患病率及决定因素的多水平分析:来自2020年索马里人口与健康调查的见解
BMC Public Health. 2025 Jan 17;25(1):202. doi: 10.1186/s12889-025-21435-x.
3
Magnitude and determinants of intimate partner violence against women in Somalia: evidence from the SDHS survey 2020 dataset.

本文引用的文献

1
Understanding multidimensional poverty in pakistan: implications for regional and demographic-specific policies.了解巴基斯坦的多维贫困:对区域和特定人口政策的影响。
Environ Sci Pollut Res Int. 2023 Jun 6. doi: 10.1007/s11356-023-28026-6.
2
New evidence on the rural poverty and energy choice relationship.农村贫困与能源选择关系的新证据。
Sci Rep. 2023 Feb 27;13(1):3320. doi: 10.1038/s41598-023-29285-6.
索马里针对妇女的亲密伴侣暴力行为的程度及决定因素:来自2020年索马里多指标类集调查数据集的证据
BMC Womens Health. 2025 Jan 15;25(1):22. doi: 10.1186/s12905-024-03539-5.
4
Unimproved source of drinking water and the associated factors: Insights from the 2020 Somalia demographic and health survey.未改善的饮用水源及相关因素:来自2020年索马里人口与健康调查的见解
PLOS Glob Public Health. 2024 Dec 5;4(12):e0003844. doi: 10.1371/journal.pgph.0003844. eCollection 2024.
5
Prevalence and Determinants of Household Access to Improved Latrine Utilization in Somalia: Health Demographic Survey (SHDS) 2020.索马里家庭使用改良厕所情况的患病率及影响因素:2020年卫生人口调查(SHDS)
Environ Health Insights. 2024 Sep 23;18:11786302241284148. doi: 10.1177/11786302241284148. eCollection 2024.