预测埃塞俄比亚西北部的车祸严重程度：一种利用驾驶员、环境和道路状况的机器学习方法。

Predicting car accident severity in Northwest Ethiopia: a machine learning approach leveraging driver, environmental, and road conditions.

作者信息

Mengistu Abraham Keffale, Gedefaw Andualem Enyew, Baykemagn Nebebe Demis, Walle Agmasie Damtew, Yehuala Tirualem Zeleke, Alemayehu Meron Asmamaw, Messelu Mengistu Abebe, Assaye Bayou Tilahun

机构信息

Department of Health Informatics, College of Medicine and Health Sciences, Debre Markos University, Debre Markos, Ethiopia.

Department of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia.

出版信息

Sci Rep. 2025 Jul 1;15(1):21913. doi: 10.1038/s41598-025-08005-2.

DOI:10.1038/s41598-025-08005-2

PMID:40595301

Abstract

Road traffic accidents (RTAs) in Northwest Ethiopia, a region with a fatality rate of 32.2 per 100,000 residents, pose a critical public health challenge exacerbated by infrastructural deficits and environmental hazards. This study leverages machine learning (ML) to predict accident severity, addressing gaps in localized predictive frameworks for low- and middle-income countries (LMICs). Our study aims to predict the severity of car accidents in Northwest Ethiopia via machine-learning techniques. Using a dataset of 2,000 accidents (2018-2023) from police reports, we integrated driver demographics, behavioral factors (e.g., alcohol use, seatbelt compliance), and environmental conditions (e.g., unpaved roads, weather) in North West Ethiopia. Ten ML models, including Random Forest, XGBoost, and LightGBM, were evaluated after addressing class imbalance via the Synthetic Minority Oversampling Technique (SMOTE). Hyperparameter tuning and Shapley Additive explanations (SHAP) provided model optimization and interpretability. Random Forest outperformed other models, achieving 82% accuracy (AUC-ROC: 0.87) post-tuning. Driver age (mean: 44 years) and environmental factors (e.g., nighttime on unlit roads, rainy conditions) were critical predictors, increasing fatal accident likelihood by 62%. SMOTE improved the accuracy of the outperforming random forest accuracy from 78.6 to 82%. Random Forest exhibited the highest recall (0.82) after optimization, while ensemble methods dominated performance metrics. The study underscores the efficacy of ML in contextualizing accident severity in LMICs, with Random Forest emerging as a robust tool for policymakers. Prioritizing road paving, sobriety checkpoints, and motorcycle safety could mitigate risks, aligning with Sustainable Development Goal 3.6. Future work should address data limitations (underreporting, geospatial gaps) and expand model interpretability.

摘要

埃塞俄比亚西北部的道路交通事故（RTAs）是一个严峻的公共卫生挑战，该地区每10万居民的死亡率为32.2，基础设施不足和环境危害加剧了这一问题。本研究利用机器学习（ML）来预测事故严重程度，以填补低收入和中等收入国家（LMICs）本地化预测框架的空白。我们的研究旨在通过机器学习技术预测埃塞俄比亚西北部汽车事故的严重程度。利用警方报告中的2000起事故（2018 - 2023年）数据集，我们纳入了埃塞俄比亚西北部的驾驶员人口统计学特征、行为因素（如酒精使用、安全带佩戴情况）和环境条件（如未铺砌道路、天气）。在通过合成少数过采样技术（SMOTE）解决类别不平衡问题后，对包括随机森林、XGBoost和LightGBM在内的10种ML模型进行了评估。超参数调整和Shapley附加解释（SHAP）提供了模型优化和可解释性。随机森林在调整后优于其他模型，准确率达到82%（AUC - ROC：0.87）。驾驶员年龄（平均：44岁）和环境因素（如夜间在无照明道路上、下雨情况）是关键预测因素，将致命事故可能性增加了62%。SMOTE将表现最佳的随机森林准确率从78.6%提高到了82%。优化后随机森林的召回率最高（0.82），而集成方法在性能指标方面占主导地位。该研究强调了ML在低收入和中等收入国家将事故严重程度情境化方面的有效性，随机森林成为政策制定者的有力工具。优先进行道路铺设、设立清醒检查站和提高摩托车安全性可以降低风险，这与可持续发展目标3.6相一致。未来的工作应解决数据限制（报告不足、地理空间差距）并扩大模型的可解释性。

相似文献

Predicting car accident severity in Northwest Ethiopia: a machine learning approach leveraging driver, environmental, and road conditions.预测埃塞俄比亚西北部的车祸严重程度：一种利用驾驶员、环境和道路状况的机器学习方法。

Sci Rep. 2025 Jul 1;15(1):21913. doi: 10.1038/s41598-025-08005-2.

Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型：基于多中心队列研究的开发与验证研究

J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.

Interpretable Machine Learning for Serum-Based Metabolomics in Breast Cancer Diagnostics: Insights from Multi-Objective Feature Selection-Driven LightGBM-SHAP Models.用于乳腺癌诊断的基于血清代谢组学的可解释机器学习：多目标特征选择驱动的LightGBM-SHAP模型的见解

Medicina (Kaunas). 2025 Jun 19;61(6):1112. doi: 10.3390/medicina61061112.

Construction and validation of HBV-ACLF bacterial infection diagnosis model based on machine learning.基于机器学习的HBV-ACLF细菌感染诊断模型的构建与验证

BMC Infect Dis. 2025 Jul 1;25(1):847. doi: 10.1186/s12879-025-11199-5.

A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes: Methodology and Validation Study.用于评估、选择和解释2型糖尿病患者心血管疾病结局机器学习模型的责任框架：方法与验证研究

JMIR Med Inform. 2025 Jun 27;13:e66200. doi: 10.2196/66200.

Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果：一种针对特定个体见解的新型验证方法。

Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.

Machine learning-based drought prediction using Palmer Drought Severity Index and TerraClimate data in Ethiopia.基于机器学习的埃塞俄比亚干旱预测：利用帕尔默干旱严重指数和TerraClimate数据

PLoS One. 2025 Jun 18;20(6):e0326174. doi: 10.1371/journal.pone.0326174. eCollection 2025.

Leveraging machine learning to identify determinants of zero utilization of maternal continuum of care in Ethiopia: Insights from SHAP analysis and the 2019 mini DHS.利用机器学习识别埃塞俄比亚孕产妇连续护理零利用率的决定因素：来自SHAP分析和2019年小型人口与健康调查的见解

PLOS Glob Public Health. 2025 Jun 20;5(6):e0004787. doi: 10.1371/journal.pgph.0004787. eCollection 2025.

Development of machine learning model for predicting prolonged operation time in lumbar stenosis undergoing posterior lumbar interbody fusion: a multicenter study.用于预测接受后路腰椎椎间融合术的腰椎管狭窄症患者手术时间延长的机器学习模型的开发：一项多中心研究。

Spine J. 2025 Mar;25(3):460-473. doi: 10.1016/j.spinee.2024.10.001. Epub 2024 Oct 19.

Predicting Early-Onset Colorectal Cancer in Individuals Below Screening Age Using Machine Learning and Real-World Data: Case Control Study.利用机器学习和真实世界数据预测筛查年龄以下个体的早发性结直肠癌：病例对照研究

JMIR Cancer. 2025 Jun 19;11:e64506. doi: 10.2196/64506.

本文引用的文献

Magnitude of mortality and associated factors among road traffic accident victim children admitted in East and West Gojjam Zone specialized public hospitals Northwest, Ethiopia.埃塞俄比亚西北部东戈贾姆和西戈贾姆地区专科医院收治的道路交通事故受害儿童的死亡率及相关因素

BMC Pediatr. 2025 Feb 25;25(1):135. doi: 10.1186/s12887-024-05314-9.

FLEX-SMOTE: Synthetic over-sampling technique that flexibly adjusts to different minority class distributions.FLEX-SMOTE：一种能灵活适应不同少数类分布的合成过采样技术。

Patterns (N Y). 2024 Oct 9;5(11):101073. doi: 10.1016/j.patter.2024.101073. eCollection 2024 Nov 8.

Survival status and its predictors among adult victims of road traffic accident admitted to public hospitals of Bahir Bar City, Amhara regional state, Northwest, Ethiopia, 2023: multi center retrospective follow-up study.2023 年，在埃塞俄比亚西北阿姆哈拉地区巴希尔巴市公立医院收治的成人道路交通伤害受害者中，生存状况及其预测因素的多中心回顾性随访研究。

BMC Emerg Med. 2024 Sep 30;24(1):177. doi: 10.1186/s12873-024-01093-9.

Confirming the statistically significant superiority of tree-based machine learning algorithms over their counterparts for tabular data.证实基于树的机器学习算法在表格数据方面相对于其对应算法具有统计学上的显著优势。

PLoS One. 2024 Apr 18;19(4):e0301541. doi: 10.1371/journal.pone.0301541. eCollection 2024.

Road Traffic Injuries in South Africa: A Complex Global Health Crisis.南非的道路交通伤害：一场复杂的全球健康危机。

Ann Glob Health. 2024 Apr 5;90(1):26. doi: 10.5334/aogh.4249. eCollection 2024.

Accident severity prediction modeling for road safety using random forest algorithm: an analysis of Indian highways.基于随机森林算法的道路安全事故严重程度预测建模：对印度高速公路的分析。

F1000Res. 2023 Oct 20;12:494. doi: 10.12688/f1000research.133594.2. eCollection 2023.

Road traffic accidental injuries and deaths: A neglected global health issue.道路交通意外伤害与死亡：一个被忽视的全球健康问题。

Health Sci Rep. 2023 May 2;6(5):e1240. doi: 10.1002/hsr2.1240. eCollection 2023 May.

Interpretable machine learning with tree-based shapley additive explanations: Application to metabolomics datasets for binary classification.基于树的 Shapley 加性解释的可解释机器学习：在代谢组学数据集的二元分类中的应用。

PLoS One. 2023 May 4;18(5):e0284315. doi: 10.1371/journal.pone.0284315. eCollection 2023.

Epidemiological characteristics of deaths from road traffic accidents in Addis Ababa, Ethiopia: a study based on traffic police records (2018-2020).基于交警记录的埃塞俄比亚亚的斯亚贝巴道路交通死亡事故的流行病学特征研究（2018-2020 年）。

BMC Emerg Med. 2023 Feb 20;23(1):19. doi: 10.1186/s12873-023-00791-0.

Application of explainable machine learning for real-time safety analysis toward a connected vehicle environment.可解释机器学习在车联网环境实时安全分析中的应用

Accid Anal Prev. 2022 Jun;171:106681. doi: 10.1016/j.aap.2022.106681. Epub 2022 Apr 22.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

预测埃塞俄比亚西北部的车祸严重程度：一种利用驾驶员、环境和道路状况的机器学习方法。

Predicting car accident severity in Northwest Ethiopia: a machine learning approach leveraging driver, environmental, and road conditions.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献