Comparison of Prediction Models for Mortality Related to Injuries from Road Traffic Accidents after Correcting for Undersampling.

Author information

Boo Yookyung, Choi Youngjin

Affiliations

Department of Health Administration, Dankook University, Cheonan 31116, Korea.

Department of Healthcare Management, Eulji University, Seongnam 13135, Korea.

Publication information

Int J Environ Res Public Health. 2021 May 24;18(11):5604. doi: 10.3390/ijerph18115604.

DOI:10.3390/ijerph18115604

PMID:34073920

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC8197414/

Abstract

In this study, four models, namely logistic regression (LR), random forest (RF), linear support vector machine (SVM), and radial basis function (RBF)-SVM, were compared for their accuracy in classifying mortality caused by road traffic injuries. They were tested using five years (2013 to 2017) of national-level data from the Korea Disease Control and Prevention Agency's (KDCA) National Hospital Discharge In-Depth Survey. Model performance was measured by accuracy, precision, recall, F1 score, and Brier score, using classification analysis that included characteristics of patients, accidents, injuries, and illnesses. Because the data contained many variables measured in differing units, and the proportions of survival and death related to road traffic accidents were imbalanced, the data were corrected and standardized before the classification models' performances were compared. Using importance analysis, the main diagnosis, the type of injury, the site of the injury, the operation status, the type of accident, the role at the time of the accident, and sex were selected as the analysis factors. The largest contributing factor was the role in the accident, specifically being the driver, and the predominant injuries were head injuries and deep injuries. Using the selected factors, a comparison of the classification performance of each model indicated that the RBF-SVM and RF models were superior to the others. Of the SVM models, the RBF kernel model was superior to the linear kernel model; this suggests that the RBF model, which maps the inputs into a higher-dimensional space, performs better when the use of many variables makes the feature space complex. The findings suggest that analyses of imbalanced, multidimensional original data, such as data on road traffic mortality, are subject to limitations. Thus, analyses should be performed only after the imbalance is corrected.
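The imbalance correction and the evaluation metrics named in the abstract can be illustrated with a minimal sketch in standard-library Python. The abstract does not spell out the exact correction procedure, so random undersampling of the majority class is an assumption here, and the function names `undersample` and `classification_metrics` are hypothetical helpers, not the authors' code.

```python
import random


def undersample(rows, labels, seed=0):
    """Randomly drop majority-class rows until both classes are equal in size.

    Assumption: labels are binary (1 = death, 0 = survival). This is one
    common correction for class imbalance, not necessarily the paper's.
    """
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    # Keep every minority row plus an equally sized random majority sample.
    kept = sorted(minority + rng.sample(majority, len(minority)))
    return [rows[i] for i in kept], [labels[i] for i in kept]


def classification_metrics(y_true, y_prob, threshold=0.5):
    """Compute accuracy, precision, recall, F1, and Brier score.

    y_prob holds predicted probabilities of the positive class.
    """
    y_pred = [1 if p >= threshold else 0 for p in y_prob]
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    # Brier score: mean squared gap between probability and outcome.
    brier = sum((p - t) ** 2 for t, p in zip(y_true, y_prob)) / len(y_true)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "brier": brier,
    }
```

In such a setup, each of the four classifiers would be fitted on the balanced, standardized data and its predicted probabilities passed to `classification_metrics`. A lower Brier score indicates better-calibrated probabilities, while the other four metrics are better when higher.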
