• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机动车碰撞伤害严重程度分类:一种针对不平衡数据的混合方法。

Classification of motor vehicle crash injury severity: A hybrid approach for imbalanced data.

机构信息

University of Michigan Transportation Research Institute, 2901 Baxter Road, Ann Arbor, MI, 48109, USA; Department of Industrial and Operations Engineering, University of Michigan, 1205 Beal Avenue, Ann Arbor, MI 48109, USA.

Department of Industrial and Operations Engineering, University of Michigan, 1205 Beal Avenue, Ann Arbor, MI 48109, USA.

出版信息

Accid Anal Prev. 2018 Nov;120:250-261. doi: 10.1016/j.aap.2018.08.025. Epub 2018 Aug 30.

DOI:10.1016/j.aap.2018.08.025
PMID:30173007
Abstract

This study aims to classify the injury severity in motor-vehicle crashes with both high accuracy and sensitivity rates. The dataset used in this study contains 297,113 vehicle crashes, obtained from the Michigan Traffic Crash Facts (MTCF) dataset, from 2016-2017. Similar to any other crash dataset, different accident severity classes are not equally represented in MTCF. To account for the imbalanced classes, several techniques have been used, including under-sampling and over-sampling. Using five classification learning models (i.e., Logistic regression, Decision tree, Neural network, Gradient boosting model, and Naïve Bayes classifier), we classify the levels of injury severity and attempt to improve the classification performance by two training-testing methods including Bootstrap aggregation (or bagging) and majority voting. Furthermore, due to the imbalance present in the dataset, we use the geometric mean (G-mean) to evaluate the classification performance. We show that the classification performance is the highest when bagging is used with decision trees, with over-sampling treatment for imbalanced data. The effect of treatments for the imbalanced data is maximized when under-sampling is combined with bagging. In addition to the original five classes of injury severity in the MTCF dataset, we consider two additional classification problems, one with two classes and the other with three classes, to (1) investigate the impact of the number of classes on the performance of classification models, and (2) enable comparing our results with the literature.

摘要

本研究旨在以高准确率和灵敏度对机动车事故中的伤害严重程度进行分类。本研究使用的数据集包含了 297113 起车辆碰撞事故,这些数据来自于 2016 年至 2017 年的密歇根交通碰撞事实(MTCF)数据集。与任何其他碰撞数据集一样,MTCF 中不同的事故严重程度类别并不具有同等代表性。为了处理这些不平衡的类别,我们使用了几种技术,包括欠采样和过采样。我们使用了五个分类学习模型(即逻辑回归、决策树、神经网络、梯度提升模型和朴素贝叶斯分类器)对伤害严重程度进行分类,并尝试通过两种训练-测试方法(即自举聚合(或套袋)和多数投票)来提高分类性能。此外,由于数据集存在不平衡,我们使用几何平均值(G-mean)来评估分类性能。我们发现,当对不平衡数据进行过采样处理并使用决策树进行套袋时,分类性能最高。当与套袋结合使用欠采样时,对不平衡数据的处理效果最大化。除了 MTCF 数据集中原始的五种伤害严重程度类别外,我们还考虑了另外两种分类问题,一种是两类,另一种是三类,目的是(1)研究类别的数量对分类模型性能的影响,(2)使我们的结果与文献进行比较。

相似文献

1
Classification of motor vehicle crash injury severity: A hybrid approach for imbalanced data.机动车碰撞伤害严重程度分类:一种针对不平衡数据的混合方法。
Accid Anal Prev. 2018 Nov;120:250-261. doi: 10.1016/j.aap.2018.08.025. Epub 2018 Aug 30.
2
Crash severity along rural mountainous highways in Malaysia: An application of a combined decision tree and logistic regression model.马来西亚农村山区公路的撞车严重程度:决策树与逻辑回归模型相结合的应用
Traffic Inj Prev. 2018;19(7):741-748. doi: 10.1080/15389588.2018.1482537. Epub 2018 Nov 6.
3
A multinomial logit model-Bayesian network hybrid approach for driver injury severity analyses in rear-end crashes.多变量逻辑回归模型-贝叶斯网络混合方法在后车碰撞中分析驾驶员损伤严重程度。
Accid Anal Prev. 2015 Jul;80:76-88. doi: 10.1016/j.aap.2015.03.036. Epub 2015 Apr 16.
4
Assessment of tire failure related crashes and injury severity on a mountainous freeway: Bayesian binary logit approach.山区高速公路轮胎失效相关碰撞事故及人员伤害严重程度评估:贝叶斯二项逻辑回归方法。
Accid Anal Prev. 2020 Sep;145:105693. doi: 10.1016/j.aap.2020.105693. Epub 2020 Jul 25.
5
Differences in injury severities between 2-vehicle and 3-vehicle crashes.两车事故与三车事故损伤严重程度的差异。
Traffic Inj Prev. 2015;16:289-97. doi: 10.1080/15389588.2014.936012.
6
An explanatory analysis of driver injury severity in rear-end crashes using a decision table/Naïve Bayes (DTNB) hybrid classifier.使用决策表/朴素贝叶斯(DTNB)混合分类器对追尾碰撞中驾驶员伤害严重程度进行解释性分析。
Accid Anal Prev. 2016 May;90:95-107. doi: 10.1016/j.aap.2016.02.002. Epub 2016 Feb 27.
7
Crash injury severity analysis using a two-layer Stacking framework.基于两层堆叠框架的碰撞损伤严重度分析。
Accid Anal Prev. 2019 Jan;122:226-238. doi: 10.1016/j.aap.2018.10.016. Epub 2018 Nov 1.
8
Comparison and validation of injury risk classifiers for advanced automated crash notification systems.先进自动碰撞通知系统损伤风险分类器的比较与验证
Traffic Inj Prev. 2014;15 Suppl 1:S126-33. doi: 10.1080/15389588.2014.927577.
9
Overloading among crash-involved vehicles in China: identification of factors associated with overloading and crash severity.中国涉及碰撞事故车辆的超载情况:与超载和碰撞严重程度相关因素的识别。
Inj Prev. 2019 Feb;25(1):36-46. doi: 10.1136/injuryprev-2017-042599. Epub 2018 Mar 21.
10
Investigating the effect of modeling single-vehicle and multi-vehicle crashes separately on confidence intervals of Poisson-gamma models.分别研究单独建模单车事故和多车事故对泊松-伽马模型置信区间的影响。
Accid Anal Prev. 2010 Jul;42(4):1273-82. doi: 10.1016/j.aap.2010.02.004. Epub 2010 Mar 12.

引用本文的文献

1
Prediction and interpretive of motor vehicle traffic crashes severity based on random forest optimized by meta-heuristic algorithm.基于元启发式算法优化的随机森林对机动车交通事故严重程度的预测与解释
Heliyon. 2024 Aug 8;10(16):e35595. doi: 10.1016/j.heliyon.2024.e35595. eCollection 2024 Aug 30.
2
Identification of determinant factors for crash severity levels occurred in Addis Ababa City, Ethiopia, from 2017 to 2020: using ordinal logistic regression model approach.确定 2017 年至 2020 年在埃塞俄比亚亚的斯亚贝巴市发生的碰撞严重程度级别的决定因素:使用有序逻辑回归模型方法。
BMC Public Health. 2023 Sep 29;23(1):1884. doi: 10.1186/s12889-023-16785-3.
3
Classification of truck-involved crash severity: Dealing with missing, imbalanced, and high dimensional safety data.
卡车事故严重程度分类:处理缺失、不平衡和高维安全数据。
PLoS One. 2023 Mar 22;18(3):e0281901. doi: 10.1371/journal.pone.0281901. eCollection 2023.
4
Hybrid feature selection-based machine learning Classification system for the prediction of injury severity in single and multiple-vehicle accidents.基于混合特征选择的机器学习分类系统,用于预测单车事故和多车事故中的伤害严重程度。
PLoS One. 2022 Feb 2;17(2):e0262941. doi: 10.1371/journal.pone.0262941. eCollection 2022.
5
Crash Injury Severity Prediction Using an Ordinal Classification Machine Learning Approach.基于有序分类机器学习方法的碰撞损伤严重程度预测
Int J Environ Res Public Health. 2021 Nov 3;18(21):11564. doi: 10.3390/ijerph182111564.
6
Comparison of Prediction Models for Mortality Related to Injuries from Road Traffic Accidents after Correcting for Undersampling.校正欠抽样后道路交通事故伤害相关死亡率预测模型的比较
Int J Environ Res Public Health. 2021 May 24;18(11):5604. doi: 10.3390/ijerph18115604.
7
Exploring the mechanism of crashes with automated vehicles using statistical modeling approaches.利用统计建模方法探索自动驾驶汽车事故的机制。
PLoS One. 2019 Mar 28;14(3):e0214550. doi: 10.1371/journal.pone.0214550. eCollection 2019.