Virginia Tech, Blacksburg, VA, 24061, USA.
Mobility Technologies, STV Inc, 1818 Market Street, Philadelphia, PA, 19103, USA.
Sci Rep. 2024 Sep 28;14(1):22431. doi: 10.1038/s41598-024-73134-z.
Single-vehicle crashes, particularly those caused by speeding, result in a disproportionately high number of fatalities and serious injuries compared to other types of crashes involving passenger vehicles. This study aims to identify factors that contribute to driver injury severity in single-vehicle crashes using machine learning models and advanced econometric models, namely mixed logit with heterogeneity in means and variances. National Crash data from the Crash Report Sampling System (CRSS) managed by the National Highway Traffic Safety Administration (NHTSA) between 2016 and 2018 were utilized for this study. XGBoost and Random Forest models were employed to identify the most influential variables using SHAP (Shapley Additive Explanations), while a mixed logit model was utilized to model driver injury severity accounting for unobserved heterogeneity in the data collection process. The results revealed a complex interplay of various factors that contribute to driver injury severity in single-vehicle crashes. These factors included driver characteristics such as demographics (male and female drivers, age below 26 years and between 35 and 45 years), driver actions (reckless driving, driving under the influence), restraint usage (lap-shoulder belt usage and unbelted), roadway and traffic characteristics (non-interstate highways, undivided and divided roadways with positive barriers, curved roadways), environmental conditions (clear and daylight conditions), vehicle characteristics (motorcycles, displacement volumes up to 2500 cc and 5,000-10,000 cc, newer vehicles, Chevy and Ford vehicles), crash characteristics (rollover, run-off-road incidents, collisions with trees), temporal characteristics (midnight to 6 AM, 10 AM to 4 PM, 4th quarter of the analysis period: October to December, and the analysis year of 2017). The findings emphasize the significance of driving behavior and roadway design to speeding behavior. These aspects should be given high priority for driver training as well as the design and maintenance of roadways by relevant agencies.
单车事故,尤其是超速引起的单车事故,与涉及乘用车的其他类型事故相比,造成的死亡和重伤人数比例过高。本研究旨在使用机器学习模型和先进的计量经济学模型(即具有均值和方差异质性的混合对数模型)来确定导致单车事故中驾驶员受伤严重程度的因素。本研究使用了国家公路交通安全管理局(NHTSA)管理的 Crash Report Sampling System(CRSS)在 2016 年至 2018 年期间的国家 Crash 数据。使用 SHAP(Shapley Additive Explanations)来确定 XGBoost 和随机森林模型中最具影响力的变量,而混合对数模型则用于在数据收集过程中存在未观测到的异质性的情况下,对驾驶员受伤严重程度进行建模。研究结果揭示了导致单车事故中驾驶员受伤严重程度的各种因素之间的复杂相互作用。这些因素包括驾驶员特征(男性和女性驾驶员、年龄在 26 岁以下和 35 岁至 45 岁之间)、驾驶员行为(鲁莽驾驶、酒后驾驶)、约束使用(安全带使用和未系安全带)、道路和交通特征(非州际公路、无分隔和有分隔的道路,带正隔离栏、弯道)、环境条件(晴朗和白天条件)、车辆特征(摩托车、排量在 2500cc 及以下和 5000-10000cc、较新车辆、雪佛兰和福特车辆)、碰撞特征(翻车、冲出道路事故、与树木碰撞)、时间特征(午夜至 6 点、上午 10 点至下午 4 点、分析期的第四季度:10 月至 12 月以及分析年 2017 年)。研究结果强调了驾驶行为和道路设计对超速行为的重要性。这些方面应作为驾驶员培训以及相关机构进行道路设计和维护的重点。