Faculty of Architecture, The University of Hong Kong, Hong Kong, China.
Department of Urban Planning and Design, The University of Hong Kong, Hong Kong, China.
Accid Anal Prev. 2021 Dec;163:106431. doi: 10.1016/j.aap.2021.106431. Epub 2021 Nov 7.
With the fast development of economics, road safety is becoming a serious problem. Exploring macro factors is effective to improve road safety. However, the existing studies have some limitations: (1) The existing studies only considered one aspect of macro factors and constructed models based on a few data samples. (2) The methods commonly used cannot address the non-linear relationship or calculate the feature importance. The findings obtained from such models may be limited and biased. To address the limitations, this study proposes a BO-CV-XGBoost framework to explore the macro factors related to traffic fatality rate classes based on a high-dimensional dataset that fully considers the impact of multi-factor interaction with adequate data samples. The proposed framework is applied to a dataset in the US. 453 county-level macro factors are collected from various data sources, covering ten macro aspects, including topography, transportation, etc. The optimized BO-CV-XGBoost model obtains the best classification performance with an AUC of 0.8977 and an accuracy of 85.02%. Compared with other methods, the proposed model has superiority on fatality rate classification. Ten macro factors are identified, including 'Current-dollar GDP', 'highway miles per person', etc. The ten factors contain four aspects of information, including economics, transportation, education, and medical condition. Geographic information system (GIS) techniques are further used for spatial analysis of the identified macro factors. Therefore, targeted and effective measures are accordingly proposed to prevent traffic fatalities and improve road safety.
随着经济的快速发展,道路安全正成为一个严重的问题。探索宏观因素对于提高道路安全水平是有效的。然而,现有研究存在一些局限性:(1)现有研究仅考虑了宏观因素的一个方面,并基于少数数据样本构建模型。(2)常用的方法无法解决非线性关系或计算特征重要性。从这些模型中获得的发现可能是有限的和有偏差的。为了解决这些局限性,本研究提出了一个基于高维数据集的 BO-CV-XGBoost 框架,以探索与交通事故死亡率等级相关的宏观因素,该框架充分考虑了多因素相互作用的影响,并使用了充足的数据样本。该框架应用于美国的一个数据集。从各种数据源中收集了 453 个县级宏观因素,涵盖了十个宏观方面,包括地形、交通等。优化后的 BO-CV-XGBoost 模型获得了最佳的分类性能,AUC 为 0.8977,准确率为 85.02%。与其他方法相比,所提出的模型在死亡率分类方面具有优势。确定了十个宏观因素,包括“当前美元 GDP”、“人均高速公路里程”等。这十个因素包含了经济、交通、教育和医疗状况四个方面的信息。地理信息系统(GIS)技术进一步用于对确定的宏观因素进行空间分析。因此,提出了有针对性和有效的措施来预防交通事故和提高道路安全水平。