Department of Civil and Environmental Engineering, Carnegie Mellon University, Pittsburgh, PA 15213, United States.
Department of Civil and Environmental Engineering, Carnegie Mellon University, Pittsburgh, PA 15213, United States; Heinz College, Carnegie Mellon University, Pittsburgh, PA 15213, United States.
Accid Anal Prev. 2022 Nov;177:106811. doi: 10.1016/j.aap.2022.106811. Epub 2022 Sep 10.
The increasing number of work zone crashes has been a significant concern for road users, transportation agencies, and researchers. Crashes can be caused by work zones, and this effect changes across different work zone configurations, traffic volumes, roadway functional classifications, and weather conditions. This is typically represented by Crash Modification Functions (CMFunctions). However, current methods for developing work zone CMFunctions have two major limitations: (1) They focus on analyzing statistical associations and fail to mitigate the confounding bias due to possible unobservable roadway characteristics; and (2) They cannot address CMFunctions of multiple variables simultaneously, such as weather and traffic conditions, since they are represented using mixed data types (continuous and categorical) that could potentially affect the causal effect of work zones on crashes. In this study, we develop a method that utilizes causal forest with fixed-effect modeling to mitigate the confounding bias while identifying CMFunctions conditioning on various environmental characteristics, including work zone configurations, traffic volume, roadway functional classification, and weather conditions. The developed method was applied to 3378 work zones that occurred in Pennsylvania between 2015 and 2017. The results were validated via a series of robustness tests. The validations demonstrate that this method can mitigate the confounding bias and identify CMFunctions of multiple variables. The results also show that the causal effect of a work zone on crash occurrence is significantly positive (p<0.05) on roadways with high traffic volumes (e.g., > 20,000 vehicles per day) and on medium length (e.g., 2000 to 5000 m) work zones. It appears that having medium-long (e.g., between 6000 and 8000 m) work zones or long duration (e.g., longer than 4 h) work zones do not necessarily lead to extra crashes.
工作区事故数量的增加一直是道路使用者、交通管理机构和研究人员的重大关注点。事故可能是由工作区引起的,这种影响因不同的工作区配置、交通量、道路功能分类和天气条件而变化。这通常由碰撞修正函数(CMFunctions)表示。然而,目前开发工作区 CMFunctions 的方法有两个主要局限性:(1)它们侧重于分析统计关联,而未能减轻由于可能不可观测的道路特征而导致的混淆偏差;(2)它们无法同时解决多个变量的 CMFunctions,例如天气和交通条件,因为它们使用混合数据类型(连续和分类)表示,这可能会影响工作区对事故的因果效应。在这项研究中,我们开发了一种利用因果森林和固定效应建模的方法,以减轻混淆偏差,同时确定各种环境特征(包括工作区配置、交通量、道路功能分类和天气条件)条件下的 CMFunctions。所开发的方法应用于 2015 年至 2017 年期间在宾夕法尼亚州发生的 3378 个工作区。通过一系列稳健性测试验证了结果。验证表明,该方法可以减轻混淆偏差并识别多个变量的 CMFunctions。结果还表明,工作区对碰撞发生的因果效应在交通量大(例如,每天超过 20,000 辆)和中等长度(例如,2000 至 5000 米)的道路上显著为正(p<0.05)。似乎拥有中等长度(例如,6000 至 8000 米之间)或长持续时间(例如,长于 4 小时)的工作区并不一定会导致额外的碰撞。