Xie Yuanchang, Lord Dominique, Zhang Yunlong
Zachry Department of Civil Engineering, Texas A&M University, 3136 TAMU, College Station, TX 77843-3136, United States.
Accid Anal Prev. 2007 Sep;39(5):922-33. doi: 10.1016/j.aap.2006.12.014. Epub 2007 Feb 16.
Statistical models have frequently been used in highway safety studies. They can be utilized for various purposes, including establishing relationships between variables, screening covariates and predicting values. Generalized linear models (GLM) and hierarchical Bayes models (HBM) have been the most common types of model favored by transportation safety analysts. Over the last few years, researchers have proposed the back-propagation neural network (BPNN) model for modeling the phenomenon under study. Compared to GLMs and HBMs, BPNNs have received much less attention in highway safety modeling. The reasons are attributed to the complexity for estimating this kind of model as well as the problem related to "over-fitting" the data. To circumvent the latter problem, some statisticians have proposed the use of Bayesian neural network (BNN) models. These models have been shown to perform better than BPNN models while at the same time reducing the difficulty associated with over-fitting the data. The objective of this study is to evaluate the application of BNN models for predicting motor vehicle crashes. To accomplish this objective, a series of models was estimated using data collected on rural frontage roads in Texas. Three types of models were compared: BPNN, BNN and the negative binomial (NB) regression models. The results of this study show that in general both types of neural network models perform better than the NB regression model in terms of data prediction. Although the BPNN model can occasionally provide better or approximately equivalent prediction performance compared to the BNN model, in most cases its prediction performance is worse than the BNN model. In addition, the data fitting performance of the BPNN model is consistently worse than the BNN model, which suggests that the BNN model has better generalization abilities than the BPNN model and can effectively alleviate the over-fitting problem without significantly compromising the nonlinear approximation ability. The results also show that BNNs could be used for other useful analyses in highway safety, including the development of accident modification factors and for improving the prediction capabilities for evaluating different highway design alternatives.
统计模型在公路安全研究中经常被使用。它们可用于各种目的,包括建立变量之间的关系、筛选协变量和预测值。广义线性模型(GLM)和分层贝叶斯模型(HBM)一直是交通安全分析师最喜欢的最常见模型类型。在过去几年中,研究人员提出了反向传播神经网络(BPNN)模型来对所研究的现象进行建模。与GLM和HBM相比,BPNN在公路安全建模中受到的关注要少得多。原因在于估计这类模型的复杂性以及与数据“过度拟合”相关的问题。为了规避后一个问题,一些统计学家提出使用贝叶斯神经网络(BNN)模型。这些模型已被证明比BPNN模型表现更好,同时降低了与数据过度拟合相关的难度。本研究的目的是评估BNN模型在预测机动车碰撞方面的应用。为实现这一目标,使用在德克萨斯州农村临街道路上收集的数据估计了一系列模型。比较了三种类型的模型:BPNN、BNN和负二项式(NB)回归模型。本研究结果表明,总体而言,两种类型的神经网络模型在数据预测方面都比NB回归模型表现更好。虽然BPNN模型偶尔可以提供比BNN模型更好或近似等效的预测性能,但在大多数情况下,其预测性能比BNN模型差。此外,BPNN模型的数据拟合性能始终比BNN模型差,这表明BNN模型比BPNN模型具有更好的泛化能力,并且可以有效缓解过度拟合问题,而不会显著损害非线性逼近能力。结果还表明,BNN可用于公路安全中的其他有用分析,包括事故修正因子的开发以及提高评估不同公路设计方案的预测能力。