Miaou S P, Lum H
Center for Transportation Analysis, Oak Ridge National Laboratory, TN 37831.
Accid Anal Prev. 1993 Dec;25(6):689-709. doi: 10.1016/0001-4575(93)90034-t.
The statistical properties of four regression models--two conventional linear regression models and two Poisson regression models--are investigated in terms of their ability to model vehicle accidents and highway geometric design relationships. Potential limitations of these models pertaining to their underlying distributional assumptions, estimation procedures, functional form of accident rate, and sensitivity to short road sections, are identified. Important issues, such as the treatment of vehicle exposure and traffic conditions, and data uncertainties due to sampling and nonsampling errors, are also discussed. Roadway and truck accident data from the Highway Safety Information System (HSIS), a highway safety data base administered by the Federal Highway Administration (FHWA), have been employed to illustrate the use and the limitations of these models. It is demonstrated that the conventional linear regression models lack the distributional property to describe adequately random, discrete, nonnegative, and typically sporadic vehicle accident events on the road. As a result, these models are not appropriate to make probabilistic statements about vehicle accidents, and the test statistics derived from these models are questionable. The Poisson regression models, on the other hand, possess most of the desirable statistical properties in developing the relationships. However, if the vehicle accident data are found to be significantly overdispersed relative to its mean, then using the Poisson regression models may overstate or understate the likelihood of vehicle accidents on the road. More general probability distributions may have to be considered.
研究了四种回归模型——两种传统线性回归模型和两种泊松回归模型——在模拟车辆事故与公路几何设计关系方面的统计特性。确定了这些模型在其潜在分布假设、估计程序、事故率函数形式以及对短路段敏感性方面的潜在局限性。还讨论了一些重要问题,如车辆暴露和交通条件的处理,以及由于抽样和非抽样误差导致的数据不确定性。已采用由联邦公路管理局(FHWA)管理的公路安全数据库——公路安全信息系统(HSIS)中的道路和卡车事故数据,来说明这些模型的使用和局限性。结果表明,传统线性回归模型缺乏充分描述道路上随机、离散、非负且通常为偶发性车辆事故事件的分布特性。因此,这些模型不适用于对车辆事故进行概率陈述,并且从这些模型得出的检验统计量也值得怀疑。另一方面,泊松回归模型在建立关系时具有大多数理想的统计特性。然而,如果发现车辆事故数据相对于其均值存在显著的过度离散,那么使用泊松回归模型可能会高估或低估道路上车辆事故的可能性。可能不得不考虑更一般的概率分布。