Suppr超能文献

交通事故数据中的漏报、参数偏差以及损伤严重程度模型的结构

Underreporting in traffic accident data, bias in parameters and the structure of injury severity models.

作者信息

Yamamoto Toshiyuki, Hashiji Junpei, Shankar Venkataraman N

机构信息

Department of Civil Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Japan.

出版信息

Accid Anal Prev. 2008 Jul;40(4):1320-9. doi: 10.1016/j.aap.2007.10.016. Epub 2008 Mar 5.

Abstract

Injury severities in traffic accidents are usually recorded on ordinal scales, and statistical models have been applied to investigate the effects of driver factors, vehicle characteristics, road geometrics and environmental conditions on injury severity. The unknown parameters in the models are in general estimated assuming random sampling from the population. Traffic accident data however suffer from underreporting effects, especially for lower injury severities. As a result, traffic accident data can be regarded as outcome-based samples with unknown population shares of the injury severities. An outcome-based sample is overrepresented by accidents of higher severities. As a result, outcome-based samples result in biased parameters which skew our inferences on the effect of key safety variables such as safety belt usage. The pseudo-likelihood function for the case with unknown population shares, which is the same as the conditional maximum likelihood for the case with known population shares, is applied in this study to examine the effects of severity underreporting on the parameter estimates. Sequential binary probit models and ordered-response probit models of injury severity are developed and compared in this study. Sequential binary probit models assume that the factors determining the severity change according to the level of the severity itself, while ordered-response probit models assume that the same factors correlate across all levels of severity. Estimation results suggest that the sequential binary probit models outperform the ordered-response probit models, and that the coefficient estimates for lap and shoulder belt use are biased if underreporting is not considered. Mean parameter bias due to underreporting can be significant. The findings show that underreporting on the outcome dimension may induce bias in inferences on a variety of factors. In particular, if underreporting is not accounted for, the marginal impacts of a variety of factors appear to be overestimated. Fixed objects and environmental conditions are overestimated in their impact on injury severity, as is the effect of separate lap and shoulder belt use. Combined lap and shoulder belt usage appears to be unaffected. The parameter bias is most pronounced when underreporting of possible injury accidents in addition to property damage only accidents is taken into account.

摘要

交通事故中的伤害严重程度通常按有序尺度记录,并且已经应用统计模型来研究驾驶员因素、车辆特征、道路几何形状和环境条件对伤害严重程度的影响。模型中的未知参数通常在假设从总体中随机抽样的情况下进行估计。然而,交通事故数据存在漏报效应,尤其是对于较低的伤害严重程度。因此,交通事故数据可被视为基于结果的样本,其中伤害严重程度的总体比例未知。基于结果的样本中,较高严重程度的事故占比过高。结果,基于结果的样本会导致参数偏差,从而扭曲我们对诸如安全带使用等关键安全变量影响的推断。本研究应用了总体比例未知情况下的伪似然函数,它与总体比例已知情况下的条件最大似然函数相同,以检验严重程度漏报对参数估计的影响。本研究开发并比较了伤害严重程度的顺序二元概率模型和有序响应概率模型。顺序二元概率模型假设决定严重程度的因素会根据严重程度本身的水平而变化,而有序响应概率模型假设相同的因素在所有严重程度水平上都具有相关性。估计结果表明,顺序二元概率模型优于有序响应概率模型,并且如果不考虑漏报情况,腰部安全带和肩部安全带使用的系数估计会产生偏差。由于漏报导致的平均参数偏差可能很大。研究结果表明,结果维度上的漏报可能会在对各种因素的推断中引发偏差。特别是,如果不考虑漏报情况,各种因素的边际影响似乎会被高估。固定物体和环境条件对伤害严重程度的影响被高估,单独使用腰部安全带和肩部安全带的效果也是如此。腰部安全带和肩部安全带联合使用的效果似乎不受影响。当除了仅造成财产损失的事故外,还考虑可能的伤害事故的漏报情况时,参数偏差最为明显。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验