• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于对具有重尾的过度分散计数数据进行建模的半参数负二项广义线性模型:特征及在碰撞数据中的应用

A semiparametric negative binomial generalized linear model for modeling over-dispersed count data with a heavy tail: Characteristics and applications to crash data.

作者信息

Shirazi Mohammadali, Lord Dominique, Dhavala Soma Sekhar, Geedipally Srinivas Reddy

机构信息

Zachry Department of Civil Engineering, Texas A&M University, College Station, TX 77843, United States.

Perceptron Learning Solutions Pvt Ltd, Bengaluru, India.

出版信息

Accid Anal Prev. 2016 Jun;91:10-8. doi: 10.1016/j.aap.2016.02.020. Epub 2016 Mar 3.

DOI:10.1016/j.aap.2016.02.020
PMID:26945472
Abstract

Crash data can often be characterized by over-dispersion, heavy (long) tail and many observations with the value zero. Over the last few years, a small number of researchers have started developing and applying novel and innovative multi-parameter models to analyze such data. These multi-parameter models have been proposed for overcoming the limitations of the traditional negative binomial (NB) model, which cannot handle this kind of data efficiently. The research documented in this paper continues the work related to multi-parameter models. The objective of this paper is to document the development and application of a flexible NB generalized linear model with randomly distributed mixed effects characterized by the Dirichlet process (NB-DP) to model crash data. The objective of the study was accomplished using two datasets. The new model was compared to the NB and the recently introduced model based on the mixture of the NB and Lindley (NB-L) distributions. Overall, the research study shows that the NB-DP model offers a better performance than the NB model once data are over-dispersed and have a heavy tail. The NB-DP performed better than the NB-L when the dataset has a heavy tail, but a smaller percentage of zeros. However, both models performed similarly when the dataset contained a large amount of zeros. In addition to a greater flexibility, the NB-DP provides a clustering by-product that allows the safety analyst to better understand the characteristics of the data, such as the identification of outliers and sources of dispersion.

摘要

碰撞数据通常具有过度离散、重(长)尾以及许多值为零的观测值等特征。在过去几年中,少数研究人员已开始开发和应用新颖创新的多参数模型来分析此类数据。提出这些多参数模型是为了克服传统负二项式(NB)模型的局限性,传统负二项式模型无法有效处理这类数据。本文记录的研究延续了与多参数模型相关的工作。本文的目的是记录一种灵活的具有狄利克雷过程(NB-DP)特征的随机分布混合效应的NB广义线性模型的开发和应用,以对碰撞数据进行建模。该研究目标通过使用两个数据集得以实现。将新模型与NB模型以及最近基于NB和林德利(NB-L)分布混合引入的模型进行了比较。总体而言,研究表明,一旦数据过度离散且具有重尾,NB-DP模型比NB模型具有更好的性能。当数据集具有重尾但零值百分比较小时,NB-DP的表现优于NB-L。然而,当数据集中包含大量零值时,两种模型的表现相似。除了具有更大的灵活性外,NB-DP还提供了一个聚类副产品,使安全分析师能够更好地理解数据的特征,例如异常值的识别和离散源。

相似文献

1
A semiparametric negative binomial generalized linear model for modeling over-dispersed count data with a heavy tail: Characteristics and applications to crash data.一种用于对具有重尾的过度分散计数数据进行建模的半参数负二项广义线性模型:特征及在碰撞数据中的应用
Accid Anal Prev. 2016 Jun;91:10-8. doi: 10.1016/j.aap.2016.02.020. Epub 2016 Mar 3.
2
The negative binomial-Lindley generalized linear model: characteristics and application using crash data.负二项-林德利广义线性模型:基于事故数据的特点与应用
Accid Anal Prev. 2012 Mar;45:258-65. doi: 10.1016/j.aap.2011.07.012. Epub 2011 Aug 6.
3
The negative binomial-Lindley distribution as a tool for analyzing crash data characterized by a large amount of zeros.负二项式-林德利分布作为一种分析具有大量零值的碰撞数据的工具。
Accid Anal Prev. 2011 Sep;43(5):1738-42. doi: 10.1016/j.aap.2011.04.004. Epub 2011 Apr 29.
4
Finite mixture Negative Binomial-Lindley for modeling heterogeneous crash data with many zero observations.有限混合负二项式-林德利模型在具有大量零观测值的异质碰撞数据建模中的应用。
Accid Anal Prev. 2022 Sep;175:106765. doi: 10.1016/j.aap.2022.106765. Epub 2022 Aug 7.
5
Derivation of the Empirical Bayesian method for the Negative Binomial-Lindley generalized linear model with application in traffic safety.经验贝叶斯方法在负二项式-林德利广义线性模型中的推导及其在交通安全中的应用。
Accid Anal Prev. 2022 Jun;170:106638. doi: 10.1016/j.aap.2022.106638. Epub 2022 Mar 24.
6
Assessing the Negative Binomial-Lindley model for crash hotspot identification: Insights from Monte Carlo simulation analysis.评估用于事故热点识别的负二项式-林德利模型:蒙特卡罗模拟分析的见解。
Accid Anal Prev. 2024 May;199:107478. doi: 10.1016/j.aap.2024.107478. Epub 2024 Mar 7.
7
The negative Binomial-Lindley model with Time-Dependent Parameters: Accounting for temporal variations and excess zero observations in crash data.带时变参数的负二项式-Lindley 模型:在碰撞数据中考虑时间变化和过零观测。
Accid Anal Prev. 2024 Nov;207:107711. doi: 10.1016/j.aap.2024.107711. Epub 2024 Jul 30.
8
Application of the Conway-Maxwell-Poisson generalized linear model for analyzing motor vehicle crashes.康威-麦克斯韦-泊松广义线性模型在分析机动车碰撞事故中的应用。
Accid Anal Prev. 2008 May;40(3):1123-34. doi: 10.1016/j.aap.2007.12.003. Epub 2008 Jan 4.
9
Crash data modeling with a generalized estimator.广义估计器的碰撞数据建模。
Accid Anal Prev. 2018 Aug;117:340-345. doi: 10.1016/j.aap.2018.04.026. Epub 2018 May 12.
10
Functional forms of the negative binomial models in safety performance functions for rural two-lane intersections.安全性能函数中农村双车道交叉口负二项模型的功能形式。
Accid Anal Prev. 2019 Mar;124:193-201. doi: 10.1016/j.aap.2019.01.015. Epub 2019 Jan 18.

引用本文的文献

1
Effects of a Habitat Integrity Gradient on the Diversity of Odonates in the Legal Amazonia Zone of the Brazilian State of Maranhão.栖息地完整性梯度对巴西马拉尼昂州法定亚马逊地区蜻蜓目多样性的影响。
Neotrop Entomol. 2025 Jan 17;54(1):24. doi: 10.1007/s13744-024-01240-8.
2
Identification of Two Molecular Subtypes of Hepatocellular Carcinoma Based on Dysregulated Immune LncRNAs.基于失调的免疫长链非编码RNA鉴定肝细胞癌的两种分子亚型
Front Mol Biosci. 2021 Nov 23;8:625858. doi: 10.3389/fmolb.2021.625858. eCollection 2021.
3
Mapping wind erosion hazard with regression-based machine learning algorithms.
基于回归的机器学习算法进行风蚀危害制图。
Sci Rep. 2020 Nov 24;10(1):20494. doi: 10.1038/s41598-020-77567-0.
4
The Application of Non-Parametric Count Models for the Modeling of Female's Accident Rates in Hamadan Province from 2009 to 2016.非参数计数模型在2009年至2016年哈马丹省女性事故率建模中的应用
Iran J Public Health. 2020 Apr;49(4):763-772.
5
Subject level clustering using a negative binomial model for small transcriptomic studies.使用负二项模型进行小转录组研究的主题水平聚类。
BMC Bioinformatics. 2018 Dec 12;19(1):474. doi: 10.1186/s12859-018-2556-9.