关于使用零膨胀模型和障碍模型对疫苗不良事件计数数据进行建模

On the use of zero-inflated and hurdle models for modeling vaccine adverse event count data.

作者信息

Rose C E, Martin S W, Wannemuehler K A, Plikaytis B D

机构信息

Bacterial Vaccine-Preventable Diseases Branch, Division of Epidemiology and Surveillance, CDC, Atlanta, Georgia 30333, USA.

出版信息

J Biopharm Stat. 2006;16(4):463-81. doi: 10.1080/10543400600719384.

DOI:10.1080/10543400600719384

PMID:16892908

Abstract

We compared several modeling strategies for vaccine adverse event count data in which the data are characterized by excess zeroes and heteroskedasticity. Count data are routinely modeled using Poisson and Negative Binomial (NB) regression but zero-inflated and hurdle models may be advantageous in this setting. Here we compared the fit of the Poisson, Negative Binomial (NB), zero-inflated Poisson (ZIP), zero-inflated Negative Binomial (ZINB), Poisson Hurdle (PH), and Negative Binomial Hurdle (NBH) models. In general, for public health studies, we may conceptualize zero-inflated models as allowing zeroes to arise from at-risk and not-at-risk populations. In contrast, hurdle models may be conceptualized as having zeroes only from an at-risk population. Our results illustrate, for our data, that the ZINB and NBH models are preferred but these models are indistinguishable with respect to fit. Choosing between the zero-inflated and hurdle modeling framework, assuming Poisson and NB models are inadequate because of excess zeroes, should generally be based on the study design and purpose. If the study's purpose is inference then modeling framework should be considered. For example, if the study design leads to count endpoints with both structural and sample zeroes then generally the zero-inflated modeling framework is more appropriate, while in contrast, if the endpoint of interest, by design, only exhibits sample zeroes (e.g., at-risk participants) then the hurdle model framework is generally preferred. Conversely, if the study's primary purpose it is to develop a prediction model then both the zero-inflated and hurdle modeling frameworks should be adequate.

摘要

我们比较了几种针对疫苗不良事件计数数据的建模策略，这类数据的特点是存在过多零值和异方差性。计数数据通常使用泊松回归和负二项式（NB）回归进行建模，但在这种情况下，零膨胀模型和门槛模型可能更具优势。在此，我们比较了泊松模型、负二项式（NB）模型、零膨胀泊松（ZIP）模型、零膨胀负二项式（ZINB）模型、泊松门槛（PH）模型和负二项式门槛（NBH）模型的拟合情况。一般来说，对于公共卫生研究，我们可以将零膨胀模型理解为允许零值来自有风险和无风险人群。相比之下，门槛模型可以理解为零值仅来自有风险人群。我们的结果表明，对于我们的数据，ZINB模型和NBH模型更受青睐，但就拟合度而言，这些模型难以区分。在零膨胀模型和门槛模型框架之间进行选择时，假设由于过多零值而使泊松模型和NB模型不适用，通常应基于研究设计和目的。如果研究目的是进行推断，那么应考虑建模框架。例如，如果研究设计导致计数终点既有结构零值又有样本零值，那么一般零膨胀建模框架更合适，相反，如果感兴趣的终点按设计仅呈现样本零值（如有风险参与者），那么通常更倾向于门槛模型框架。相反，如果研究的主要目的是开发预测模型，那么零膨胀模型和门槛模型框架都应该适用。

相似文献

On the use of zero-inflated and hurdle models for modeling vaccine adverse event count data.

J Biopharm Stat. 2006;16(4):463-81. doi: 10.1080/10543400600719384.

Statistical modelling of falls count data with excess zeros.

Inj Prev. 2011 Aug;17(4):266-70. doi: 10.1136/ip.2011.031740. Epub 2011 Jun 8.

Statistical modelling for falls count data.

Accid Anal Prev. 2010 Mar;42(2):384-92. doi: 10.1016/j.aap.2009.08.018. Epub 2009 Oct 1.

The utility of the zero-inflated Poisson and zero-inflated negative binomial models: a case study of cross-sectional and longitudinal DMF data examining the effect of socio-economic status.

Community Dent Oral Epidemiol. 2004 Jun;32(3):183-9. doi: 10.1111/j.1600-0528.2004.00155.x.

Poisson, Poisson-gamma and zero-inflated regression models of motor vehicle crashes: balancing statistical fit and theory.

Accid Anal Prev. 2005 Jan;37(1):35-46. doi: 10.1016/j.aap.2004.02.004.

Count data distributions and their zero-modified equivalents as a framework for modelling microbial data with a relatively high occurrence of zero counts.

Int J Food Microbiol. 2010 Jan 1;136(3):268-77. doi: 10.1016/j.ijfoodmicro.2009.10.016. Epub 2009 Oct 28.

What statistical method should be used to evaluate risk factors associated with dmfs index? Evidence from the National Pathfinder Survey of 4-year-old Italian children.

Community Dent Oral Epidemiol. 2009 Dec;37(6):539-46. doi: 10.1111/j.1600-0528.2009.00500.x. Epub 2009 Oct 21.

Zero inflated statistical count models for analysing the costs imposed by GERD and dyspepsia.

Arab J Gastroenterol. 2013 Dec;14(4):165-8. doi: 10.1016/j.ajg.2013.09.004. Epub 2013 Nov 28.

On performance of parametric and distribution-free models for zero-inflated and over-dispersed count responses.

Stat Med. 2015 Oct 30;34(24):3235-45. doi: 10.1002/sim.6560. Epub 2015 Jun 15.

[Selection of advantage prediction model for forest fire occurrence in Tahe, Daxing'an Mountain].

Ying Yong Sheng Tai Xue Bao. 2014 Mar;25(3):731-7.

引用本文的文献

Modeling County-Level Rare Disease Prevalence Using Bayesian Hierarchical Sampling Weighted Zero-Inflated Regression.

J Data Sci. 2023 Jan;21(1):145-157. doi: 10.6339/22-JDS1049.

Multilevel modeling in single-case studies with zero-inflated and overdispersed count data.

Behav Res Methods. 2024 Apr;56(4):2765-2781. doi: 10.3758/s13428-024-02359-7. Epub 2024 Feb 21.

Spatio-temporal modeling of traffic accidents incidence on urban road networks based on an explicit network triangulation.

J Appl Stat. 2022 Jul 29;50(16):3229-3250. doi: 10.1080/02664763.2022.2104822. eCollection 2023.

Discounting of Hyper-Palatable Food and Money: Associations with Food Addiction Symptoms.

Nutrients. 2023 Sep 16;15(18):4008. doi: 10.3390/nu15184008.

Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data.

R Soc Open Sci. 2023 Aug 23;10(8):221226. doi: 10.1098/rsos.221226. eCollection 2023 Aug.

Environmental Complexity and Reduced Stocking Density Promote Positive Behavioral Outcomes in Broiler Chickens.

Animals (Basel). 2023 Jun 23;13(13):2074. doi: 10.3390/ani13132074.

Impact of MidMed, a general practitioner-led modified comprehensive geriatric assessment for patients with frailty.

Age Ageing. 2023 Mar 1;52(3). doi: 10.1093/ageing/afad006.

Models for Zero-Inflated and Overdispersed Correlated Count Data: An Application to Cigarette Use.

Nicotine Tob Res. 2023 Apr 6;25(5):996-1003. doi: 10.1093/ntr/ntac253.

Enriching captivity conditions with natural elements does not prevent the loss of wild-like gut microbiota but shapes its compositional variation in two small mammals.

Microbiologyopen. 2022 Oct;11(5):e1318. doi: 10.1002/mbo3.1318.

Symptom Presence and Symptom Severity as Unique Indicators of Psychopathology: An Application of Multidimensional Zero-Inflated and Hurdle Graded Response Models.

Educ Psychol Meas. 2022 Oct;82(5):938-966. doi: 10.1177/00131644211061820. Epub 2021 Dec 26.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

关于使用零膨胀模型和障碍模型对疫苗不良事件计数数据进行建模

On the use of zero-inflated and hurdle models for modeling vaccine adverse event count data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献