On performance of parametric and distribution-free models for zero-inflated and over-dispersed count responses.

Authors

Tang Wan, Lu Naiji, Chen Tian, Wang Wenjuan, Gunzler Douglas David, Han Yu, Tu Xin M

Affiliations

Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, U.S.A.

Department of Management, Harbin Institute of Technology, Harbin, China.

Publication

Stat Med. 2015 Oct 30;34(24):3235-45. doi: 10.1002/sim.6560. Epub 2015 Jun 15.

DOI: 10.1002/sim.6560
PMID: 26078035
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4592387/

Abstract

Zero-inflated Poisson (ZIP) and negative binomial (ZINB) models are widely used to model zero-inflated count responses. These models extend the Poisson and negative binomial (NB) to address excessive zeros in the count response. By adding a degenerate distribution centered at 0 and interpreting it as describing a non-risk group in the population, the ZIP (ZINB) models a two-component population mixture. As in applications of Poisson and NB, the key difference between ZIP and ZINB is the allowance for overdispersion by the ZINB in its NB component in modeling the count response for the at-risk group. Overdispersion arising in practice too often does not follow the NB, and applications of ZINB to such data yield invalid inference. If sources of overdispersion are known, other parametric models may be used to directly model the overdispersion. Such models too are subject to assumed distributions. Further, this approach may not be applicable if information about the sources of overdispersion is unavailable. In this paper, we propose a distribution-free alternative and compare its performance with these popular parametric models as well as a moment-based approach proposed by Yu et al. [Statistics in Medicine 2013; 32: 2390-2405]. Like the generalized estimating equations, the proposed approach requires no elaborate distribution assumptions. Compared with the approach of Yu et al., it is more robust to overdispersed zero-inflated responses. We illustrate our approach with both simulated and real study data.
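
The two-component mixture described in the abstract can be made concrete with a short numerical sketch. It is not the distribution-free method the paper proposes; it only illustrates the ZIP and ZINB parametric models that the paper compares against. The symbols are assumptions introduced here for illustration: `rho` is the proportion of the non-risk group (structural zeros), `mu` is the mean count in the at-risk group, and `alpha` is the NB dispersion under the common parameterization in which the NB variance is `mu + alpha * mu**2`.

```python
import math


def poisson_pmf(k, mu):
    """Poisson probability P(Y = k) for mean mu > 0, computed on the log scale."""
    return math.exp(k * math.log(mu) - mu - math.lgamma(k + 1))


def nb_pmf(k, mu, alpha):
    """Negative binomial P(Y = k) with mean mu and variance mu + alpha * mu**2.

    This mean/dispersion parameterization is an assumption of this sketch.
    """
    r = 1.0 / alpha          # NB "size" parameter
    p = r / (r + mu)         # NB success probability
    log_pmf = (math.lgamma(k + r) - math.lgamma(r) - math.lgamma(k + 1)
               + r * math.log(p) + k * math.log(1.0 - p))
    return math.exp(log_pmf)


def zero_inflated_pmf(k, rho, base_pmf):
    """Mix a point mass at 0 (weight rho) with a count distribution (weight 1 - rho)."""
    at_risk = (1.0 - rho) * base_pmf(k)
    return rho + at_risk if k == 0 else at_risk


def moments(pmf, upper=500):
    """Mean and variance of a count distribution by summing over 0..upper-1."""
    mean = sum(k * pmf(k) for k in range(upper))
    var = sum((k - mean) ** 2 * pmf(k) for k in range(upper))
    return mean, var


# Illustrative values only; they are not taken from the paper.
rho, mu, alpha = 0.3, 3.0, 0.5


def zip_pmf(k):
    return zero_inflated_pmf(k, rho, lambda j: poisson_pmf(j, mu))


def zinb_pmf(k):
    return zero_inflated_pmf(k, rho, lambda j: nb_pmf(j, mu, alpha))


for name, pmf in [("ZIP", zip_pmf), ("ZINB", zinb_pmf)]:
    mean, var = moments(pmf)
    print(f"{name}: P(Y=0)={pmf(0):.4f}  mean={mean:.4f}  variance={var:.4f}")
```

Both mixtures share the mean `(1 - rho) * mu`, but the ZINB variance, `(1 - rho) * mu * (1 + mu * (rho + alpha))`, exceeds the ZIP variance, `(1 - rho) * mu * (1 + rho * mu)`, whenever `alpha > 0`. That extra term is the allowance for overdispersion in the at-risk group that the abstract identifies as the key difference between the two models.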

Similar articles

1
On performance of parametric and distribution-free models for zero-inflated and over-dispersed count responses.
Stat Med. 2015 Oct 30;34(24):3235-45. doi: 10.1002/sim.6560. Epub 2015 Jun 15.
2
A comparison of statistical methods for modeling count data with an application to hospital length of stay.
BMC Med Res Methodol. 2022 Aug 4;22(1):211. doi: 10.1186/s12874-022-01685-8.
3
Multilevel modeling in single-case studies with zero-inflated and overdispersed count data.
Behav Res Methods. 2024 Apr;56(4):2765-2781. doi: 10.3758/s13428-024-02359-7. Epub 2024 Feb 21.
4
Distribution-free Inference of Zero-inflated Binomial Data for Longitudinal Studies.
J Appl Stat. 2015 Oct 1;42(10):2203-2219. doi: 10.1080/02664763.2015.1023270. Epub 2015 Mar 18.
5
On the use of zero-inflated and hurdle models for modeling vaccine adverse event count data.
J Biopharm Stat. 2006;16(4):463-81. doi: 10.1080/10543400600719384.
6
Statistical modelling of falls count data with excess zeros.
Inj Prev. 2011 Aug;17(4):266-70. doi: 10.1136/ip.2011.031740. Epub 2011 Jun 8.
7
Marginalized zero-inflated negative binomial regression with application to dental caries.
Stat Med. 2016 May 10;35(10):1722-35. doi: 10.1002/sim.6804. Epub 2015 Nov 15.
8
A semiparametric marginalized zero-inflated model for analyzing healthcare utilization panel data with missingness.
J Appl Stat. 2019;46(16):2862-2883. doi: 10.1080/02664763.2019.1620705. Epub 2019 May 22.
9
Variable selection for distribution-free models for longitudinal zero-inflated count responses.
Stat Med. 2016 Jul 20;35(16):2770-85. doi: 10.1002/sim.6892. Epub 2016 Feb 4.
10
Analyzing hospitalization data: potential limitations of Poisson regression.
Nephrol Dial Transplant. 2015 Aug;30(8):1244-9. doi: 10.1093/ndt/gfv071. Epub 2015 Mar 25.

Cited by

1
Modeling County-Level Rare Disease Prevalence Using Bayesian Hierarchical Sampling Weighted Zero-Inflated Regression.
J Data Sci. 2023 Jan;21(1):145-157. doi: 10.6339/22-JDS1049.
2
A comparison of statistical methods for modeling count data with an application to hospital length of stay.
BMC Med Res Methodol. 2022 Aug 4;22(1):211. doi: 10.1186/s12874-022-01685-8.
3
A Distribution-Free Model for Longitudinal Metagenomic Count Data.
Genes (Basel). 2022 Jul 1;13(7):1183. doi: 10.3390/genes13071183.
4
Relationship between political partisanship and COVID-19 deaths: future implications for public health.
J Public Health (Oxf). 2022 Aug 25;44(3):716-723. doi: 10.1093/pubmed/fdab136.
5
A semiparametric marginalized zero-inflated model for analyzing healthcare utilization panel data with missingness.
J Appl Stat. 2019;46(16):2862-2883. doi: 10.1080/02664763.2019.1620705. Epub 2019 May 22.
6
A GEE-type approach to untangle structural and random zeros in predictors.
Stat Methods Med Res. 2019 Dec;28(12):3683-3696. doi: 10.1177/0962280218812228. Epub 2018 Nov 26.
7
A test of inflated zeros for Poisson regression models.
Stat Methods Med Res. 2019 Apr;28(4):1157-1169. doi: 10.1177/0962280217749991. Epub 2017 Dec 28.

References

1
Predictors and moderators of outcomes of HIV/STD sex risk reduction interventions in substance abuse treatment programs: a pooled analysis of two randomized controlled trials.
Subst Abuse Treat Prev Policy. 2014 Jan 16;9:3. doi: 10.1186/1747-597X-9-3.
2
A class of distribution-free models for longitudinal mediation analysis.
Psychometrika. 2014 Oct;79(4):543-68. doi: 10.1007/s11336-013-9355-z. Epub 2013 Nov 22.
3
Causal inference for Mann-Whitney-Wilcoxon rank sum and other nonparametric statistics.
Stat Med. 2014 Apr 15;33(8):1261-71. doi: 10.1002/sim.6026. Epub 2013 Oct 16.
4
Distribution-free models for longitudinal count responses with overdispersion and structural zeros.
Stat Med. 2013 Jun 30;32(14):2390-405. doi: 10.1002/sim.5691. Epub 2012 Dec 12.
5
Modeling Count Outcomes from HIV Risk Reduction Interventions: A Comparison of Competing Statistical Models for Count Responses.
AIDS Res Treat. 2012;2012:593569. doi: 10.1155/2012/593569. Epub 2012 Mar 25.
6
Motivational and skills training HIV/sexually transmitted infection sexual risk reduction groups for men.
J Subst Abuse Treat. 2009 Sep;37(2):138-50. doi: 10.1016/j.jsat.2008.11.008. Epub 2009 Jan 15.
7
The effect of a major cigarette price change on smoking behavior in California: a zero-inflated negative binomial model.
Health Econ. 2004 Aug;13(8):781-91. doi: 10.1002/hec.849.
8
Analysis of data with excess zeros.
Stat Methods Med Res. 2002 Aug;11(4):297-302. doi: 10.1191/0962280202sm289ra.
9
Zero-inflated models for regression analysis of count data: a study of growth and development.
Stat Med. 2002 May 30;21(10):1461-9. doi: 10.1002/sim.1088.
10
Zero-inflated Poisson and binomial regression with random effects: a case study.
Biometrics. 2000 Dec;56(4):1030-9. doi: 10.1111/j.0006-341x.2000.01030.x.