• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

理清统计建模中的结构零和随机零。

Untangle the Structural and Random Zeros in Statistical Modelings.

作者信息

Tang W, He H, Wang W J, Chen D G

机构信息

Department of Global Biostatistics & Data Science, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA70122, USA.

Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA70122, USA.

出版信息

J Appl Stat. 2018;45(9):1714-1733. doi: 10.1080/02664763.2017.1391180. Epub 2017 Oct 24.

DOI:10.1080/02664763.2017.1391180
PMID:30906098
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6426322/
Abstract

Count data with structural zeros are common in public health applications. There are considerable researches focusing on zero-inflated models such as zero-inflated Poisson (ZIP) and zero-inflated Negative Binomial (ZINB) models for such zero-inflated count data when used as response variable. However, when such variables are used as predictors, the difference between structural and random zeros is often ignored and may result in biased estimates. One remedy is to include an indicator of the structural zero in the model as a predictor if observed. However, structural zeros are often not observed in practice, in which case no statistical method is available to address the bias issue. This paper is aimed to fill this methodological gap by developing parametric methods to model zero-inflated count data when used as predictors based on the maximum likelihood approach. The response variable can be any type of data including continuous, binary, count or even zero-inflated count responses. Simulation studies are performed to assess the numerical performance of this new approach when sample size is small to moderate. A real data example is also used to demonstrate the application of this method.

摘要

在公共卫生应用中,带有结构零的计数数据很常见。当此类零膨胀计数数据用作响应变量时,有大量研究聚焦于零膨胀模型,如零膨胀泊松(ZIP)模型和零膨胀负二项式(ZINB)模型。然而,当此类变量用作预测变量时,结构零和随机零之间的差异常常被忽略,这可能导致估计有偏差。一种补救方法是,如果观察到结构零,就在模型中纳入一个结构零的指标作为预测变量。然而,在实际中结构零往往无法观察到,在这种情况下,没有统计方法可用于解决偏差问题。本文旨在通过基于最大似然法开发参数方法来对用作预测变量的零膨胀计数数据进行建模,以填补这一方法学空白。响应变量可以是任何类型的数据,包括连续数据、二元数据、计数数据,甚至是零膨胀计数响应数据。进行模拟研究以评估当样本量从小到中等时这种新方法的数值性能。还使用了一个实际数据示例来展示该方法的应用。

相似文献

1
Untangle the Structural and Random Zeros in Statistical Modelings.理清统计建模中的结构零和随机零。
J Appl Stat. 2018;45(9):1714-1733. doi: 10.1080/02664763.2017.1391180. Epub 2017 Oct 24.
2
A GEE-type approach to untangle structural and random zeros in predictors.一种基于广义估计方程(GEE)的方法,用于解决预测变量中的结构零和随机零问题。
Stat Methods Med Res. 2019 Dec;28(12):3683-3696. doi: 10.1177/0962280218812228. Epub 2018 Nov 26.
3
On performance of parametric and distribution-free models for zero-inflated and over-dispersed count responses.关于零膨胀和过度分散计数响应的参数模型和非参数模型的性能。
Stat Med. 2015 Oct 30;34(24):3235-45. doi: 10.1002/sim.6560. Epub 2015 Jun 15.
4
Distribution-free Inference of Zero-inated Binomial Data for Longitudinal Studies.纵向研究中零膨胀二项式数据的无分布推断
J Appl Stat. 2015 Oct 1;42(10):2203-2219. doi: 10.1080/02664763.2015.1023270. Epub 2015 Mar 18.
5
Multilevel modeling in single-case studies with zero-inflated and overdispersed count data.零膨胀和过离散计数数据的单病例研究中的多层次建模。
Behav Res Methods. 2024 Apr;56(4):2765-2781. doi: 10.3758/s13428-024-02359-7. Epub 2024 Feb 21.
6
A comparison of statistical methods for modeling count data with an application to hospital length of stay.一种用于对计数数据建模的统计方法比较及其在住院时间中的应用。
BMC Med Res Methodol. 2022 Aug 4;22(1):211. doi: 10.1186/s12874-022-01685-8.
7
A semiparametric marginalized zero-inflated model for analyzing healthcare utilization panel data with missingness.一种用于分析存在缺失值的医疗保健利用面板数据的半参数边际零膨胀模型。
J Appl Stat. 2019;46(16):2862-2883. doi: 10.1080/02664763.2019.1620705. Epub 2019 May 22.
8
A simulation study of the performance of statistical models for count outcomes with excessive zeros.计数结局中过度零的统计模型性能的模拟研究。
Stat Med. 2024 Oct 30;43(24):4752-4767. doi: 10.1002/sim.10198. Epub 2024 Aug 28.
9
Poisson, Poisson-gamma and zero-inflated regression models of motor vehicle crashes: balancing statistical fit and theory.机动车碰撞事故的泊松、泊松-伽马和零膨胀回归模型:平衡统计拟合与理论
Accid Anal Prev. 2005 Jan;37(1):35-46. doi: 10.1016/j.aap.2004.02.004.
10
Bivariate zero-inflated regression for count data: a Bayesian approach with application to plant counts.计数数据的双变量零膨胀回归:一种贝叶斯方法及其在植物计数中的应用
Int J Biostat. 2010;6(1):Article 27. doi: 10.2202/1557-4679.1229.

引用本文的文献

1
Prediction Modeling With Many Correlated and Zero-Inflated Predictors: Assessing the Nonnegative Garrote Approach.具有多个相关和零膨胀预测变量的预测建模:评估非负约束岭回归方法。
Stat Med. 2025 Apr;44(8-9):e70062. doi: 10.1002/sim.70062.
2
Endometrial resection and ablation versus hysterectomy for heavy menstrual bleeding.子宫内膜切除术和消融术与子宫切除术治疗月经过多。
Cochrane Database Syst Rev. 2021 Feb 23;2(2):CD000329. doi: 10.1002/14651858.CD000329.pub4.
3
Progestogens or progestogen-releasing intrauterine systems for uterine fibroids (other than preoperative medical therapy).用于子宫肌瘤的孕激素或释放孕激素的宫内节育系统(术前药物治疗除外)。
Cochrane Database Syst Rev. 2020 Nov 23;11(11):CD008994. doi: 10.1002/14651858.CD008994.pub3.
4
A GEE-type approach to untangle structural and random zeros in predictors.一种基于广义估计方程(GEE)的方法,用于解决预测变量中的结构零和随机零问题。
Stat Methods Med Res. 2019 Dec;28(12):3683-3696. doi: 10.1177/0962280218812228. Epub 2018 Nov 26.

本文引用的文献

1
On the implication of structural zeros as independent variables in regression analysis: applications to alcohol research.关于回归分析中作为自变量的结构零的含义:在酒精研究中的应用
J Data Sci. 2014 Jul;12(3):439-460.
2
Testing a Model of Self-Management of Fluid Intake in Community-Residing Long-term Indwelling Urinary Catheter Users.对社区居住的长期留置导尿管使用者液体摄入自我管理模式的测试
Nurs Res. 2016 Mar-Apr;65(2):97-106. doi: 10.1097/NNR.0000000000000140.
3
Distribution-free models for longitudinal count responses with overdispersion and structural zeros.具有过离散和结构零的纵向计数响应的无分布模型。
Stat Med. 2013 Jun 30;32(14):2390-405. doi: 10.1002/sim.5691. Epub 2012 Dec 12.
4
Alcohol, conscientiousness and event-level condom use.酒精、尽责性与事件级别的 condom 使用。
Br J Health Psychol. 2011 Nov;16(4):828-45. doi: 10.1111/j.2044-8287.2011.02019.x. Epub 2011 Mar 31.
5
New variable selection methods for zero-inflated count data with applications to the substance abuse field.带有应用于物质滥用领域的零膨胀计数数据的新变量选择方法。
Stat Med. 2011 Aug 15;30(18):2326-40. doi: 10.1002/sim.4268. Epub 2011 May 12.
6
Randomized trials of alcohol-use interventions with college students and their parents: lessons from the Transitions Project.大学生及其父母的酒精使用干预措施的随机试验:来自“过渡项目”的经验教训。
Clin Trials. 2011 Apr;8(2):205-13. doi: 10.1177/1740774510396387. Epub 2011 Jan 26.
7
Alcohol outlet density, levels of drinking and alcohol-related harm in New Zealand: a national study.新西兰的酒吧密度、饮酒水平和与酒精相关的伤害:一项全国性研究。
J Epidemiol Community Health. 2011 Oct;65(10):841-6. doi: 10.1136/jech.2009.104935. Epub 2010 Oct 14.
8
Parental alcohol involvement and adolescent alcohol expectancies predict alcohol involvement in male adolescents.父母的酒精涉入情况和青少年对酒精的期望预测了男性青少年的酒精涉入情况。
Psychol Addict Behav. 2010 Sep;24(3):386-96. doi: 10.1037/a0019801.
9
When should clinicians switch treatments? An application of signal detection theory to two treatments for women with alcohol use disorders.何时临床医生应转换治疗方法?信号检测理论在两种酒精使用障碍女性治疗方法中的应用。
Behav Res Ther. 2010 Jun;48(6):524-30. doi: 10.1016/j.brat.2010.03.001. Epub 2010 Mar 7.
10
Motivational and skills training HIV/sexually transmitted infection sexual risk reduction groups for men.针对男性的艾滋病病毒/性传播感染性风险降低小组的动机与技能培训
J Subst Abuse Treat. 2009 Sep;37(2):138-50. doi: 10.1016/j.jsat.2008.11.008. Epub 2009 Jan 15.