• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

计数结局中过度零的统计模型性能的模拟研究。

A simulation study of the performance of statistical models for count outcomes with excessive zeros.

机构信息

Department of Population and Community Health, University of North Texas Health Science Center, Fort Worth, Texas, USA.

Norden Lofts, White Plains, New York, USA.

出版信息

Stat Med. 2024 Oct 30;43(24):4752-4767. doi: 10.1002/sim.10198. Epub 2024 Aug 28.

DOI:10.1002/sim.10198
PMID:39193779
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11483204/
Abstract

BACKGROUND

Outcome measures that are count variables with excessive zeros are common in health behaviors research. Examples include the number of standard drinks consumed or alcohol-related problems experienced over time. There is a lack of empirical data about the relative performance of prevailing statistical models for assessing the efficacy of interventions when outcomes are zero-inflated, particularly compared with recently developed marginalized count regression approaches for such data.

METHODS

The current simulation study examined five commonly used approaches for analyzing count outcomes, including two linear models (with outcomes on raw and log-transformed scales, respectively) and three prevailing count distribution-based models (ie, Poisson, negative binomial, and zero-inflated Poisson (ZIP) models). We also considered the marginalized zero-inflated Poisson (MZIP) model, a novel alternative that estimates the overall effects on the population mean while adjusting for zero-inflation. Motivated by alcohol misuse prevention trials, extensive simulations were conducted to evaluate and compare the statistical power and Type I error rate of the statistical models and approaches across data conditions that varied in sample size ( to 500), zero rate (0.2 to 0.8), and intervention effect sizes.

RESULTS

Under zero-inflation, the Poisson model failed to control the Type I error rate, resulting in higher than expected false positive results. When the intervention effects on the zero (vs. non-zero) and count parts were in the same direction, the MZIP model had the highest statistical power, followed by the linear model with outcomes on the raw scale, negative binomial model, and ZIP model. The performance of the linear model with a log-transformed outcome variable was unsatisfactory.

CONCLUSIONS

The MZIP model demonstrated better statistical properties in detecting true intervention effects and controlling false positive results for zero-inflated count outcomes. This MZIP model may serve as an appealing analytical approach to evaluating overall intervention effects in studies with count outcomes marked by excessive zeros.

摘要

背景

在健康行为研究中,结果测量是具有过多零值的计数变量是很常见的。例如,随着时间的推移,消耗的标准饮料数量或经历的与酒精相关的问题。当结果为零膨胀时,缺乏关于流行的统计模型评估干预效果的相对性能的经验数据,特别是与最近为这种数据开发的边缘化计数回归方法相比。

方法

本研究通过模拟研究考察了分析计数结果的五种常用方法,包括两种线性模型(分别基于原始和对数转换的结果)和三种基于流行的计数分布的模型(即泊松、负二项和零膨胀泊松(ZIP)模型)。我们还考虑了边缘化零膨胀泊松(MZIP)模型,这是一种新颖的替代方法,可在调整零膨胀的同时估计对总体平均值的总体影响。受酒精滥用预防试验的启发,进行了广泛的模拟,以评估和比较在不同数据条件下,各种统计模型和方法的统计功效和 I 型错误率,数据条件包括样本量(500 至 500)、零率(0.2 至 0.8)和干预效果大小。

结果

在零膨胀下,泊松模型未能控制 I 型错误率,导致高于预期的假阳性结果。当干预对零(与非零)和计数部分的影响方向相同时,MZIP 模型具有最高的统计功效,其次是原始尺度结果的线性模型、负二项式模型和 ZIP 模型。对数转换结果变量的线性模型表现不佳。

结论

MZIP 模型在检测真实干预效果和控制零膨胀计数结果的假阳性结果方面表现出更好的统计特性。对于计数结果标记有过多零的研究,MZIP 模型可能是一种有吸引力的分析方法,可以评估总体干预效果。

相似文献

1
A simulation study of the performance of statistical models for count outcomes with excessive zeros.计数结局中过度零的统计模型性能的模拟研究。
Stat Med. 2024 Oct 30;43(24):4752-4767. doi: 10.1002/sim.10198. Epub 2024 Aug 28.
2
A comparison of statistical methods for modeling count data with an application to hospital length of stay.一种用于对计数数据建模的统计方法比较及其在住院时间中的应用。
BMC Med Res Methodol. 2022 Aug 4;22(1):211. doi: 10.1186/s12874-022-01685-8.
3
Sample size calculation for cluster randomized trials with zero-inflated count outcomes.群组随机试验中具有零膨胀计数结果的样本量计算。
Stat Med. 2022 May 30;41(12):2191-2204. doi: 10.1002/sim.9350. Epub 2022 Feb 9.
4
Evaluation of negative binomial and zero-inflated negative binomial models for the analysis of zero-inflated count data: application to the telemedicine for children with medical complexity trial.零膨胀计数数据的负二项式和零膨胀负二项式模型评估:在医疗复杂性儿童远程医疗试验中的应用。
Trials. 2023 Sep 27;24(1):613. doi: 10.1186/s13063-023-07648-8.
5
Multilevel modeling in single-case studies with zero-inflated and overdispersed count data.零膨胀和过离散计数数据的单病例研究中的多层次建模。
Behav Res Methods. 2024 Apr;56(4):2765-2781. doi: 10.3758/s13428-024-02359-7. Epub 2024 Feb 21.
6
On performance of parametric and distribution-free models for zero-inflated and over-dispersed count responses.关于零膨胀和过度分散计数响应的参数模型和非参数模型的性能。
Stat Med. 2015 Oct 30;34(24):3235-45. doi: 10.1002/sim.6560. Epub 2015 Jun 15.
7
Predicting the multivariate zero-inflated counts: A novel model averaging method under Pearson loss.预测多元零膨胀计数:皮尔逊损失下的一种新的模型平均方法。
Stat Med. 2024 May 20;43(11):2096-2121. doi: 10.1002/sim.10052. Epub 2024 Mar 15.
8
Marginalized zero-inflated Poisson models with missing covariates.具有缺失协变量的边缘化零膨胀泊松模型。
Biom J. 2018 Jul;60(4):845-858. doi: 10.1002/bimj.201600249. Epub 2018 May 11.
9
Testing inflated zeros in binomial regression models.检验二项回归模型中的膨胀零值。
Biom J. 2021 Jan;63(1):59-80. doi: 10.1002/bimj.202000028. Epub 2020 Sep 23.
10
A GEE-type approach to untangle structural and random zeros in predictors.一种基于广义估计方程(GEE)的方法,用于解决预测变量中的结构零和随机零问题。
Stat Methods Med Res. 2019 Dec;28(12):3683-3696. doi: 10.1177/0962280218812228. Epub 2018 Nov 26.

引用本文的文献

1
Testing Theory-Enhanced Messaging to Promote COVID-19 Vaccination: Results from a Randomized Controlled Trial.测试理论强化信息以促进新冠疫苗接种:一项随机对照试验的结果
Res Sq. 2025 May 2:rs.3.rs-6472442. doi: 10.21203/rs.3.rs-6472442/v1.
2
The selection of statistical models for reporting count outcomes and intervention effects in brief alcohol intervention trials: A review and recommendations.简短酒精干预试验中用于报告计数结果和干预效果的统计模型选择:综述与建议
Alcohol Clin Exp Res (Hoboken). 2024 Jan;48(1):16-28. doi: 10.1111/acer.15232. Epub 2023 Dec 6.
3
Which is Better for Individual Participant Data Meta-Analysis of Zero-Inflated Count Outcomes, One-Step or Two-Step Analysis? A Simulation Study.

本文引用的文献

1
The selection of statistical models for reporting count outcomes and intervention effects in brief alcohol intervention trials: A review and recommendations.简短酒精干预试验中用于报告计数结果和干预效果的统计模型选择:综述与建议
Alcohol Clin Exp Res (Hoboken). 2024 Jan;48(1):16-28. doi: 10.1111/acer.15232. Epub 2023 Dec 6.
2
Phases of methodological research in biostatistics-Building the evidence base for new methods.生物统计学方法研究的阶段——为新方法构建证据基础。
Biom J. 2024 Jan;66(1):e2200222. doi: 10.1002/bimj.202200222. Epub 2023 Feb 3.
3
Brief Alcohol Interventions are Effective through 6 Months: Findings from Marginalized Zero-inflated Poisson and Negative Binomial Models in a Two-step IPD Meta-analysis.
零膨胀计数结局的个体参与者数据荟萃分析中,一步法和两步法哪个更好?一项模拟研究。
Multivariate Behav Res. 2023 Nov-Dec;58(6):1090-1105. doi: 10.1080/00273171.2023.2173135. Epub 2023 Mar 23.
边缘零膨胀泊松和负二项模型两步 IPD 荟萃分析发现,短期酒精干预在 6 个月内有效。
Prev Sci. 2023 Nov;24(8):1608-1621. doi: 10.1007/s11121-022-01420-1. Epub 2022 Aug 17.
4
Sample size calculation for cluster randomized trials with zero-inflated count outcomes.群组随机试验中具有零膨胀计数结果的样本量计算。
Stat Med. 2022 May 30;41(12):2191-2204. doi: 10.1002/sim.9350. Epub 2022 Feb 9.
5
A bias correction method in meta-analysis of randomized clinical trials with no adjustments for zero-inflated outcomes.Meta 分析中对零膨胀结局未调整的随机临床试验的偏倚校正方法。
Stat Med. 2021 Nov 20;40(26):5894-5909. doi: 10.1002/sim.9161. Epub 2021 Sep 3.
6
A tutorial on individual participant data meta-analysis using Bayesian multilevel modeling to estimate alcohol intervention effects across heterogeneous studies.使用贝叶斯多层次建模对个体参与者数据进行元分析教程,以估计跨异质研究的酒精干预效果。
Addict Behav. 2019 Jul;94:162-170. doi: 10.1016/j.addbeh.2019.01.032. Epub 2019 Jan 23.
7
Using simulation studies to evaluate statistical methods.运用模拟研究评估统计方法。
Stat Med. 2019 May 20;38(11):2074-2102. doi: 10.1002/sim.8086. Epub 2019 Jan 16.
8
Towards evidence-based computational statistics: lessons from clinical research on the role and design of real-data benchmark studies.迈向基于证据的计算统计学:从临床研究中汲取关于真实数据基准研究作用和设计的经验教训。
BMC Med Res Methodol. 2017 Sep 9;17(1):138. doi: 10.1186/s12874-017-0417-2.
9
Matching the Statistical Model to the Research Question for Dental Caries Indices with Many Zero Counts.针对具有大量零计数的龋齿指数,将统计模型与研究问题相匹配。
Caries Res. 2017;51(3):198-208. doi: 10.1159/000452675. Epub 2017 Mar 15.
10
Brief motivational interventions for college student drinking may not be as powerful as we think: an individual participant-level data meta-analysis.针对大学生饮酒问题的简短动机干预可能并不像我们认为的那样有效:一项个体参与者层面数据的荟萃分析。
Alcohol Clin Exp Res. 2015 May;39(5):919-31. doi: 10.1111/acer.12714.