


On testing for homogeneity with zero-inflated models through the lens of model misspecification.

Authors

Hsu Wei-Wen, Mawella Nadeesha R, Todem David

Affiliations

Department of Statistics, Kansas State University, Manhattan, KS 66506, USA.

Department of Mathematics and Statistics, University of Missouri-Kansas City, Kansas City, MO 64110, USA.

Publication information

Int Stat Rev. 2022 Apr;90(1):62-77. doi: 10.1111/insr.12462. Epub 2021 Jul 5.

DOI:10.1111/insr.12462
PMID:35601991
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9122237/

Abstract

In many applications of two-component mixture models such as the popular zero-inflated model for discrete-valued data, it is customary for the data analyst to evaluate the inherent heterogeneity in view of observed data. To this end, the score test, acclaimed for its simplicity, is routinely performed. It has long been recognized that this test may behave erratically under model misspecification, but the implications of this behavior remain poorly understood for popular two-component mixture models. For the special case of zero-inflated count models, we use data simulations and theoretical arguments to evaluate this behavior and discuss its implications in settings where the working model is restrictive with regard to the true data generating mechanism. We enrich this discussion with an analysis of count data in HIV research, where a one-component model is shown to fit the data reasonably well despite apparent extra zeros. These results suggest that a rejection of homogeneity does not imply that the underlying mixture model is appropriate. Rather, such a rejection simply implies that the mixture model should be carefully interpreted in the light of potential model misspecifications, and further evaluated against other competing models.
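
The abstract's central point can be illustrated with a small simulation. The sketch below is not the authors' code. It assumes the intercept-only score test for zero inflation in a Poisson model (the form usually attributed to van den Broek, 1995) and applies it to counts drawn from a one-component negative binomial distribution, which has no structural zeros. The function names, sample size, and parameter values are hypothetical choices made for illustration.

```python
import math
import random


def zip_score_test(counts):
    """Score test of H0: no zero inflation, against a zero-inflated
    Poisson alternative without covariates (assumed working model)."""
    n = len(counts)
    mean = sum(counts) / n              # Poisson MLE of the rate under H0
    if mean == 0:
        raise ValueError("all counts are zero; the statistic is undefined")
    p0 = math.exp(-mean)                # P(Y = 0) under the fitted Poisson
    n_zeros = sum(1 for y in counts if y == 0)
    numerator = (n_zeros / p0 - n) ** 2
    denominator = n * (1 - p0) / p0 - n * mean
    stat = numerator / denominator
    # Upper tail of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


def sample_poisson(rng, rate):
    """Draw one Poisson variate with Knuth's multiplication method."""
    limit = math.exp(-rate)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def sample_negative_binomial(rng, mean, shape):
    """Draw one negative binomial variate as a gamma-Poisson mixture.
    This is a single overdispersed component, not a zero-inflated mixture."""
    rate = rng.gammavariate(shape, mean / shape)
    return sample_poisson(rng, rate)


def demo(seed=2022, n=500, mean=2.0, shape=1.0):
    """Apply the Poisson-based score test to correctly and incorrectly
    specified data (illustrative settings only)."""
    rng = random.Random(seed)
    poisson_data = [sample_poisson(rng, mean) for _ in range(n)]
    nb_data = [sample_negative_binomial(rng, mean, shape) for _ in range(n)]
    return {
        "poisson": zip_score_test(poisson_data),
        "negative_binomial": zip_score_test(nb_data),
    }


if __name__ == "__main__":
    for name, (stat, p_value) in demo().items():
        print(f"{name}: statistic={stat:.2f}, p-value={p_value:.3g}")
```

Under these assumed settings the negative binomial sample typically leads to a decisive rejection of homogeneity, because overdispersion alone produces more zeros than a Poisson fit expects. The rejection therefore reflects misspecification of the working Poisson model, not evidence that a zero-inflated mixture generated the data.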
