• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结构零值和零膨胀模型。

Structural zeroes and zero-inflated models.

作者信息

He Hua, Tang Wan, Wang Wenjuan, Crits-Christoph Paul

机构信息

Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, NY, USA ; Veterans Integrated Service Network, Center of Excellence for Suicide Prevention, Canandaigua VA Medical Center, Canandaigua, NY, USA ; Department of Psychiatry, University of Rochester Medical Center, Rochester, NY, USA.

Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, NY, USA.

出版信息

Shanghai Arch Psychiatry. 2014 Aug;26(4):236-42. doi: 10.3969/j.issn.1002-0829.2014.04.008.

DOI:10.3969/j.issn.1002-0829.2014.04.008
PMID:25317011
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4194007/
Abstract

In psychosocial and behavioral studies count outcomes recording the frequencies of the occurrence of some health or behavior outcomes (such as the number of unprotected sexual behaviors during a period of time) often contain a preponderance of zeroes because of the presence of 'structural zeroes' that occur when some subjects are not at risk for the behavior of interest. Unlike random zeroes (responses that can be greater than zero, but are zero due to sampling variability), structural zeroes are usually very different, both statistically and clinically. False interpretations of results and study findings may result if differences in the two types of zeroes are ignored. However, in practice, the status of the structural zeroes is often not observed and this latent nature complicates the data analysis. In this article, we focus on one model, the zero-inflated Poisson (ZIP) regression model that is commonly used to address zero-inflated data. We first give a brief overview of the issues of structural zeroes and the ZIP model. We then given an illustration of ZIP with data from a study on HIV-risk sexual behaviors among adolescent girls. Sample codes in SAS and Stata are also included to help perform and explain ZIP analyses.

摘要

在社会心理和行为学研究中,计数结果记录某些健康或行为结果的发生频率(例如一段时间内无保护性行为的次数),由于存在“结构零值”,往往包含大量零值。当一些受试者没有发生所关注行为的风险时,就会出现结构零值。与随机零值(响应值可以大于零,但由于抽样变异性而为零)不同,结构零值在统计学和临床上通常有很大差异。如果忽略这两种零值的差异,可能会导致对结果和研究发现的错误解读。然而,在实际中,结构零值的情况往往未被观察到,这种潜在性质使数据分析变得复杂。在本文中,我们聚焦于一种模型,即零膨胀泊松(ZIP)回归模型,它常用于处理零膨胀数据。我们首先简要概述结构零值问题和ZIP模型。然后用一项关于少女艾滋风险性行为研究的数据对ZIP模型进行说明。还包括SAS和Stata中的示例代码,以帮助进行和解释ZIP分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/145c/4194007/f668a16e5fd4/sap-26-04-236-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/145c/4194007/f139b9a1a10b/sap-26-04-236-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/145c/4194007/f668a16e5fd4/sap-26-04-236-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/145c/4194007/f139b9a1a10b/sap-26-04-236-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/145c/4194007/f668a16e5fd4/sap-26-04-236-g003.jpg

相似文献

1
Structural zeroes and zero-inflated models.结构零值和零膨胀模型。
Shanghai Arch Psychiatry. 2014 Aug;26(4):236-42. doi: 10.3969/j.issn.1002-0829.2014.04.008.
2
On the use of zero-inflated and hurdle models for modeling vaccine adverse event count data.关于使用零膨胀模型和障碍模型对疫苗不良事件计数数据进行建模
J Biopharm Stat. 2006;16(4):463-81. doi: 10.1080/10543400600719384.
3
A Marginalized Zero-inflated Poisson Regression Model with Random Effects.一种具有随机效应的边缘化零膨胀泊松回归模型。
J R Stat Soc Ser C Appl Stat. 2015 Nov;64(5):815-830. doi: 10.1111/rssc.12104. Epub 2015 Apr 30.
4
A new modified biased estimator for Zero inflated Poisson regression model.一种用于零膨胀泊松回归模型的新型修正偏差估计器。
Heliyon. 2024 Jan 12;10(3):e24225. doi: 10.1016/j.heliyon.2024.e24225. eCollection 2024 Feb 15.
5
EM Adaptive LASSO-A Multilocus Modeling Strategy for Detecting SNPs Associated with Zero-inflated Count Phenotypes.EM自适应套索法——一种用于检测与零膨胀计数表型相关单核苷酸多态性的多位点建模策略
Front Genet. 2016 Mar 30;7:32. doi: 10.3389/fgene.2016.00032. eCollection 2016.
6
Untangle the Structural and Random Zeros in Statistical Modelings.理清统计建模中的结构零和随机零。
J Appl Stat. 2018;45(9):1714-1733. doi: 10.1080/02664763.2017.1391180. Epub 2017 Oct 24.
7
A doubly-inflated Poisson regression for correlated count data.用于相关计数数据的双重充气泊松回归。
J Appl Stat. 2020 May 1;48(6):1111-1127. doi: 10.1080/02664763.2020.1757049. eCollection 2021.
8
A GEE-type approach to untangle structural and random zeros in predictors.一种基于广义估计方程(GEE)的方法,用于解决预测变量中的结构零和随机零问题。
Stat Methods Med Res. 2019 Dec;28(12):3683-3696. doi: 10.1177/0962280218812228. Epub 2018 Nov 26.
9
Random effect exponentiated-exponential geometric model for clustered/longitudinal zero-inflated count data.用于聚类/纵向零膨胀计数数据的随机效应指数化指数几何模型。
J Appl Stat. 2019 Dec 26;47(12):2272-2288. doi: 10.1080/02664763.2019.1706726. eCollection 2020.
10
What statistical method should be used to evaluate risk factors associated with dmfs index? Evidence from the National Pathfinder Survey of 4-year-old Italian children.应采用何种统计方法来评估与 dmfs 指数相关的危险因素?来自意大利 4 岁儿童国家探路者调查的证据。
Community Dent Oral Epidemiol. 2009 Dec;37(6):539-46. doi: 10.1111/j.1600-0528.2009.00500.x. Epub 2009 Oct 21.

引用本文的文献

1
BAYESIAN DIFFERENTIAL CAUSAL DIRECTED ACYCLIC GRAPHS FOR OBSERVATIONAL ZERO-INFLATED COUNTS WITH AN APPLICATION TO TWO-SAMPLE SINGLE-CELL DATA.用于观测零膨胀计数的贝叶斯差异因果有向无环图及其在双样本单细胞数据中的应用
Ann Appl Stat. 2025 Sep;19(3):1908-1930. doi: 10.1214/25-aoas2042. Epub 2025 Aug 28.
2
Model-Based Causal Discovery for Zero-Inflated Count Data.基于模型的零膨胀计数数据因果发现
J Mach Learn Res. 2023;24.
3
Untargeted metabolomics reveals anion and organ-specific metabolic responses of salinity tolerance in willow.

本文引用的文献

1
On the implication of structural zeros as independent variables in regression analysis: applications to alcohol research.关于回归分析中作为自变量的结构零的含义:在酒精研究中的应用
J Data Sci. 2014 Jul;12(3):439-460.
2
Distribution-free models for longitudinal count responses with overdispersion and structural zeros.具有过离散和结构零的纵向计数响应的无分布模型。
Stat Med. 2013 Jun 30;32(14):2390-405. doi: 10.1002/sim.5691. Epub 2012 Dec 12.
3
Modeling Count Outcomes from HIV Risk Reduction Interventions: A Comparison of Competing Statistical Models for Count Responses.
非靶向代谢组学揭示柳树耐盐性的阴离子和器官特异性代谢反应。
Plant J. 2025 Apr;122(1):e70160. doi: 10.1111/tpj.70160.
4
Modelling Count Data in Psychological Research: An Applied Tutorial.心理学研究中的计数数据建模:应用教程
Int J Psychol. 2025 Apr;60(2):e70018. doi: 10.1002/ijop.70018.
5
Modeling the determinants of cigarette consumption in Iranian households with children under 5 years of age using the Income and Expenditure Survey 2021.利用2021年家庭收支调查对伊朗有5岁以下儿童家庭的卷烟消费决定因素进行建模。
Arch Public Health. 2025 Jan 26;83(1):25. doi: 10.1186/s13690-024-01496-x.
6
Curvilinear incidence models for parity in the entire fertility range for cancers of the breast, ovary, and endometrium: A follow-up of the Norwegian 1960 Census.乳腺癌、卵巢癌和子宫内膜癌整个生育范围内产次的曲线发病率模型:挪威1960年人口普查随访研究
Int J Cancer. 2025 Jun 1;156(11):2118-2126. doi: 10.1002/ijc.35312. Epub 2025 Jan 3.
7
Predictors and number of antenatal care visits among reproductive age women in Sub-Saharan Africa further analysis of recent demographic and health survey from 2017-2023: Zero-inflated negative binomial regression.撒哈拉以南非洲育龄妇女产前保健就诊次数的预测因素及分析:2017-2023 年最新人口与健康调查的零膨胀负二项回归分析。
PLoS One. 2024 Oct 22;19(10):e0302297. doi: 10.1371/journal.pone.0302297. eCollection 2024.
8
COVID-19 in nursing homes: Geographic diffusion and regional risk factors from January 1 to July 26, 2020 of the pandemic.2020 年 1 月 1 日至 7 月 26 日,养老院中的 COVID-19:大流行期间的地理扩散和区域风险因素。
PLoS One. 2024 Aug 15;19(8):e0308339. doi: 10.1371/journal.pone.0308339. eCollection 2024.
9
Multilevel negative binomial analysis of factors associated with numbers of antenatal care contacts in low and middle income countries: Findings from 59 nationally representative datasets.多水平负二项分析与中低收入国家产前保健接触次数相关的因素:来自 59 个国家代表性数据集的结果。
PLoS One. 2024 Apr 18;19(4):e0301542. doi: 10.1371/journal.pone.0301542. eCollection 2024.
10
Three-Inflated Poisson Distribution and its Application in Suicide Cases of India During Covid-19 Pandemic.三充气泊松分布及其在新冠疫情期间印度自杀案例中的应用。
Ann Data Sci. 2022;9(5):1103-1127. doi: 10.1007/s40745-022-00372-1. Epub 2022 Mar 15.
对降低HIV风险干预措施的计数结果进行建模:计数响应的竞争统计模型比较
AIDS Res Treat. 2012;2012:593569. doi: 10.1155/2012/593569. Epub 2012 Mar 25.
4
Alcohol, conscientiousness and event-level condom use.酒精、尽责性与事件级别的 condom 使用。
Br J Health Psychol. 2011 Nov;16(4):828-45. doi: 10.1111/j.2044-8287.2011.02019.x. Epub 2011 Mar 31.
5
New variable selection methods for zero-inflated count data with applications to the substance abuse field.带有应用于物质滥用领域的零膨胀计数数据的新变量选择方法。
Stat Med. 2011 Aug 15;30(18):2326-40. doi: 10.1002/sim.4268. Epub 2011 May 12.
6
Effects of major depression on crack use and arrests among women in drug court.重大抑郁症对参与毒品法庭的女性使用可卡因和被捕的影响。
Addiction. 2011 Jul;106(7):1279-86. doi: 10.1111/j.1360-0443.2011.03389.x. Epub 2011 Apr 7.
7
Randomized trials of alcohol-use interventions with college students and their parents: lessons from the Transitions Project.大学生及其父母的酒精使用干预措施的随机试验:来自“过渡项目”的经验教训。
Clin Trials. 2011 Apr;8(2):205-13. doi: 10.1177/1740774510396387. Epub 2011 Jan 26.
8
Alcohol outlet density, levels of drinking and alcohol-related harm in New Zealand: a national study.新西兰的酒吧密度、饮酒水平和与酒精相关的伤害:一项全国性研究。
J Epidemiol Community Health. 2011 Oct;65(10):841-6. doi: 10.1136/jech.2009.104935. Epub 2010 Oct 14.
9
Risk behaviors among adolescent girls in an HIV prevention trial.一项艾滋病预防试验中少女的危险行为。
West J Nurs Res. 2011 Aug;33(5):690-711. doi: 10.1177/0193945910379220. Epub 2010 Oct 4.
10
Parental alcohol involvement and adolescent alcohol expectancies predict alcohol involvement in male adolescents.父母的酒精涉入情况和青少年对酒精的期望预测了男性青少年的酒精涉入情况。
Psychol Addict Behav. 2010 Sep;24(3):386-96. doi: 10.1037/a0019801.