• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于分析过度分散和零修正计数数据的新回归模型。

A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data.

作者信息

Bertoli Wesley, Conceição Katiane S, Andrade Marinho G, Louzada Francisco

机构信息

Department of Statistics, Federal University of Technology, Paraná, Av. Sete de Setembro, 3165 Rebouças, Curitiba 80230-901, PR, Brazil.

Department of Applied Mathematics and Statistics, Institute of Mathematical and Computer Sciences, University of São Paulo, Av. Trab. São Carlense, 400 Parque Arnold Schimidt, São Carlos 13566-590, SP, Brazil.

出版信息

Entropy (Basel). 2021 May 21;23(6):646. doi: 10.3390/e23060646.

DOI:10.3390/e23060646
PMID:34064281
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8224290/
Abstract

Count datasets are traditionally analyzed using the ordinary Poisson distribution. However, said model has its applicability limited, as it can be somewhat restrictive to handling specific data structures. In this case, the need arises for obtaining alternative models that accommodate, for example, overdispersion and zero modification (inflation/deflation at the frequency of zeros). In practical terms, these are the most prevalent structures ruling the nature of discrete phenomena nowadays. Hence, this paper's primary goal was to jointly address these issues by deriving a fixed-effects regression model based on the hurdle version of the Poisson-Sujatha distribution. In this framework, the zero modification is incorporated by considering that a binary probability model determines which outcomes are zero-valued, and a zero-truncated process is responsible for generating positive observations. Posterior inferences for the model parameters were obtained from a fully Bayesian approach based on the g-prior method. Intensive Monte Carlo simulation studies were performed to assess the Bayesian estimators' empirical properties, and the obtained results have been discussed. The proposed model was considered for analyzing a real dataset, and its competitiveness regarding some well-established fixed-effects models for count data was evaluated. A sensitivity analysis to detect observations that may impact parameter estimates was performed based on standard divergence measures. The Bayesian -value and the randomized quantile residuals were considered for the task of model validation.

摘要

传统上,计数数据集使用普通泊松分布进行分析。然而,该模型的适用性有限,因为它在处理特定数据结构时可能具有一定的局限性。在这种情况下,就需要获得替代模型,例如能够处理过度分散和零修正(零频率处的膨胀/收缩)的模型。实际上,这些是当今支配离散现象本质的最普遍结构。因此,本文的主要目标是通过推导基于泊松 - 苏贾塔分布障碍版本的固定效应回归模型来共同解决这些问题。在此框架下,通过考虑二元概率模型确定哪些结果为零值,并由零截断过程负责生成正观测值来纳入零修正。模型参数的后验推断是通过基于g先验方法的全贝叶斯方法获得的。进行了密集的蒙特卡罗模拟研究以评估贝叶斯估计量的经验性质,并对所得结果进行了讨论。考虑使用所提出的模型分析一个真实数据集,并评估其相对于一些成熟的计数数据固定效应模型的竞争力。基于标准散度度量进行了敏感性分析,以检测可能影响参数估计的观测值。考虑使用贝叶斯p值和随机分位数残差进行模型验证任务。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/dacba7a9a795/entropy-23-00646-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/9b5f22381472/entropy-23-00646-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/9386f69cc15b/entropy-23-00646-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/c5d896e6ccbf/entropy-23-00646-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/98a43ecdcf84/entropy-23-00646-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/d89580b030b8/entropy-23-00646-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/2ba03eb7043b/entropy-23-00646-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/dacba7a9a795/entropy-23-00646-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/9b5f22381472/entropy-23-00646-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/9386f69cc15b/entropy-23-00646-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/c5d896e6ccbf/entropy-23-00646-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/98a43ecdcf84/entropy-23-00646-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/d89580b030b8/entropy-23-00646-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/2ba03eb7043b/entropy-23-00646-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcb5/8224290/dacba7a9a795/entropy-23-00646-g007.jpg

相似文献

1
A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data.一种用于分析过度分散和零修正计数数据的新回归模型。
Entropy (Basel). 2021 May 21;23(6):646. doi: 10.3390/e23060646.
2
A new mixed-effects regression model for the analysis of zero-modified hierarchical count data.用于分析零修正层次计数数据的新混合效应回归模型。
Biom J. 2021 Jan;63(1):81-104. doi: 10.1002/bimj.202000046. Epub 2020 Oct 19.
3
Marginalized multilevel hurdle and zero-inflated models for overdispersed and correlated count data with excess zeros.用于具有过多零值的过度分散和相关计数数据的边缘化多级障碍模型和零膨胀模型。
Stat Med. 2014 Nov 10;33(25):4402-19. doi: 10.1002/sim.6237. Epub 2014 Jun 23.
4
A comparison of statistical methods for modeling count data with an application to hospital length of stay.一种用于对计数数据建模的统计方法比较及其在住院时间中的应用。
BMC Med Res Methodol. 2022 Aug 4;22(1):211. doi: 10.1186/s12874-022-01685-8.
5
Multilevel modeling in single-case studies with zero-inflated and overdispersed count data.零膨胀和过离散计数数据的单病例研究中的多层次建模。
Behav Res Methods. 2024 Apr;56(4):2765-2781. doi: 10.3758/s13428-024-02359-7. Epub 2024 Feb 21.
6
Zero-modified Poisson model: Bayesian approach, influence diagnostics, and an application to a Brazilian leptospirosis notification data.零修正泊松模型:贝叶斯方法、影响诊断及其在巴西钩端螺旋体病通报数据中的应用
Biom J. 2013 Sep;55(5):661-78. doi: 10.1002/bimj.201100175. Epub 2013 Apr 8.
7
A Bayesian approach for analyzing zero-inflated clustered count data with dispersion.贝叶斯方法分析具有离散度的零膨胀聚类计数数据。
Stat Med. 2018 Feb 28;37(5):801-812. doi: 10.1002/sim.7541. Epub 2017 Nov 6.
8
Models for Zero-Inflated and Overdispersed Correlated Count Data: An Application to Cigarette Use.零膨胀和过离散相关计数数据模型:在吸烟中的应用。
Nicotine Tob Res. 2023 Apr 6;25(5):996-1003. doi: 10.1093/ntr/ntac253.
9
Models for analyzing zero-inflated and overdispersed count data: an application to cigarette and marijuana use.用于分析零膨胀和过度分散计数数据的模型:在香烟和大麻使用中的应用。
Nicotine Tob Res. 2018 Apr 18;22(8):1390-8. doi: 10.1093/ntr/nty072.
10
Spatiotemporal hurdle models for zero-inflated count data: Exploring trends in emergency department visits.用于零膨胀计数数据的时空障碍模型:探索急诊科就诊趋势
Stat Methods Med Res. 2016 Dec;25(6):2558-2576. doi: 10.1177/0962280214527079. Epub 2014 Mar 28.

引用本文的文献

1
Determinants of sexually transmitted infections among female sex workers in Ethiopia: a count regression model approach.埃塞俄比亚女性性工作者中性传播感染的决定因素:计数回归模型方法。
Front Public Health. 2023 Aug 4;11:1190085. doi: 10.3389/fpubh.2023.1190085. eCollection 2023.

本文引用的文献

1
A weighted negative binomial Lindley distribution with applications to dispersed data.一种加权负二项式林德利分布及其在离散数据中的应用。
An Acad Bras Cienc. 2018 Jul-Sep;90(3):2617-2642. doi: 10.1590/0001-3765201820170733.
2
AN EXACT GOODNESS-OF-FIT TEST BASED ON THE OCCUPANCY PROBLEMS TO STUDY ZERO-INFLATION AND ZERO-DEFLATION IN BIOLOGICAL DOSIMETRY DATA.一种基于占位问题的精确拟合优度检验,用于研究生物剂量学数据中的零膨胀和零收缩现象。
Radiat Prot Dosimetry. 2018 Jun 1;179(4):317-326. doi: 10.1093/rpd/ncx285.
3
Zero-inflated regression models for radiation-induced chromosome aberration data: A comparative study.
辐射诱导染色体畸变数据的零膨胀回归模型:一项比较研究。
Biom J. 2016 Mar;58(2):259-79. doi: 10.1002/bimj.201400233. Epub 2015 Oct 13.
4
Zero-modified Poisson model: Bayesian approach, influence diagnostics, and an application to a Brazilian leptospirosis notification data.零修正泊松模型:贝叶斯方法、影响诊断及其在巴西钩端螺旋体病通报数据中的应用
Biom J. 2013 Sep;55(5):661-78. doi: 10.1002/bimj.201100175. Epub 2013 Apr 8.
5
Improved base-calling and quality scores for 454 sequencing based on a Hurdle Poisson model.基于障碍泊松模型提高 454 测序的碱基调用和质量评分。
BMC Bioinformatics. 2012 Nov 15;13:303. doi: 10.1186/1471-2105-13-303.
6
Zero-inflated and hurdle models of count data with extra zeros: examples from an HIV-risk reduction intervention trial.带有额外零值的计数数据的零膨胀和障碍模型:来自 HIV 风险降低干预试验的实例。
Am J Drug Alcohol Abuse. 2011 Sep;37(5):367-75. doi: 10.3109/00952990.2011.597280.
7
An Information Matrix Prior for Bayesian Analysis in Generalized Linear Models with High Dimensional Data.高维数据广义线性模型贝叶斯分析中的信息矩阵先验
Stat Sin. 2009;19(4):1641-1663.
8
Bayesian case influence diagnostics for survival models.生存模型的贝叶斯案例影响诊断
Biometrics. 2009 Mar;65(1):116-24. doi: 10.1111/j.1541-0420.2008.01037.x. Epub 2008 Apr 18.
9
Chromosome aberration analysis and the influence of mitotic delay after simulated partial-body exposure with high doses of sparsely and densely ionising radiation.高剂量稀疏和密集电离辐射模拟局部身体照射后的染色体畸变分析及有丝分裂延迟的影响
Radiat Environ Biophys. 2006 May;45(1):45-54. doi: 10.1007/s00411-006-0036-5. Epub 2006 Mar 25.
10
A zero-inflated Poisson model of migration frequency.迁移频率的零膨胀泊松模型。
Int Reg Sci Rev. 1996;19(3):211-22. doi: 10.1177/016001769601900302.