• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

数据聚集对计数数据模型中离散估计的影响。

The effect of data aggregation on dispersion estimates in count data models.

机构信息

Department of Mathematical Sciences, Durham University, Durham, UK.

Bundesamt für Strahlenschutz (BfS), Oberschleissheim, Germany.

出版信息

Int J Biostat. 2021 May 7;18(1):183-202. doi: 10.1515/ijb-2020-0079.

DOI:10.1515/ijb-2020-0079
PMID:33962495
Abstract

For the modelling of count data, aggregation of the raw data over certain subgroups or predictor configurations is common practice. This is, for instance, the case for count data biomarkers of radiation exposure. Under the Poisson law, count data can be aggregated without loss of information on the Poisson parameter, which remains true if the Poisson assumption is relaxed towards quasi-Poisson. However, in biodosimetry in particular, but also beyond, the question of how the dispersion estimates for quasi-Poisson models behave under data aggregation have received little attention. Indeed, for real data sets featuring unexplained heterogeneities, dispersion estimates can increase strongly after aggregation, an effect which we will demonstrate and quantify explicitly for some scenarios. The increase in dispersion estimates implies an inflation of the parameter standard errors, which, however, by comparison with random effect models, can be shown to serve a corrective purpose. The phenomena are illustrated by -H2AX foci data as used for instance in radiation biodosimetry for the calibration of dose-response curves.

摘要

对于计数数据的建模,常见的做法是对某些子组或预测器配置的原始数据进行聚合。例如,在辐射暴露的计数生物标志物数据中就是这种情况。在泊松定律下,计数数据可以在不丢失泊松参数信息的情况下进行聚合,如果泊松假设放宽为拟泊松,情况仍然如此。然而,特别是在生物剂量学中,以及除此之外,关于拟泊松模型的分散估计在数据聚合下的行为如何的问题,很少受到关注。事实上,对于具有未解释异质性的实际数据集,在聚合后分散估计值可以强烈增加,我们将为一些情况明确地展示和量化这种效应。分散估计值的增加意味着参数标准误差的膨胀,然而,与随机效应模型相比,可以证明其具有纠正目的。这些现象通过例如在辐射生物剂量学中用于校准剂量反应曲线的 -H2AX 焦点数据来说明。

相似文献

1
The effect of data aggregation on dispersion estimates in count data models.数据聚集对计数数据模型中离散估计的影响。
Int J Biostat. 2021 May 7;18(1):183-202. doi: 10.1515/ijb-2020-0079.
2
Using observation-level random effects to model overdispersion in count data in ecology and evolution.使用观察水平的随机效应模型来模拟生态学和进化中的计数数据的过离散。
PeerJ. 2014 Oct 9;2:e616. doi: 10.7717/peerj.616. eCollection 2014.
3
Approaches for dealing with various sources of overdispersion in modeling count data: Scale adjustment versus modeling.处理计数数据建模中各种过度分散来源的方法:尺度调整与建模。
Stat Methods Med Res. 2017 Aug;26(4):1802-1823. doi: 10.1177/0962280215588569. Epub 2015 May 31.
4
Revisiting dispersion in count data item response theory models: The Conway-Maxwell-Poisson counts model.重新审视计数数据项目反应理论模型中的分散性:康威-马克斯韦尔-泊松计数模型。
Br J Math Stat Psychol. 2020 Nov;73 Suppl 1:32-50. doi: 10.1111/bmsp.12184. Epub 2019 Aug 16.
5
Zero-inflated regression models for radiation-induced chromosome aberration data: A comparative study.辐射诱导染色体畸变数据的零膨胀回归模型:一项比较研究。
Biom J. 2016 Mar;58(2):259-79. doi: 10.1002/bimj.201400233. Epub 2015 Oct 13.
6
A comparison of observation-level random effect and Beta-Binomial models for modelling overdispersion in Binomial data in ecology & evolution.用于模拟生态学与进化领域二项式数据过分散的观测水平随机效应模型与贝塔-二项式模型的比较
PeerJ. 2015 Jul 21;3:e1114. doi: 10.7717/peerj.1114. eCollection 2015.
7
Modeling motor vehicle crashes using Poisson-gamma models: examining the effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter.使用泊松-伽马模型对机动车碰撞事故进行建模:研究低样本均值和小样本量对固定离散参数估计的影响。
Accid Anal Prev. 2006 Jul;38(4):751-66. doi: 10.1016/j.aap.2006.02.001. Epub 2006 Mar 20.
8
Beyond mean modelling: Bias due to misspecification of dispersion in Poisson-inverse Gaussian regression.
Biom J. 2019 Mar;61(2):333-342. doi: 10.1002/bimj.201700218. Epub 2018 Jul 12.
9
Application of the Poisson-Tweedie distribution in analyzing crash frequency data.泊松-图伊分布在分析碰撞频率数据中的应用。
Accid Anal Prev. 2020 Mar;137:105456. doi: 10.1016/j.aap.2020.105456. Epub 2020 Feb 6.
10
Sensitivity of test for overdispersion in Poisson regression.泊松回归中过度离散检验的敏感性。
Biom J. 2005 Apr;47(2):167-76. doi: 10.1002/bimj.200310096.

引用本文的文献

1
Relationship between air quality and asthma-related emergency hospital admissions in Mexico City 2017-2019.2017-2019 年墨西哥城空气质量与哮喘相关急诊住院的关系
Thorax. 2023 Dec 15;79(1):43-49. doi: 10.1136/thorax-2022-219262.
2
A Sequential Cross-Sectional Analysis Producing Robust Weekly COVID-19 Rates for South East Asian Countries.东南亚国家稳健的每周 COVID-19 发病率的序列横断面分析。
Viruses. 2023 Jul 18;15(7):1572. doi: 10.3390/v15071572.