Analyzing hospitalization data: potential limitations of Poisson regression.

Authors

Weaver Colin G, Ravani Pietro, Oliver Matthew J, Austin Peter C, Quinn Robert R

Affiliations

Department of Community Health Sciences, University of Calgary, Calgary, Alberta, Canada.

Department of Community Health Sciences, University of Calgary, Calgary, Alberta, Canada; Department of Medicine, University of Calgary, Calgary, Alberta, Canada.

Publication information

Nephrol Dial Transplant. 2015 Aug;30(8):1244-9. doi: 10.1093/ndt/gfv071. Epub 2015 Mar 25.

DOI: 10.1093/ndt/gfv071
PMID: 25813274

Abstract

BACKGROUND

Poisson regression is commonly used to analyze hospitalization data when outcomes are expressed as counts (e.g. number of days in hospital). However, data often violate the assumptions on which Poisson regression is based. More appropriate extensions of this model, while available, are rarely used.

METHODS

We compared hospitalization data between 206 patients treated with hemodialysis (HD) and 107 treated with peritoneal dialysis (PD) using standard Poisson regression, then compared those results with the results of three other approaches for modeling count data: negative binomial (NB) regression, zero-inflated Poisson (ZIP) regression and zero-inflated negative binomial (ZINB) regression. We examined the appropriateness of each model and compared the results obtained with each approach.
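
For orientation, the four models can be written in one common textbook form. This is a sketch, not the paper's own notation: the symbols $\mu$ (mean count), $\alpha$ (dispersion) and $\pi$ (probability of a structural zero), and the NB2 parameterization, are assumptions introduced here.

```latex
\begin{align*}
\text{Poisson:}\quad & P(Y=y) = \frac{e^{-\mu}\mu^{y}}{y!},
  & \operatorname{Var}(Y) &= \mu \\
\text{NB:}\quad & P(Y=y) = \frac{\Gamma(y+1/\alpha)}{\Gamma(1/\alpha)\,y!}
  \left(\frac{1}{1+\alpha\mu}\right)^{1/\alpha}
  \left(\frac{\alpha\mu}{1+\alpha\mu}\right)^{y},
  & \operatorname{Var}(Y) &= \mu + \alpha\mu^{2} \\
\text{ZIP / ZINB:}\quad & P(Y=0) = \pi + (1-\pi)\,f(0), \qquad
  P(Y=y) = (1-\pi)\,f(y) \quad (y \ge 1)
\end{align*}
```

Here $f$ is the Poisson mass function for ZIP and the NB mass function for ZINB. The NB model reduces to the Poisson model as $\alpha \to 0$, and each zero-inflated model reduces to its base model when $\pi = 0$.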

RESULTS

During a mean follow-up of 1.9 years, 183 of 313 patients (58%) were never hospitalized (indicating an excess of 'zeros'). The data also displayed overdispersion (variance greater than mean), violating another assumption of the Poisson model. Using four criteria, we determined that the NB and ZINB models performed best. According to these two models, patients treated with HD had hospitalization rates similar to those of patients receiving PD {NB rate ratio (RR): 1.04 [bootstrapped 95% confidence interval (CI): 0.49-2.20]; ZINB summary RR: 1.21 (bootstrapped 95% CI 0.60-2.46)}. Poisson and ZIP models fit the data poorly and had much larger point estimates than the NB and ZINB models [Poisson RR: 1.93 (bootstrapped 95% CI 0.88-4.23); ZIP summary RR: 1.84 (bootstrapped 95% CI 0.88-3.84)].
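
The two violations described above (excess zeros and overdispersion) can be screened for with simple descriptive checks before any model is fitted. The following is a minimal sketch using only the Python standard library. The `hospital_days` values are hypothetical toy data, not the study data, and the function name `count_diagnostics` is made up for illustration. The check also ignores differences in follow-up time between patients, which a real analysis would need to handle (for example with an offset).

```python
import math
from statistics import mean, variance


def count_diagnostics(counts):
    """Summarize a count outcome against two Poisson assumptions."""
    m = mean(counts)
    v = variance(counts)  # sample variance (n - 1 denominator)
    observed_zero = sum(1 for c in counts if c == 0) / len(counts)
    return {
        "mean": m,
        "variance": v,
        "dispersion_ratio": v / m,       # close to 1 if the Poisson assumption holds
        "observed_zero": observed_zero,  # share of patients with a zero count
        "poisson_zero": math.exp(-m),    # P(Y = 0) for a Poisson with the same mean
    }


# Hypothetical hospital-day counts for 100 patients (NOT the study data).
hospital_days = (
    [0] * 58 + [1] * 5 + [2] * 5 + [3] * 6 + [5] * 6
    + [8] * 6 + [12] * 5 + [20] * 5 + [40] * 4
)

diagnostics = count_diagnostics(hospital_days)
for name, value in diagnostics.items():
    print(f"{name}: {value:.3f}")
```

A dispersion ratio well above 1, or an observed share of zeros far above the Poisson-implied share, suggests that NB or zero-inflated models deserve consideration. These checks are descriptive only and do not replace a formal comparison of fitted models.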

CONCLUSIONS

We found substantially different results when modeling hospitalization data, depending on the approach used. Our results argue strongly for a sound model selection process and improved reporting around statistical methods used for modeling count data.
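
One ingredient of a model selection process is a likelihood-based comparison of candidate models. The abstract does not list the four criteria the authors used, so the sketch below is only an illustration under stated assumptions: it compares intercept-only Poisson and NB models by Akaike information criterion (AIC) on hypothetical toy data, using the NB form with variance `mu + alpha * mu**2`. The dispersion `alpha` is found by a crude grid search rather than a proper optimizer, so the NB fit is approximate.

```python
import math


def poisson_loglik(counts, mu):
    """Log-likelihood of an intercept-only Poisson model with mean mu."""
    return sum(c * math.log(mu) - mu - math.lgamma(c + 1) for c in counts)


def nb_loglik(counts, mu, alpha):
    """Log-likelihood of an intercept-only NB model, variance mu + alpha * mu**2."""
    r = 1.0 / alpha
    p = r / (r + mu)
    return sum(
        math.lgamma(c + r) - math.lgamma(r) - math.lgamma(c + 1)
        + r * math.log(p) + c * math.log(1.0 - p)
        for c in counts
    )


def aic(loglik, n_params):
    """Akaike information criterion; lower values indicate better fit."""
    return 2 * n_params - 2 * loglik


# Hypothetical hospital-day counts for 100 patients (NOT the study data).
hospital_days = (
    [0] * 58 + [1] * 5 + [2] * 5 + [3] * 6 + [5] * 6
    + [8] * 6 + [12] * 5 + [20] * 5 + [40] * 4
)

# The sample mean maximizes the likelihood in mu for both intercept-only models.
mu_hat = sum(hospital_days) / len(hospital_days)

# Crude grid search for alpha on a log scale from 1e-3 to 1e2 (an approximation).
alpha_grid = [10 ** (-3 + 5 * i / 400) for i in range(401)]
alpha_hat = max(alpha_grid, key=lambda a: nb_loglik(hospital_days, mu_hat, a))

aic_poisson = aic(poisson_loglik(hospital_days, mu_hat), n_params=1)
aic_nb = aic(nb_loglik(hospital_days, mu_hat, alpha_hat), n_params=2)
print(f"alpha_hat={alpha_hat:.3f}  AIC Poisson={aic_poisson:.1f}  AIC NB={aic_nb:.1f}")
```

In practice one would fit full regression models with covariates, an exposure offset and zero-inflated variants using a statistics package, and would weigh several criteria together rather than relying on AIC alone.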
