• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于对计数数据建模的统计方法比较及其在住院时间中的应用。

A comparison of statistical methods for modeling count data with an application to hospital length of stay.

机构信息

School of Mathematical and Statistical Sciences, University of Texas Rio Grande Valley, One West University Boulevard, Brownsville CampusBrownsville, TX, 78520, USA.

出版信息

BMC Med Res Methodol. 2022 Aug 4;22(1):211. doi: 10.1186/s12874-022-01685-8.

DOI:10.1186/s12874-022-01685-8
PMID:35927612
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9351158/
Abstract

BACKGROUND

Hospital length of stay (LOS) is a key indicator of hospital care management efficiency, cost of care, and hospital planning. Hospital LOS is often used as a measure of a post-medical procedure outcome, as a guide to the benefit of a treatment of interest, or as an important risk factor for adverse events. Therefore, understanding hospital LOS variability is always an important healthcare focus. Hospital LOS data can be treated as count data, with discrete and non-negative values, typically right skewed, and often exhibiting excessive zeros. In this study, we compared the performance of the Poisson, negative binomial (NB), zero-inflated Poisson (ZIP), and zero-inflated negative binomial (ZINB) regression models using simulated and empirical data.

METHODS

Data were generated under different simulation scenarios with varying sample sizes, proportions of zeros, and levels of overdispersion. Analysis of hospital LOS was conducted using empirical data from the Medical Information Mart for Intensive Care database.

RESULTS

Results showed that Poisson and ZIP models performed poorly in overdispersed data. ZIP outperformed the rest of the regression models when the overdispersion is due to zero-inflation only. NB and ZINB regression models faced substantial convergence issues when incorrectly used to model equidispersed data. NB model provided the best fit in overdispersed data and outperformed the ZINB model in many simulation scenarios with combinations of zero-inflation and overdispersion, regardless of the sample size. In the empirical data analysis, we demonstrated that fitting incorrect models to overdispersed data leaded to incorrect regression coefficients estimates and overstated significance of some of the predictors.

CONCLUSIONS

Based on this study, we recommend to the researchers that they consider the ZIP models for count data with zero-inflation only and NB models for overdispersed data or data with combinations of zero-inflation and overdispersion. If the researcher believes there are two different data generating mechanisms producing zeros, then the ZINB regression model may provide greater flexibility when modeling the zero-inflation and overdispersion.

摘要

背景

住院时长(LOS)是医院管理效率、医疗成本和医院规划的关键指标。医院 LOS 通常被用作医疗后程序结果的衡量标准,作为治疗效果的指导,或作为不良事件的重要风险因素。因此,了解医院 LOS 的变化一直是医疗保健的重点。医院 LOS 数据可以视为计数数据,具有离散的非负数值,通常呈右偏态分布,并且经常出现大量零值。在这项研究中,我们使用模拟数据和实际数据比较了泊松、负二项式(NB)、零膨胀泊松(ZIP)和零膨胀负二项式(ZINB)回归模型的性能。

方法

在不同的模拟场景下,根据样本量、零值比例和过度离散程度的变化生成数据。使用来自重症监护医疗信息集市数据库的实际数据对医院 LOS 进行分析。

结果

结果表明,泊松和 ZIP 模型在过度离散数据下表现不佳。当过度离散仅由于零膨胀引起时,ZIP 模型优于其他回归模型。当错误地用于模拟等分散数据时,NB 和 ZINB 回归模型会遇到严重的收敛问题。NB 模型在过度离散数据中提供了最佳拟合,并且在许多具有零膨胀和过度离散组合的模拟场景中,无论样本量如何,都优于 ZINB 模型。在实际数据分析中,我们证明了将错误模型拟合到过度离散数据中会导致回归系数估计错误,并夸大了一些预测因子的显著性。

结论

基于这项研究,我们建议研究人员对于仅具有零膨胀的计数数据考虑使用 ZIP 模型,对于过度离散数据或具有零膨胀和过度离散组合的数据考虑使用 NB 模型。如果研究人员认为有两种不同的数据生成机制产生零值,则在对零膨胀和过度离散进行建模时,ZINB 回归模型可能提供更大的灵活性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/9351158/5f7e609fb513/12874_2022_1685_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/9351158/5f7e609fb513/12874_2022_1685_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7b3/9351158/5f7e609fb513/12874_2022_1685_Fig1_HTML.jpg

相似文献

1
A comparison of statistical methods for modeling count data with an application to hospital length of stay.一种用于对计数数据建模的统计方法比较及其在住院时间中的应用。
BMC Med Res Methodol. 2022 Aug 4;22(1):211. doi: 10.1186/s12874-022-01685-8.
2
On performance of parametric and distribution-free models for zero-inflated and over-dispersed count responses.关于零膨胀和过度分散计数响应的参数模型和非参数模型的性能。
Stat Med. 2015 Oct 30;34(24):3235-45. doi: 10.1002/sim.6560. Epub 2015 Jun 15.
3
Multilevel modeling in single-case studies with zero-inflated and overdispersed count data.零膨胀和过离散计数数据的单病例研究中的多层次建模。
Behav Res Methods. 2024 Apr;56(4):2765-2781. doi: 10.3758/s13428-024-02359-7. Epub 2024 Feb 21.
4
Models for analyzing zero-inflated and overdispersed count data: an application to cigarette and marijuana use.用于分析零膨胀和过度分散计数数据的模型:在香烟和大麻使用中的应用。
Nicotine Tob Res. 2018 Apr 18;22(8):1390-8. doi: 10.1093/ntr/nty072.
5
Evaluation of negative binomial and zero-inflated negative binomial models for the analysis of zero-inflated count data: application to the telemedicine for children with medical complexity trial.零膨胀计数数据的负二项式和零膨胀负二项式模型评估:在医疗复杂性儿童远程医疗试验中的应用。
Trials. 2023 Sep 27;24(1):613. doi: 10.1186/s13063-023-07648-8.
6
Statistical modelling of falls count data with excess zeros.基于过零数据的跌倒计数资料的统计建模。
Inj Prev. 2011 Aug;17(4):266-70. doi: 10.1136/ip.2011.031740. Epub 2011 Jun 8.
7
Analyzing hospitalization data: potential limitations of Poisson regression.分析住院数据:泊松回归的潜在局限性
Nephrol Dial Transplant. 2015 Aug;30(8):1244-9. doi: 10.1093/ndt/gfv071. Epub 2015 Mar 25.
8
A simulation study of the performance of statistical models for count outcomes with excessive zeros.计数结局中过度零的统计模型性能的模拟研究。
Stat Med. 2024 Oct 30;43(24):4752-4767. doi: 10.1002/sim.10198. Epub 2024 Aug 28.
9
Marginalized zero-inflated negative binomial regression with application to dental caries.边缘化零膨胀负二项回归及其在龋齿研究中的应用
Stat Med. 2016 May 10;35(10):1722-35. doi: 10.1002/sim.6804. Epub 2015 Nov 15.
10
A score test for overdispersion in zero-inflated poisson mixed regression model.零膨胀泊松混合回归模型中过度离散的得分检验。
Stat Med. 2007 Mar 30;26(7):1608-22. doi: 10.1002/sim.2616.

引用本文的文献

1
Relationship between Household Tuberculosis and Socioeconomic and Bioenvironmental Factors: A Statistical Model Approach Using NFHS-5 Data.家庭结核病与社会经济和生物环境因素之间的关系:一种使用国家家庭健康调查-5数据的统计模型方法
Indian J Community Med. 2025 Jul-Aug;50(4):689-693. doi: 10.4103/ijcm.ijcm_191_24. Epub 2025 Feb 21.
2
Does a take-home dose program result in better patient adherence to methadone? Evidence from Vietnam.带回家剂量计划是否能提高患者对美沙酮的依从性?来自越南的证据。
Harm Reduct J. 2025 Jul 28;22(1):131. doi: 10.1186/s12954-025-01279-9.
3
Does change in area-level deprivation, change health outcomes? A latent class growth analysis of population data.

本文引用的文献

1
The association between opening a short stay paediatric assessment unit and trends in short stay hospital admissions.开设短期儿科评估单位与短期住院人数趋势之间的关联。
BMC Health Serv Res. 2021 May 29;21(1):523. doi: 10.1186/s12913-021-06541-x.
2
Preoperative Physical Therapy Results in Shorter Length of Stay and Discharge Disposition Following Total Knee Arthroplasty: A Retrospective Study.术前物理治疗可缩短全膝关节置换术后的住院时间和出院处置时间:一项回顾性研究。
J Rehabil Med Clin Commun. 2019 May 23;2:1000017. doi: 10.2340/20030711-1000017. eCollection 2019.
3
Statistical models for analyzing count data: predictors of length of stay among HIV patients in Portugal using a multilevel model.
地区层面的贫困变化会改变健康结果吗?一项基于人口数据的潜在类别增长分析。
SSM Popul Health. 2025 Jun 11;31:101826. doi: 10.1016/j.ssmph.2025.101826. eCollection 2025 Sep.
4
Zero-inflated models for the evaluation of colorectal polyps in colon cancer screening studies-a value-based biostatistics practice.用于结肠癌筛查研究中评估结直肠息肉的零膨胀模型——基于价值的生物统计学实践
PeerJ. 2025 May 26;13:e19504. doi: 10.7717/peerj.19504. eCollection 2025.
5
Prolonged Length of Stay at Out-Of-State Trauma Centers: Potential Role for Repatriation.在州外创伤中心的住院时间延长:遣返的潜在作用。
J Am Coll Surg. 2025 May 16. doi: 10.1097/XCS.0000000000001449.
6
Modeling the Microsurgical Learning Curve Using a Poisson-Based Statistical Approach for Skill Assessment.使用基于泊松分布的统计方法评估技能来模拟显微外科学习曲线。
Cureus. 2025 Apr 25;17(4):e83009. doi: 10.7759/cureus.83009. eCollection 2025 Apr.
7
The Prevalence, Risk Factors, and Clinical Outcomes of Vitamin C Deficiency in Adult Hospitalised Patients: A Retrospective Observational Study.成年住院患者维生素C缺乏的患病率、危险因素及临床结局:一项回顾性观察研究
Nutrients. 2025 Mar 25;17(7):1131. doi: 10.3390/nu17071131.
8
Development of Multiservice Machine Learning Models to Predict Postsurgical Length of Stay and Discharge Disposition at the Time of Case Posting.开发多服务机器学习模型以预测病例发布时的术后住院时间和出院处置情况。
Ann Surg Open. 2025 Jan 31;6(1):e547. doi: 10.1097/AS9.0000000000000547. eCollection 2025 Mar.
9
Acute pain trajectories in elderly patients with fragility hip fractures.老年脆性髋部骨折患者的急性疼痛轨迹
Bone. 2025 Apr;193:117428. doi: 10.1016/j.bone.2025.117428. Epub 2025 Feb 22.
10
Low falls and inpatient complications increase risk for longer length of stay in older persons admitted following trauma.轻度跌倒和住院并发症会增加创伤后入院的老年人住院时间延长的风险。
BMC Geriatr. 2025 Feb 14;25(1):98. doi: 10.1186/s12877-025-05755-6.
用于分析计数数据的统计模型:使用多层模型对葡萄牙艾滋病毒患者住院时间的预测因素
BMC Health Serv Res. 2021 Apr 21;21(1):372. doi: 10.1186/s12913-021-06389-1.
4
A Study of Factors Affecting the Length of Hospital Stay of COVID-19 Patients by Cox-Proportional Hazard Model in a South Indian Tertiary Care Hospital.一项在印度南部一家三级护理医院中应用 Cox 比例风险模型研究影响 COVID-19 患者住院时间的因素的研究。
J Prim Care Community Health. 2021 Jan-Dec;12:21501327211000231. doi: 10.1177/21501327211000231.
5
Cost-Effectiveness Analysis of Type 2 Diabetes Mellitus (T2DM) Treatment in Patients with Complications of Kidney and Peripheral Vascular Diseases in Indonesia.印度尼西亚肾和外周血管疾病并发症患者2型糖尿病(T2DM)治疗的成本效益分析
Healthcare (Basel). 2021 Feb 16;9(2):211. doi: 10.3390/healthcare9020211.
6
Predicting Length of Stay and Discharge Destination for Surgical Patients: A Cohort Study.预测手术患者的住院时间和出院去向:一项队列研究。
Int J Environ Res Public Health. 2020 Dec 18;17(24):9490. doi: 10.3390/ijerph17249490.
7
COVID-19 length of hospital stay: a systematic review and data synthesis.COVID-19 住院时间:系统评价和数据综合。
BMC Med. 2020 Sep 3;18(1):270. doi: 10.1186/s12916-020-01726-3.
8
Costs and Length of Stay of Hospitalizations due to Diabetes-Related Complications.因糖尿病相关并发症导致的住院费用和住院时间。
J Diabetes Res. 2019 Sep 8;2019:2363292. doi: 10.1155/2019/2363292. eCollection 2019.
9
Excess length of hospital stay due to healthcare acquired infections: methodologies evaluation.因医疗保健相关感染导致的住院时间延长:方法学评估
Ann Ig. 2019 Sep-Oct;31(5):507-516. doi: 10.7416/ai.2019.2311.
10
Improving length of stay prediction using a hidden Markov model.使用隐马尔可夫模型改善住院时间预测。
AMIA Jt Summits Transl Sci Proc. 2019 May 6;2019:425-434. eCollection 2019.