• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用公开可用的行政数据库预测30天再入院情况。一种条件逻辑回归建模方法。

Predicting 30-day Hospital Readmission with Publicly Available Administrative Database. A Conditional Logistic Regression Modeling Approach.

作者信息

Zhu K, Lou Z, Zhou J, Ballester N, Kong N, Parikh P

机构信息

Nan Kong, 206 S. Martin Jischke Dr., West Lafayette, IN 47907, USA, E-mail:

出版信息

Methods Inf Med. 2015;54(6):560-7. doi: 10.3414/ME14-02-0017. Epub 2015 Nov 9.

DOI:10.3414/ME14-02-0017
PMID:26548400
Abstract

INTRODUCTION

This article is part of the Focus Theme of Methods of Information in Medicine on "Big Data and Analytics in Healthcare".

BACKGROUND

Hospital readmissions raise healthcare costs and cause significant distress to providers and patients. It is, therefore, of great interest to healthcare organizations to predict what patients are at risk to be readmitted to their hospitals. However, current logistic regression based risk prediction models have limited prediction power when applied to hospital administrative data. Meanwhile, although decision trees and random forests have been applied, they tend to be too complex to understand among the hospital practitioners.

OBJECTIVES

Explore the use of conditional logistic regression to increase the prediction accuracy.

METHODS

We analyzed an HCUP statewide inpatient discharge record dataset, which includes patient demographics, clinical and care utilization data from California. We extracted records of heart failure Medicare beneficiaries who had inpatient experience during an 11-month period. We corrected the data imbalance issue with under-sampling. In our study, we first applied standard logistic regression and decision tree to obtain influential variables and derive practically meaning decision rules. We then stratified the original data set accordingly and applied logistic regression on each data stratum. We further explored the effect of interacting variables in the logistic regression modeling. We conducted cross validation to assess the overall prediction performance of conditional logistic regression (CLR) and compared it with standard classification models.

RESULTS

The developed CLR models outperformed several standard classification models (e.g., straightforward logistic regression, stepwise logistic regression, random forest, support vector machine). For example, the best CLR model improved the classification accuracy by nearly 20% over the straightforward logistic regression model. Furthermore, the developed CLR models tend to achieve better sensitivity of more than 10% over the standard classification models, which can be translated to correct labeling of additional 400 - 500 readmissions for heart failure patients in the state of California over a year. Lastly, several key predictor identified from the HCUP data include the disposition location from discharge, the number of chronic conditions, and the number of acute procedures.

CONCLUSIONS

It would be beneficial to apply simple decision rules obtained from the decision tree in an ad-hoc manner to guide the cohort stratification. It could be potentially beneficial to explore the effect of pairwise interactions between influential predictors when building the logistic regression models for different data strata. Judicious use of the ad-hoc CLR models developed offers insights into future development of prediction models for hospital readmissions, which can lead to better intuition in identifying high-risk patients and developing effective post-discharge care strategies. Lastly, this paper is expected to raise the awareness of collecting data on additional markers and developing necessary database infrastructure for larger-scale exploratory studies on readmission risk prediction.

摘要

引言

本文是《医学信息方法》关于“医疗保健中的大数据与分析”这一重点主题的一部分。

背景

医院再入院会增加医疗成本,并给医护人员和患者带来极大困扰。因此,医疗保健机构非常有兴趣预测哪些患者有再次入院的风险。然而,当前基于逻辑回归的风险预测模型应用于医院管理数据时,预测能力有限。同时,尽管决策树和随机森林已被应用,但它们对于医院从业者来说往往过于复杂而难以理解。

目的

探索使用条件逻辑回归来提高预测准确性。

方法

我们分析了一个全州范围的医疗保健成本和利用项目(HCUP)住院患者出院记录数据集,其中包括来自加利福尼亚州的患者人口统计学、临床和护理利用数据。我们提取了在11个月期间有住院经历的心力衰竭医疗保险受益人的记录。我们通过欠采样纠正了数据不平衡问题。在我们的研究中,我们首先应用标准逻辑回归和决策树来获得有影响的变量并得出具有实际意义的决策规则。然后我们对原始数据集进行分层,并对每个数据层应用逻辑回归。我们进一步探索了逻辑回归建模中交互变量的影响。我们进行交叉验证以评估条件逻辑回归(CLR)的整体预测性能,并将其与标准分类模型进行比较。

结果

所开发的CLR模型优于几种标准分类模型(例如,直接逻辑回归、逐步逻辑回归、随机森林、支持向量机)。例如,最佳的CLR模型比直接逻辑回归模型的分类准确率提高了近20%。此外,所开发的CLR模型比标准分类模型的敏感性往往高出10%以上,这意味着在加利福尼亚州,一年中可以多正确识别400 - 500例心力衰竭患者的再入院情况。最后,从HCUP数据中识别出的几个关键预测因素包括出院处置地点、慢性病数量和急性手术数量。

结论

以临时方式应用从决策树获得的简单决策规则来指导队列分层将是有益的。在为不同数据层构建逻辑回归模型时,探索有影响的预测因素之间成对交互的影响可能具有潜在益处。明智地使用所开发的临时CLR模型为医院再入院预测模型的未来发展提供了见解,这可以在识别高危患者和制定有效的出院后护理策略方面带来更好的直觉。最后,本文有望提高对收集额外标志物数据以及为再入院风险预测的大规模探索性研究开发必要数据库基础设施的认识。

相似文献

1
Predicting 30-day Hospital Readmission with Publicly Available Administrative Database. A Conditional Logistic Regression Modeling Approach.利用公开可用的行政数据库预测30天再入院情况。一种条件逻辑回归建模方法。
Methods Inf Med. 2015;54(6):560-7. doi: 10.3414/ME14-02-0017. Epub 2015 Nov 9.
2
A hospital wide predictive model for unplanned readmission using hierarchical ICD data.基于 ICD 数据的层级结构的全院范围预测性模型,用于预测非计划性再入院。
Comput Methods Programs Biomed. 2019 May;173:177-183. doi: 10.1016/j.cmpb.2019.02.007. Epub 2019 Feb 13.
3
Analyzing 30-Day Readmission Rate for Heart Failure Using Different Predictive Models.使用不同预测模型分析心力衰竭的30天再入院率。
Stud Health Technol Inform. 2016;225:143-7.
4
An improved support vector machine-based diabetic readmission prediction.基于改进支持向量机的糖尿病患者再入院预测。
Comput Methods Programs Biomed. 2018 Nov;166:123-135. doi: 10.1016/j.cmpb.2018.10.012. Epub 2018 Oct 12.
5
Predicting Depression among Patients with Diabetes Using Longitudinal Data. A Multilevel Regression Model.使用纵向数据预测糖尿病患者的抑郁症。一种多层回归模型。
Methods Inf Med. 2015;54(6):553-9. doi: 10.3414/ME14-02-0009. Epub 2015 Nov 18.
6
Using machine learning to predict paediatric 30-day unplanned hospital readmissions: a case-control retrospective analysis of medical records, including written discharge documentation.使用机器学习预测儿科 30 天内非计划性住院再入院率:病历回顾性病例对照研究,包括书面出院记录。
Aust Health Rev. 2021 Jun;45(3):328-337. doi: 10.1071/AH20062.
7
8
Analysis of Machine Learning Techniques for Heart Failure Readmissions.心力衰竭再入院的机器学习技术分析
Circ Cardiovasc Qual Outcomes. 2016 Nov;9(6):629-640. doi: 10.1161/CIRCOUTCOMES.116.003039. Epub 2016 Nov 8.
9
Prediction of In-hospital Mortality in Emergency Department Patients With Sepsis: A Local Big Data-Driven, Machine Learning Approach.急诊科脓毒症患者院内死亡率的预测:一种基于本地大数据驱动的机器学习方法。
Acad Emerg Med. 2016 Mar;23(3):269-78. doi: 10.1111/acem.12876. Epub 2016 Feb 13.
10
Using decision trees to explore the association between the length of stay and potentially avoidable readmissions: A retrospective cohort study.使用决策树探索住院时间与潜在可避免再入院之间的关联:一项回顾性队列研究。
Inform Health Soc Care. 2017 Dec;42(4):361-377. doi: 10.1080/17538157.2016.1269105. Epub 2017 Jan 13.

引用本文的文献

1
Investigating the impact of extreme weather events and related indicators on cardiometabolic multimorbidity.调查极端天气事件及相关指标对心血管代谢共病的影响。
Arch Public Health. 2024 Aug 19;82(1):128. doi: 10.1186/s13690-024-01361-x.
2
Value and Applicability of Large Administrative Healthcare Databases in Prosthetics and Orthotics Outcomes Research.大型行政医疗保健数据库在假肢与矫形器结果研究中的价值与适用性
Can Prosthet Orthot J. 2021 Sep 21;4(2):35958. doi: 10.33137/cpoj.v4i2.35958. eCollection 2021.
3
A Predictive Decision Analytics Approach for Primary Care Operations Management: A Case Study of Double-Booking Strategy Design and Evaluation.
一种用于初级保健运营管理的预测性决策分析方法:双预约策略设计与评估的案例研究
Comput Ind Eng. 2023 Mar;17. doi: 10.1016/j.cie.2023.109069. Epub 2023 Feb 18.
4
The path from big data analytics capabilities to value in hospitals: a scoping review.从大数据分析能力到医院的价值:范围综述。
BMC Health Serv Res. 2022 Jan 31;22(1):134. doi: 10.1186/s12913-021-07332-0.
5
The impact of creating mathematical formula to predict cardiovascular events in patients with heart failure.创建数学公式预测心力衰竭患者心血管事件的影响。
Sci Rep. 2018 Mar 5;8(1):3986. doi: 10.1038/s41598-018-22347-0.
6
Claims data-driven modeling of hospital time-to-readmission risk with latent heterogeneity.采用潜在异质性的医院再入院风险数据驱动建模。
Health Care Manag Sci. 2019 Mar;22(1):156-179. doi: 10.1007/s10729-018-9431-0. Epub 2018 Jan 25.
7
The Inherent Challenges of Using Large Data Sets in Healthcare Research: Experiences of an Interdisciplinary Team.医疗保健研究中使用大数据集的内在挑战:一个跨学科团队的经验
Comput Inform Nurs. 2017 May;35(5):221-225. doi: 10.1097/CIN.0000000000000359.
8
Contribution of temporal data to predictive performance in 30-day readmission of morbidly obese patients.时间数据对病态肥胖患者30天再入院预测性能的贡献。
PeerJ. 2017 Apr 25;5:e3230. doi: 10.7717/peerj.3230. eCollection 2017.