Improving the Prediction of Persistent High Health Care Utilizers: Retrospective Analysis Using Ensemble Methodology.

Authors

Howson Stephanie N, McShea Michael J, Ramachandran Raghav, Burkom Howard S, Chang Hsien-Yen, Weiner Jonathan P, Kharrazi Hadi

Affiliations

Applied Physics Laboratory, Johns Hopkins University, Baltimore, MD, United States.

Center for Population Health Information Technology, Johns Hopkins School of Public Health, Baltimore, MD, United States.

Publication information

JMIR Med Inform. 2022 Mar 24;10(3):e33212. doi: 10.2196/33212.

DOI: 10.2196/33212
PMID: 35275063
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8990371/
Abstract

BACKGROUND

A small proportion of high-need patients persistently use the bulk of health care services and incur disproportionate costs. Population health management (PHM) programs often refer to these patients as persistent high utilizers (PHUs). Accurate PHU prediction enables PHM programs to better align scarce health care resources with high-need PHUs while generally improving outcomes. While prior research in PHU prediction has shown promise, traditional regression methods used in these studies have yielded limited accuracy.

OBJECTIVE

We are seeking to improve PHU predictions with an ensemble approach in a retrospective observational study design using insurance claim records.

METHODS

We defined a PHU as a patient with health care costs in the top 20% of all patients for 4 consecutive 6-month periods. We used 2013 claims data to predict PHU status in the next 24 months. Our study population included 165,595 patients in the Johns Hopkins Health Care plan, with 8359 (5.1%) patients identified as PHUs in 2014 and 2015. We assessed the performance of several standalone machine learning methods and then an ensemble approach combining multiple models.
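
The PHU definition above can be illustrated with a short sketch. This is not the authors' implementation: the function names, the data layout (one dictionary of patient ID to total cost per 6-month period), and the handling of ties and of patients missing from a period are all assumptions, since the abstract does not specify them.

```python
from typing import Dict, List, Set


def top_fraction_ids(costs: Dict[str, float], fraction: float = 0.20) -> Set[str]:
    """Return the IDs of patients whose cost falls in the top `fraction` for one period.

    Assumption: ties at the cutoff are broken by sort order, and at least
    one patient is always selected when the period is non-empty.
    """
    if not costs:
        return set()
    ranked = sorted(costs, key=costs.get, reverse=True)
    cutoff = max(1, int(len(ranked) * fraction))
    return set(ranked[:cutoff])


def label_phus(period_costs: List[Dict[str, float]], fraction: float = 0.20) -> Set[str]:
    """Return patients who are in the top `fraction` in every supplied period.

    For the definition in the abstract, `period_costs` would hold 4
    consecutive 6-month periods. A patient absent from any period is
    never labeled a PHU (an assumption of this sketch).
    """
    if not period_costs:
        return set()
    tops = [top_fraction_ids(costs, fraction) for costs in period_costs]
    return set.intersection(*tops)
```

Calling `label_phus` with four per-period cost dictionaries returns only those patients who stay in the top cost tier throughout, which is what distinguishes a persistent high utilizer from a patient with a single expensive episode.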

RESULTS

The candidate ensemble with complement naïve Bayes and random forest layers produced increased sensitivity and positive predictive value (PPV; 49.0% and 50.3%, respectively) compared to logistic regression (46.8% and 46.1%, respectively).
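
For readers less familiar with the two metrics reported above, the sketch below shows how sensitivity and PPV are computed from binary labels and predictions. The function name and the toy data in the usage note are hypothetical and are not taken from the study.

```python
from typing import Sequence, Tuple


def sensitivity_and_ppv(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Compute sensitivity and positive predictive value for binary labels (1 = PHU).

    sensitivity = TP / (TP + FN): share of true PHUs that the model flags.
    PPV         = TP / (TP + FP): share of flagged patients who are true PHUs.
    Returns NaN for a metric whose denominator is zero.
    """
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    return sensitivity, ppv
```

As a toy example, with `y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]` and `y_pred = [1, 1, 0, 0, 1, 0, 0, 0, 0, 0]` there are 2 true positives, 2 false negatives, and 1 false positive, giving a sensitivity of 0.5 and a PPV of 2/3. A higher PPV at similar sensitivity means fewer low-risk patients are flagged for the same number of true PHUs found.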

CONCLUSIONS

Our results suggest that ensemble machine learning can improve prediction of care management needs. Improved PPV implies reduced incorrect referral of low-risk patients. With the improved sensitivity/PPV balance of this approach, resources may be directed more efficiently to patients needing them most.

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dc6/8990371/4e49e61598b3/medinform_v10i3e33212_fig1.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dc6/8990371/d1459a0f2e79/medinform_v10i3e33212_fig2.jpg
Figure 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dc6/8990371/d2285115bf3d/medinform_v10i3e33212_fig3.jpg
