Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations.

Author information

Department of Anesthesiology and Critical Care, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.

Penn Center for Perioperative Outcomes Research and Transformation (CPORT), University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.

Publication information

PLoS One. 2021 Jun 3;16(6):e0252585. doi: 10.1371/journal.pone.0252585. eCollection 2021.

DOI:10.1371/journal.pone.0252585

PMID:34081720

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8174683/

Abstract

OBJECTIVE

This study aimed to develop and validate a claims-based, machine learning algorithm to predict clinical outcomes across both medical and surgical patient populations.

METHODS

This retrospective, observational cohort study used a random 5% sample of 770,777 fee-for-service Medicare beneficiaries with an inpatient hospitalization between 2009 and 2011. The machine learning algorithms tested were support vector machine, random forest, multilayer perceptron, extreme gradient boosted tree, and logistic regression. The extreme gradient boosted tree algorithm outperformed the alternatives and was used for the final risk model. The primary outcome was 30-day mortality. Secondary outcomes were rehospitalization and any of 23 adverse clinical events occurring within 30 days of the index admission date.
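The 30-day outcome window described in the methods can be illustrated with a minimal sketch. The record layout, identifiers, and dates below are hypothetical and are not taken from the study's data. The sketch also assumes that day 0 is the index admission date and that an event on day 30 still counts, which the abstract does not specify.

```python
from datetime import date

WINDOW_DAYS = 30  # outcome window stated in the abstract


def within_window(index_admission, event_date, window_days=WINDOW_DAYS):
    """Return True if event_date falls 0..window_days days after index admission."""
    if event_date is None:  # no event recorded for this beneficiary
        return False
    delta = (event_date - index_admission).days
    return 0 <= delta <= window_days


# Hypothetical records: (beneficiary id, index admission date, date of death or None)
records = [
    ("A", date(2010, 3, 1), date(2010, 3, 20)),  # 19 days after admission
    ("B", date(2010, 3, 1), date(2010, 4, 15)),  # 45 days after admission
    ("C", date(2010, 3, 1), None),               # no death recorded
]

# Binary 30-day mortality label per beneficiary (1 = event inside the window)
labels = {bid: int(within_window(adm, died)) for bid, adm, died in records}
print(labels)
```

The same helper could label rehospitalization or any adverse event by passing that event's date in place of the date of death.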

RESULTS

Algorithm performance was evaluated by both the area under the receiver operating characteristic curve (AUROC) and the Brier score. The risk model demonstrated high performance for prediction of 30-day mortality (AUROC = 0.88; Brier score = 0.06) and 17 of the 23 adverse events (AUROC range: 0.80-0.86; Brier score range: 0.01-0.05). It demonstrated moderate performance for prediction of rehospitalization within 30 days (AUROC = 0.73; Brier score = 0.07) and six of the 23 adverse events (AUROC range: 0.74-0.79; Brier score range: 0.01-0.02). The risk model performed comparably on a second, independent validation dataset, indicating that it was not overfit.
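For readers unfamiliar with the two metrics, the following is a minimal pure-Python sketch of how an AUROC and a Brier score can be computed from binary outcomes and predicted risks. It is not the authors' evaluation code, and the outcomes and probabilities are made up for illustration. The AUROC here uses the pairwise (Mann-Whitney) formulation, which is quadratic in cohort size, so a rank-based or library implementation would be the practical choice for a cohort as large as the one in this study.

```python
def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and 0/1 outcome."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def auroc(y_true, y_prob):
    """Probability that a random positive case is scored above a random negative case.

    Ties count as one half (the Mann-Whitney formulation of the AUROC).
    """
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Made-up outcomes (1 = event) and predicted risks, for illustration only.
y_true = [0, 0, 1, 0, 1, 0, 0, 1]
y_prob = [0.05, 0.10, 0.80, 0.20, 0.30, 0.40, 0.02, 0.90]

print(round(auroc(y_true, y_prob), 3), round(brier_score(y_true, y_prob), 3))
```

An AUROC of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking, while a lower Brier score is better. The Brier score depends on how common the outcome is, so values for rare and common outcomes are not directly comparable.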

CONCLUSIONS AND RELEVANCE

We have developed and validated a robust, claims-based, machine learning risk model that is applicable to both medical and surgical patient populations and demonstrates predictive accuracy comparable to that of existing risk models.
