一种用于创建风险调整支付公式的新型机器学习算法。

A Novel Machine Learning Algorithm for Creating Risk-Adjusted Payment Formulas.

机构信息

Center for Innovation in Population Health, College of Public Health, University of Kentucky, Lexington.

Department of Economics, Boston University, Boston, Massachusetts.

出版信息

JAMA Health Forum. 2024 Apr 5;5(4):e240625. doi: 10.1001/jamahealthforum.2024.0625.

DOI:10.1001/jamahealthforum.2024.0625

PMID:38639980

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11065160/

Abstract

IMPORTANCE

Models predicting health care spending and other outcomes from administrative records are widely used to manage and pay for health care, despite well-documented deficiencies. New methods are needed that can incorporate more than 70 000 diagnoses without creating undesirable coding incentives.

OBJECTIVE

To develop a machine learning (ML) algorithm, building on Diagnostic Item (DXI) categories and Diagnostic Cost Group (DCG) methods, that automates development of clinically credible and transparent predictive models for policymakers and clinicians.

DESIGN, SETTING, AND PARTICIPANTS: DXIs were organized into disease hierarchies and assigned an Appropriateness to Include (ATI) score to reflect vagueness and gameability concerns. A novel automated DCG algorithm iteratively assigned DXIs in 1 or more disease hierarchies to DCGs, identifying sets of DXIs with the largest regression coefficient as dominant; presence of a previously identified dominating DXI removed lower-ranked ones before the next iteration. The Merative MarketScan Commercial Claims and Encounters Database for commercial health insurance enrollees 64 years and younger was used. Data from January 2016 through December 2018 were randomly split 90% to 10% for model development and validation, respectively. Deidentified claims and enrollment data were delivered by Merative the following November in each calendar year and analyzed from November 2020 to January 2024.

MAIN OUTCOME AND MEASURES

Concurrent top-coded total health care cost. Model performance was assessed using validation sample weighted least-squares regression, mean absolute errors, and mean errors for rare and common diagnoses.

RESULTS

This study included 35 245 586 commercial health insurance enrollees 64 years and younger (65 901 460 person-years) and relied on 19 clinicians who provided reviews in the base model. The algorithm implemented 218 clinician-specified hierarchies compared with the US Department of Health and Human Services (HHS) hierarchical condition category (HCC) model's 64 hierarchies. The base model that dropped vague and gameable DXIs reduced the number of parameters by 80% (1624 of 3150), achieved an R2 of 0.535, and kept mean predicted spending within 12% ($3843 of $31 313) of actual spending for the 3% of people with rare diseases. In contrast, the HHS HCC model had an R2 of 0.428 and underpaid this group by 33% ($10 354 of $31 313).

CONCLUSIONS AND RELEVANCE

In this study, by automating DXI clustering within clinically specified hierarchies, this algorithm built clinically interpretable risk models in large datasets while addressing diagnostic vagueness and gameability concerns.

摘要

重要性

尽管已有充分的文件证明其存在缺陷，但预测医疗支出和其他结果的模型仍被广泛用于管理和支付医疗保健费用。需要新的方法，这些方法可以纳入 70000 多个诊断，而不会产生不良的编码激励。

目的

开发一种机器学习 (ML) 算法，基于诊断项目 (DXI) 类别和诊断费用组 (DCG) 方法，为政策制定者和临床医生自动开发具有临床可信度和透明度的预测模型。

设计、设置和参与者：将 DXIs 组织成疾病层次结构，并分配适当性纳入 (ATI) 分数，以反映模糊性和可操作性问题。一种新颖的自动化 DCG 算法迭代地将 DXIs 分配到 1 个或多个疾病层次结构中，识别出具有最大回归系数的 DXIs 集作为主导；之前确定的主导 DXI 的存在消除了下一个迭代之前排名较低的 DXIs。使用 Merative 的 MarketScan 商业索赔和就诊数据库，该数据库涵盖了 64 岁及以下的商业健康保险参保人。2016 年 1 月至 2018 年 12 月的数据分别随机分为 90%和 10%用于模型开发和验证。Merative 在次年 11 月提供了经过去识别的索赔和参保数据，并在 2020 年 11 月至 2024 年 1 月进行了分析。

主要结果和措施

同期最高编码总医疗费用。使用验证样本加权最小二乘回归、罕见和常见诊断的平均绝对误差和平均误差来评估模型性能。

结果

这项研究包括 35245586 名 64 岁及以下的商业健康保险参保人（65901460 人年），并依赖于 19 名在基础模型中提供审查的临床医生。该算法实施了 218 个临床医生指定的层次结构，而美国卫生与公众服务部（HHS）的层次条件类别（HCC）模型只有 64 个层次结构。基础模型删除了模糊和可操作性的 DXIs，将参数数量减少了 80%（从 3150 个减少到 1624 个），实现了 0.535 的 R2，并将 3%罕见疾病患者的预测支出保持在实际支出的 12%以内（3843 美元，31313 美元）。相比之下，HHS HCC 模型的 R2 为 0.428，该模型对这一人群的支付额低了 33%（10354 美元，31313 美元）。

结论和相关性

在这项研究中，通过在临床指定的层次结构中自动对 DXI 进行聚类，该算法在大型数据集上构建了具有临床可解释性的风险模型，同时解决了诊断模糊性和可操作性问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5497/11065160/16e21e8cdb3b/jamahealthforum-e240625-g001.jpg

相似文献

A Novel Machine Learning Algorithm for Creating Risk-Adjusted Payment Formulas.一种用于创建风险调整支付公式的新型机器学习算法。

JAMA Health Forum. 2024 Apr 5;5(4):e240625. doi: 10.1001/jamahealthforum.2024.0625.

Development and Assessment of a New Framework for Disease Surveillance, Prediction, and Risk Adjustment: The Diagnostic Items Classification System.疾病监测、预测和风险调整新框架的开发与评估：诊断项目分类系统。

JAMA Health Forum. 2022 Mar 25;3(3):e220276. doi: 10.1001/jamahealthforum.2022.0276. eCollection 2022 Mar.

Plan Selection, Enrollee Risk, and Health Spending on the Patient Protection and Affordable Care Act Individual Marketplaces, 2019.《2019 年患者保护与平价医疗法案个人市场的计划选择、参保人风险与医疗支出》

JAMA Netw Open. 2023 Mar 1;6(3):e234529. doi: 10.1001/jamanetworkopen.2023.4529.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Diagnostic Category Prevalence in 3 Classification Systems Across the Transition to the International Classification of Diseases, Tenth Revision, Clinical Modification.在向国际疾病分类第十版临床修订版过渡的过程中，3 种分类系统中的诊断类别患病率。

JAMA Netw Open. 2020 Apr 1;3(4):e202280. doi: 10.1001/jamanetworkopen.2020.2280.

Mental Health Risk Adjustment with Clinical Categories and Machine Learning.精神健康风险调整与临床类别和机器学习。

Health Serv Res. 2018 Aug;53 Suppl 1(Suppl Suppl 1):3189-3206. doi: 10.1111/1475-6773.12818. Epub 2017 Dec 15.

Bundled Payment Models for Actinic Keratosis Management.光化性角化病管理的打包支付模式。

JAMA Dermatol. 2016 Jul 1;152(7):789-97. doi: 10.1001/jamadermatol.2016.0502.

The future of Cochrane Neonatal.考克兰新生儿协作网的未来。

Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

A population health approach to reducing observational intensity bias in health risk adjustment: cross sectional analysis of insurance claims.采用人群健康方法减少健康风险调整中的观察强度偏差：保险索赔的横断面分析。

BMJ. 2014 Apr 10;348:g2392. doi: 10.1136/bmj.g2392.

Diagnostic, pharmacy-based, and self-reported health measures in risk equalization models.诊断、药学和自我报告的健康措施在风险均衡模型中。

Med Care. 2010 May;48(5):448-57. doi: 10.1097/MLR.0b013e3181d559b4.

引用本文的文献

Algorithms to Improve Fairness in Medicare Risk Adjustment.改善医疗保险风险调整公平性的算法

JAMA Health Forum. 2025 Aug 1;6(8):e252640. doi: 10.1001/jamahealthforum.2025.2640.

Artificial Intelligence in Relation to Accurate Information and Tasks in Gynecologic Oncology and Clinical Medicine-Dunning-Kruger Effects and Ultracrepidarianism.人工智能与妇科肿瘤学和临床医学中的准确信息及任务——邓宁-克鲁格效应和不懂装懂。

Diagnostics (Basel). 2025 Mar 15;15(6):735. doi: 10.3390/diagnostics15060735.

Algorithms to Improve Fairness in Medicare Risk Adjustment.改善医疗保险风险调整公平性的算法

medRxiv. 2025 Jan 27:2025.01.25.25321057. doi: 10.1101/2025.01.25.25321057.

本文引用的文献

Principles to Inform Risk Adjustment Policy: Reflection From the Front Lines.为风险调整政策提供参考的原则：来自一线的思考

JAMA. 2023 Sep 5;330(9):805-806. doi: 10.1001/jama.2023.12830.

Guiding Risk Adjustment Models Toward Machine Learning Methods.引导风险调整模型走向机器学习方法。

JAMA. 2023 Sep 5;330(9):807-808. doi: 10.1001/jama.2023.12920.

Preparing for the International Classification of Diseases, 11th Revision (ICD-11) in the US Health Care System.为美国医疗保健系统中的《国际疾病分类》第11次修订版（ICD-11）做准备。

JAMA Health Forum. 2023 Jul 7;4(7):e232253. doi: 10.1001/jamahealthforum.2023.2253.

Risk Adjustment And Promoting Health Equity In Population-Based Payment: Concepts And Evidence.基于人群的支付中的风险调整和促进健康公平：概念和证据。

Health Aff (Millwood). 2023 Jan;42(1):105-114. doi: 10.1377/hlthaff.2022.00916.

JAMA Health Forum. 2022 Mar 25;3(3):e220276. doi: 10.1001/jamahealthforum.2022.0276. eCollection 2022 Mar.

Measuring Multimorbidity: Selecting the Right Instrument for the Purpose and the Data Source.测量多种并存疾病：根据目的和数据源选择合适的工具。

Med Care. 2021 Aug 1;59(8):743-756. doi: 10.1097/MLR.0000000000001566.

Upcoding: Evidence from Medicare on Squishy Risk Adjustment.高编计费：来自医疗保险关于模糊风险调整的证据。

J Polit Econ. 2020 Mar;12(3):984-1026. doi: 10.1086/704756. Epub 2020 Jan 29.

JAMA Netw Open. 2020 Apr 1;3(4):e202280. doi: 10.1001/jamanetworkopen.2020.2280.

Fair regression for health care spending.公平回归医疗支出。

Biometrics. 2020 Sep;76(3):973-982. doi: 10.1111/biom.13206. Epub 2020 Jan 6.

Social Determinants of Health in Managed Care Payment Formulas.管理式医疗支付公式中的健康社会决定因素

JAMA Intern Med. 2017 Oct 1;177(10):1424-1430. doi: 10.1001/jamainternmed.2017.3317.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种用于创建风险调整支付公式的新型机器学习算法。

A Novel Machine Learning Algorithm for Creating Risk-Adjusted Payment Formulas.

机构信息

出版信息

IMPORTANCE

OBJECTIVE

MAIN OUTCOME AND MEASURES

RESULTS

CONCLUSIONS AND RELEVANCE

重要性

目的

主要结果和措施

结果

结论和相关性

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献