一种基于索赔数据的机器学习算法，用于识别肺动脉高压患者。

A claims-based, machine-learning algorithm to identify patients with pulmonary arterial hypertension.

作者信息

Hyde Bethany, Paoli Carly J, Panjabi Sumeet, Bettencourt Katherine C, Bell Lynum Karimah S, Selej Mona

机构信息

Janssen Business Technology Commercial Data Insights & Data Science Titusville New Jersey USA.

Janssen Scientific Affairs, Inc. Titusville New Jersey USA.

出版信息

Pulm Circ. 2023 Jun 6;13(2):e12237. doi: 10.1002/pul2.12237. eCollection 2023 Apr.

DOI:10.1002/pul2.12237

PMID:37287599

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10243208/

Abstract

Many patients with pulmonary arterial hypertension (PAH) experience substantial delays in diagnosis, which is associated with worse outcomes and higher costs. Tools for diagnosing PAH sooner may lead to earlier treatment, which may delay disease progression and adverse outcomes including hospitalization and death. We developed a machine-learning (ML) algorithm to identify patients at risk for PAH earlier in their symptom journey and distinguish them from patients with similar early symptoms not at risk for developing PAH. Our supervised ML model analyzed retrospective, de-identified data from the US-based Optum® Clinformatics® Data Mart claims database (January 2015 to December 2019). Propensity score matched PAH and non-PAH (control) cohorts were established based on observed differences. Random forest models were used to classify patients as PAH or non-PAH at diagnosis and at 6 months prediagnosis. The PAH and non-PAH cohorts included 1339 and 4222 patients, respectively. At 6 months prediagnosis, the model performed well in distinguishing PAH and non-PAH patients, with area under the curve of the receiver operating characteristic of 0.84, recall (sensitivity) of 0.73, and precision of 0.50. Key features distinguishing PAH from non-PAH cohorts were a longer time between first symptom and the prediagnosis model date (i.e., 6 months before diagnosis); more diagnostic and prescription claims, circulatory claims, and imaging procedures, leading to higher overall healthcare resource utilization; and more hospitalizations. Our model distinguishes between patients with and without PAH at 6 months before diagnosis and illustrates the feasibility of using routine claims data to identify patients at a population level who might benefit from PAH-specific screening and/or earlier specialist referral.

摘要

许多肺动脉高压（PAH）患者在诊断方面存在显著延迟，这与更差的预后和更高的成本相关。更早诊断PAH的工具可能会带来更早的治疗，从而可能延缓疾病进展以及包括住院和死亡在内的不良后果。我们开发了一种机器学习（ML）算法，以在症状出现过程中更早地识别有PAH风险的患者，并将他们与有类似早期症状但无PAH发病风险的患者区分开来。我们的监督式ML模型分析了来自美国Optum® Clinformatics®数据集市索赔数据库（2015年1月至2019年12月）的回顾性、去识别化数据。基于观察到的差异建立了倾向评分匹配的PAH和非PAH（对照）队列。随机森林模型用于在诊断时和诊断前6个月将患者分类为PAH或非PAH。PAH和非PAH队列分别包括1339例和4222例患者。在诊断前6个月，该模型在区分PAH和非PAH患者方面表现良好，受试者操作特征曲线下面积为0.84，召回率（敏感性）为0.73，精确率为0.50。区分PAH和非PAH队列的关键特征是从首次症状出现到诊断前模型日期（即诊断前6个月）的时间更长；更多的诊断和处方索赔、循环系统索赔以及影像检查程序，导致更高的总体医疗资源利用率；以及更多的住院治疗。我们的模型在诊断前6个月就能区分有无PAH的患者，并说明了使用常规索赔数据在人群层面识别可能从PAH特异性筛查和/或更早的专科转诊中受益的患者的可行性。

相似文献

A claims-based, machine-learning algorithm to identify patients with pulmonary arterial hypertension.一种基于索赔数据的机器学习算法，用于识别肺动脉高压患者。

Pulm Circ. 2023 Jun 6;13(2):e12237. doi: 10.1002/pul2.12237. eCollection 2023 Apr.

Development and evaluation of a predictive algorithm for unsatisfactory response among patients with pulmonary arterial hypertension using health insurance claims data.利用医疗保险索赔数据开发和评估肺动脉高压患者不良反应预测算法。

Curr Med Res Opin. 2022 Jun;38(6):1019-1030. doi: 10.1080/03007995.2022.2049162. Epub 2022 Mar 17.

Economic Burden of Delayed Diagnosis in Patients with Pulmonary Arterial Hypertension (PAH).肺动脉高压（PAH）患者延迟诊断的经济负担

Pharmacoecon Open. 2024 Jan;8(1):133-146. doi: 10.1007/s41669-023-00453-8. Epub 2023 Nov 18.

Time to diagnosis of pulmonary hypertension and diagnostic burden: A retrospective analysis of nationwide US healthcare data.肺动脉高压的诊断时间与诊断负担：对美国全国医疗保健数据的回顾性分析。

Pulm Circ. 2023 Jan 1;13(1):e12188. doi: 10.1002/pul2.12188. eCollection 2023 Jan.

Excess healthcare resource utilization and costs for commercially insured patients with pulmonary arterial hypertension: A real-world data analysis.肺动脉高压商业保险患者的医疗资源过度使用及费用：一项真实世界数据分析。

Pulm Circ. 2024 Jun 19;14(2):e12390. doi: 10.1002/pul2.12390. eCollection 2024 Apr.

Real-World Treatment Patterns Among Patients with Connective Tissue Disorder-Related Pulmonary Arterial Hypertension in the United States: A Retrospective Claims-Based Analysis.美国结缔组织疾病相关性肺动脉高压患者的真实世界治疗模式：一项回顾性基于索赔的分析。

Adv Ther. 2023 Nov;40(11):5037-5054. doi: 10.1007/s12325-023-02658-z. Epub 2023 Sep 20.

Hospitalization Among Pulmonary Arterial Hypertension Patients With and Without Connective Tissue Disease Comorbidities Prescribed Oral Selexipag.开具口服司来帕格的合并和未合并结缔组织病的肺动脉高压患者的住院情况

Rheumatol Ther. 2023 Jun;10(3):741-756. doi: 10.1007/s40744-023-00547-z. Epub 2023 Mar 23.

Hospitalization-related costs associated with oral agents targeting the prostacyclin pathway for pulmonary arterial hypertension.与靶向前列环素通路的肺动脉高压口服药物相关的住院费用。

J Med Econ. 2023 Jan-Dec;26(1):1349-1355. doi: 10.1080/13696998.2023.2254160. Epub 2023 Oct 31.

Impact of selexipag use within 12 months of pulmonary arterial hypertension diagnosis on hospitalizations and medical costs: A retrospective cohort study.肺动脉高压诊断后 12 个月内使用塞乐西帕对住院和医疗费用的影响：一项回顾性队列研究。

Clin Respir J. 2023 Dec;17(12):1209-1222. doi: 10.1111/crj.13704. Epub 2023 Oct 7.

Economic burden of illness among patients with pulmonary arterial hypertension (PAH) associated with connective tissue disorders (CTD).与结缔组织病（CTD）相关的肺动脉高压（PAH）患者的疾病经济负担。

Pulm Circ. 2023 Apr 1;13(2):e12218. doi: 10.1002/pul2.12218. eCollection 2023 Apr.

引用本文的文献

Advances in diagnosis and patient profiling in pulmonary arterial hypertension for precision medicine.肺动脉高压精准医学中诊断与患者特征分析的进展

Ther Adv Respir Dis. 2025 Jan-Dec;19:17534666251367312. doi: 10.1177/17534666251367312. Epub 2025 Aug 29.

Experimental animal models and patient-derived platforms to bridge preclinical discovery and translational therapeutics in pulmonary arterial hypertension.用于在肺动脉高压中衔接临床前发现与转化治疗的实验动物模型和患者来源平台。

J Transl Med. 2025 Jun 17;23(1):665. doi: 10.1186/s12967-025-06709-7.

The Heart of Transformation: Exploring Artificial Intelligence in Cardiovascular Disease.变革的核心：探索心血管疾病中的人工智能

Biomedicines. 2025 Feb 10;13(2):427. doi: 10.3390/biomedicines13020427.

Assessing the precision of machine learning for diagnosing pulmonary arterial hypertension: a systematic review and meta-analysis of diagnostic accuracy studies.评估机器学习诊断肺动脉高压的准确性：诊断准确性研究的系统评价和荟萃分析

Front Cardiovasc Med. 2024 Aug 27;11:1422327. doi: 10.3389/fcvm.2024.1422327. eCollection 2024.

Revolutionizing Cardiology through Artificial Intelligence-Big Data from Proactive Prevention to Precise Diagnostics and Cutting-Edge Treatment-A Comprehensive Review of the Past 5 Years.通过人工智能革新心脏病学——从主动预防到精准诊断与前沿治疗的大数据——过去五年的全面综述

Diagnostics (Basel). 2024 May 26;14(11):1103. doi: 10.3390/diagnostics14111103.

Machine learning to identify chronic cough from administrative claims data.机器学习识别行政索赔数据中的慢性咳嗽。

Sci Rep. 2024 Jan 30;14(1):2449. doi: 10.1038/s41598-024-51522-9.

本文引用的文献

A machine learning approach to identifying patients with pulmonary hypertension using real-world electronic health records.一种使用真实世界电子健康记录识别肺动脉高压患者的机器学习方法。

Int J Cardiol. 2023 Mar 1;374:95-99. doi: 10.1016/j.ijcard.2022.12.016. Epub 2022 Dec 14.

2022 ESC/ERS Guidelines for the diagnosis and treatment of pulmonary hypertension.2022年欧洲心脏病学会/欧洲呼吸学会肺动脉高压诊断和治疗指南。

Eur Respir J. 2023 Jan 6;61(1). doi: 10.1183/13993003.00879-2022. Print 2023 Jan.

An algorithm to identify cases of pulmonary arterial hypertension from the electronic medical record.一种从电子病历中识别肺动脉高压病例的算法。

Respir Res. 2022 May 28;23(1):138. doi: 10.1186/s12931-022-02055-0.

The economic burden of pulmonary arterial hypertension in Spain.西班牙肺动脉高压的经济负担。

BMC Pulm Med. 2022 Mar 26;22(1):105. doi: 10.1186/s12890-022-01906-2.

Advances in the management of pulmonary arterial hypertension.肺动脉高压的管理进展。

J Investig Med. 2021 Oct;69(7):1270-1280. doi: 10.1136/jim-2021-002027.

The promise of artificial intelligence: a review of the opportunities and challenges of artificial intelligence in healthcare.人工智能的前景：人工智能在医疗保健领域的机遇与挑战综述。

Br Med Bull. 2021 Sep 10;139(1):4-15. doi: 10.1093/bmb/ldab016.

Cost-Effectiveness of Combination Therapy for Patients With Systemic Sclerosis-Related Pulmonary Arterial Hypertension.系统性硬皮病相关肺动脉高压患者联合治疗的成本效益。

J Am Heart Assoc. 2021 Apr 6;10(7):e015816. doi: 10.1161/JAHA.119.015816. Epub 2021 Mar 24.

Burden of pulmonary arterial hypertension in England: retrospective HES database analysis.英国肺动脉高压负担：回顾性 HES 数据库分析。

Ther Adv Respir Dis. 2021 Jan-Dec;15:1753466621995040. doi: 10.1177/1753466621995040.

Evaluation of code-based algorithms to identify pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension patients in large administrative databases.在大型管理数据库中评估基于编码的算法以识别肺动脉高压和慢性血栓栓塞性肺动脉高压患者。

Pulm Circ. 2020 Nov 10;10(4):2045894020961713. doi: 10.1177/2045894020961713. eCollection 2020 Oct-Dec.

The 'great wait' for diagnosis in pulmonary arterial hypertension.肺动脉高压诊断中的“漫长等待”。

Respirology. 2020 Aug;25(8):790-792. doi: 10.1111/resp.13814. Epub 2020 Apr 1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种基于索赔数据的机器学习算法，用于识别肺动脉高压患者。

A claims-based, machine-learning algorithm to identify patients with pulmonary arterial hypertension.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献