利用电子病历数据增强基于ICD编码的心力衰竭病例定义

Enhancing ICD-Code-Based Case Definition for Heart Failure Using Electronic Medical Record Data.

作者信息

Xu Yuan, Lee Seungwon, Martin Elliot, D'souza Adam G, Doktorchik Chelsea T A, Jiang Jason, Lee Sangmin, Eastwood Cathy A, Fine Nowell, Hemmelgarn Brenda, Todd Kathryn, Quan Hude

机构信息

Department of Oncology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada; Department of Surgery, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada; Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada; Centre for Health Informatics, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.

Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada; Centre for Health Informatics, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada; Alberta Health Services, Calgary, Alberta, Canada.

出版信息

J Card Fail. 2020 Jul;26(7):610-617. doi: 10.1016/j.cardfail.2020.04.003. Epub 2020 Apr 15.

DOI:10.1016/j.cardfail.2020.04.003

PMID:32304875

Abstract

BACKGROUND

Surveillance and outcome studies for heart failure (HF) require accurate identification of patients with HF. Algorithms based on International Classification of Diseases (ICD) codes to identify HF from administrative data are inadequate owing to their relatively low sensitivity. Detailed clinical information from electronic medical records (EMRs) is potentially useful for improving ICD algorithms. This study aimed to enhance the ICD algorithm for HF definition by incorporating comprehensive information from EMRs.

METHODS

The study included 2106 inpatients in Calgary, Alberta, Canada. Medical chart review was used as the reference gold standard for evaluating developed algorithms. The commonly used ICD codes for defining HF were used (namely, the ICD algorithm). The performance of different algorithms using the free text discharge summaries from a population-based EMR were compared with the ICD algorithm. These algorithms included a keyword search algorithm looking for HF-specific terms, a machine learning-based HF concept (HFC) algorithm, an EMR structured data based algorithm, and combined algorithms (the ICD and HFC combined algorithm).

RESULTS

Of 2106 patients, 296 (14.1%) were patients with HF as determined by chart review. The ICD algorithm had 92.4% positive predictive value (PPV) but low sensitivity (57.4%). The EMR keyword search algorithm achieved a higher sensitivity (65.5%) than the ICD algorithm, but with a lower PPV (77.6%). The HFC algorithm achieved a better sensitivity (80.0%) and maintained a reasonable PPV (88.9%) compared with the ICD algorithm and the keyword algorithm. An even higher sensitivity (83.3%) was reached by combining the HFC and ICD algorithms, with a lower PPV (83.3%). The structured EMR data algorithm reached a sensitivity of 78% and a PPV of 54.2%. The combined EMR structured data and ICD algorithm had a higher sensitivity (82.4%), but the PPV remained low at 54.8%. All algorithms had a specificity ranging from 87.5% to 99.2%.

CONCLUSIONS

Applying natural language processing and machine learning on the discharge summaries of inpatient EMR data can improve the capture of cases of HF compared with the widely used ICD algorithm. The utility of the HFC algorithm is straightforward, making it easily applied for HF case identification.

摘要

背景

心力衰竭（HF）的监测和结局研究需要准确识别HF患者。基于国际疾病分类（ICD）编码从管理数据中识别HF的算法由于其相对较低的敏感性而不够完善。电子病历（EMR）中的详细临床信息可能有助于改进ICD算法。本研究旨在通过纳入EMR中的综合信息来增强用于HF定义的ICD算法。

方法

该研究纳入了加拿大艾伯塔省卡尔加里市的2106名住院患者。病历审查被用作评估所开发算法的参考金标准。使用了用于定义HF的常用ICD编码（即ICD算法）。将使用基于人群的EMR中的自由文本出院小结的不同算法的性能与ICD算法进行比较。这些算法包括寻找HF特定术语的关键词搜索算法、基于机器学习的HF概念（HFC）算法、基于EMR结构化数据的算法以及组合算法（ICD和HFC组合算法）。

结果

在2106名患者中，经病历审查确定有296名（14.1%）为HF患者。ICD算法的阳性预测值（PPV）为92.4%，但敏感性较低（57.4%）。EMR关键词搜索算法的敏感性（65.5%）高于ICD算法，但PPV较低（77.6%）。与ICD算法和关键词算法相比，HFC算法具有更好的敏感性（80.0%）并保持了合理的PPV（88.9%）。将HFC和ICD算法相结合可达到更高的敏感性（83.3%），但PPV较低（83.3%）。结构化EMR数据算法的敏感性达到78%，PPV为54.2%。EMR结构化数据与ICD组合算法具有更高的敏感性（82.4%），但PPV仍然较低，为54.8%。所有算法的特异性范围为87.5%至99.2%。

结论

与广泛使用的ICD算法相比，对住院患者EMR数据的出院小结应用自然语言处理和机器学习可以改善HF病例的捕获。HFC算法的实用性直接明了，使其易于应用于HF病例识别。

相似文献

Enhancing ICD-Code-Based Case Definition for Heart Failure Using Electronic Medical Record Data.

J Card Fail. 2020 Jul;26(7):610-617. doi: 10.1016/j.cardfail.2020.04.003. Epub 2020 Apr 15.

Cerebrovascular disease case identification in inpatient electronic medical record data using natural language processing.

Brain Inform. 2023 Sep 2;10(1):22. doi: 10.1186/s40708-023-00203-w.

Artificial intelligence approaches for phenotyping heart failure in U.S. Veterans Health Administration electronic health record.

ESC Heart Fail. 2024 Oct;11(5):3155-3166. doi: 10.1002/ehf2.14787. Epub 2024 Jun 14.

Developing an Inpatient Electronic Medical Record Phenotype for Hospital-Acquired Pressure Injuries: Case Study Using Natural Language Processing Models.

JMIR AI. 2023 Mar 8;2:e41264. doi: 10.2196/41264.

Rule-based and machine learning algorithms identify patients with systemic sclerosis accurately in the electronic health record.

Arthritis Res Ther. 2019 Dec 30;21(1):305. doi: 10.1186/s13075-019-2092-7.

Exploring the reliability of inpatient EMR algorithms for diabetes identification.

BMJ Health Care Inform. 2023 Dec 20;30(1):e100894. doi: 10.1136/bmjhci-2023-100894.

Validation of Case Finding Algorithms for Hepatocellular Cancer From Administrative Data and Electronic Health Records Using Natural Language Processing.

Med Care. 2016 Feb;54(2):e9-14. doi: 10.1097/MLR.0b013e3182a30373.

Impact of ICD10 and secular changes on electronic medical record rheumatoid arthritis algorithms.

Rheumatology (Oxford). 2020 Dec 1;59(12):3759-3766. doi: 10.1093/rheumatology/keaa198.

Hypertension identification using inpatient clinical notes from electronic medical records: an explainable, data-driven algorithm study.

CMAJ Open. 2023 Feb 14;11(1):E131-E139. doi: 10.9778/cmajo.20210170. Print 2023 Jan-Feb.

Natural Language Processing Combined with ICD-9-CM Codes as a Novel Method to Study the Epidemiology of Allergic Drug Reactions.

J Allergy Clin Immunol Pract. 2020 Mar;8(3):1032-1038.e1. doi: 10.1016/j.jaip.2019.12.007. Epub 2019 Dec 16.

引用本文的文献

Explainable mortality prediction models incorporating social health determinants and physical frailty for heart failure patients.

PLoS One. 2025 Sep 3;20(9):e0327979. doi: 10.1371/journal.pone.0327979. eCollection 2025.

Using a Healthcare Process Modeling Approach to Understand Electronic Health Records-based Pressure Injury Data and to Support Development of a Standardized Pressure Injury Phenotyping Pipeline.

AMIA Annu Symp Proc. 2025 May 22;2024:738-747. eCollection 2024.

Utilizing large language models for detecting hospital-acquired conditions: an empirical study on pulmonary embolism.

J Am Med Inform Assoc. 2025 May 1;32(5):876-884. doi: 10.1093/jamia/ocaf048.

Electronic health record in military healthcare systems: A systematic review.

PLoS One. 2025 Feb 12;20(2):e0313641. doi: 10.1371/journal.pone.0313641. eCollection 2025.

Diagnostic accuracy of case-identification algorithms for heart failure in the general population using routinely collected health data: a systematic review.

Syst Rev. 2024 Dec 24;13(1):313. doi: 10.1186/s13643-024-02717-8.

Developing an Inpatient Electronic Medical Record Phenotype for Hospital-Acquired Pressure Injuries: Case Study Using Natural Language Processing Models.

JMIR AI. 2023 Mar 8;2:e41264. doi: 10.2196/41264.

Accuracy of heart failure ascertainment using routinely collected healthcare data: a systematic review and meta-analysis.

Syst Rev. 2024 Mar 1;13(1):79. doi: 10.1186/s13643-024-02477-5.

Identifying Diabetes Related-Complications in a Real-World Free-Text Electronic Medical Records in Hebrew Using Natural Language Processing Techniques.

J Diabetes Sci Technol. 2024 Jan 30:19322968241228555. doi: 10.1177/19322968241228555.

Retrospective comparison of traditional and artificial intelligence-based heart failure phenotyping in a US health system to enable real-world evidence.

BMJ Open. 2023 Aug 9;13(8):e073178. doi: 10.1136/bmjopen-2023-073178.

Positive Predictive Value of , and , Codes for Identification of Congenital Heart Defects.

J Am Heart Assoc. 2023 Aug 15;12(16):e030821. doi: 10.1161/JAHA.123.030821. Epub 2023 Aug 7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用电子病历数据增强基于ICD编码的心力衰竭病例定义

Enhancing ICD-Code-Based Case Definition for Heart Failure Using Electronic Medical Record Data.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献