使用自然语言处理技术从叙述性临床记录中挖掘外周动脉疾病病例。

Mining peripheral arterial disease cases from narrative clinical notes using natural language processing.

作者信息

Afzal Naveed, Sohn Sunghwan, Abram Sara, Scott Christopher G, Chaudhry Rajeev, Liu Hongfang, Kullo Iftikhar J, Arruda-Olson Adelaide M

机构信息

Department of Health Sciences Research, Mayo Clinic, Rochester, Minn.

Department of Cardiovascular Diseases, Mayo Clinic, Rochester, Minn.

出版信息

J Vasc Surg. 2017 Jun;65(6):1753-1761. doi: 10.1016/j.jvs.2016.11.031. Epub 2017 Feb 8.

DOI:10.1016/j.jvs.2016.11.031

PMID:28189359

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5438905/

Abstract

OBJECTIVE

Lower extremity peripheral arterial disease (PAD) is highly prevalent and affects millions of individuals worldwide. We developed a natural language processing (NLP) system for automated ascertainment of PAD cases from clinical narrative notes and compared the performance of the NLP algorithm with billing code algorithms, using ankle-brachial index test results as the gold standard.

METHODS

We compared the performance of the NLP algorithm to (1) results of gold standard ankle-brachial index; (2) previously validated algorithms based on relevant International Classification of Diseases, Ninth Revision diagnostic codes (simple model); and (3) a combination of International Classification of Diseases, Ninth Revision codes with procedural codes (full model). A dataset of 1569 patients with PAD and controls was randomly divided into training (n = 935) and testing (n = 634) subsets.

RESULTS

We iteratively refined the NLP algorithm in the training set including narrative note sections, note types, and service types, to maximize its accuracy. In the testing dataset, when compared with both simple and full models, the NLP algorithm had better accuracy (NLP, 91.8%; full model, 81.8%; simple model, 83%; P < .001), positive predictive value (NLP, 92.9%; full model, 74.3%; simple model, 79.9%; P < .001), and specificity (NLP, 92.5%; full model, 64.2%; simple model, 75.9%; P < .001).

CONCLUSIONS

A knowledge-driven NLP algorithm for automatic ascertainment of PAD cases from clinical notes had greater accuracy than billing code algorithms. Our findings highlight the potential of NLP tools for rapid and efficient ascertainment of PAD cases from electronic health records to facilitate clinical investigation and eventually improve care by clinical decision support.

摘要

目的

下肢外周动脉疾病（PAD）极为常见，影响着全球数百万人。我们开发了一种自然语言处理（NLP）系统，用于从临床记录中自动确定PAD病例，并以踝臂指数测试结果作为金标准，将NLP算法的性能与计费代码算法进行比较。

方法

我们将NLP算法的性能与以下各项进行比较：（1）金标准踝臂指数的结果；（2）基于相关国际疾病分类第九版诊断代码的先前验证算法（简单模型）；以及（3）国际疾病分类第九版代码与程序代码的组合（完整模型）。将1569例PAD患者和对照的数据集随机分为训练子集（n = 935）和测试子集（n = 634）。

结果

我们在训练集中对NLP算法进行了迭代优化，包括病历记录部分、记录类型和服务类型，以最大限度提高其准确性。在测试数据集中，与简单模型和完整模型相比，NLP算法具有更高的准确性（NLP为91.8%；完整模型为81.8%；简单模型为83%；P <.001）、阳性预测值（NLP为92.9%；完整模型为74.3%；简单模型为79.9%；P <.001）和特异性（NLP为92.5%；完整模型为64.2%；简单模型为75.9%；P <.001）。

结论

一种用于从临床记录中自动确定PAD病例的知识驱动型NLP算法比计费代码算法具有更高的准确性。我们的研究结果凸显了NLP工具从电子健康记录中快速高效确定PAD病例的潜力，有助于临床研究，并最终通过临床决策支持改善医疗护理。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d9c6/5438905/77f599af0d37/nihms838952f1.jpg

相似文献

Mining peripheral arterial disease cases from narrative clinical notes using natural language processing.使用自然语言处理技术从叙述性临床记录中挖掘外周动脉疾病病例。

J Vasc Surg. 2017 Jun;65(6):1753-1761. doi: 10.1016/j.jvs.2016.11.031. Epub 2017 Feb 8.

Natural language processing of clinical notes for identification of critical limb ischemia.临床记录的自然语言处理以识别严重肢体缺血。

Int J Med Inform. 2018 Mar;111:83-89. doi: 10.1016/j.ijmedinf.2017.12.024. Epub 2017 Dec 28.

Use of Deep Learning to Identify Peripheral Arterial Disease Cases From Narrative Clinical Notes.利用深度学习从临床病历中识别外周动脉疾病病例。

J Surg Res. 2024 Nov;303:699-708. doi: 10.1016/j.jss.2024.09.062. Epub 2024 Oct 24.

Administrative data are not sensitive for the detection of peripheral artery disease in the community.行政数据对于社区中周围动脉疾病的检测并不敏感。

Vasc Med. 2016 Aug;21(4):331-6. doi: 10.1177/1358863X16631041. Epub 2016 Apr 25.

Ankle- and Toe-Brachial Index for Peripheral Artery Disease Identification: Unlocking Clinical Data Through Novel Methods.踝臂指数和趾臂指数在周围动脉疾病识别中的应用：通过新方法挖掘临床数据。

Circ Cardiovasc Interv. 2022 Mar;15(3):e011092. doi: 10.1161/CIRCINTERVENTIONS.121.011092. Epub 2022 Feb 18.

Billing code algorithms to identify cases of peripheral artery disease from administrative data.利用计费代码算法从管理数据中识别外周动脉疾病病例。

J Am Med Inform Assoc. 2013 Dec;20(e2):e349-54. doi: 10.1136/amiajnl-2013-001827. Epub 2013 Oct 28.

De Novo Natural Language Processing Algorithm Accurately Identifies Myxofibrosarcoma From Pathology Reports.全新自然语言处理算法可从病理报告中准确识别黏液纤维肉瘤。

Clin Orthop Relat Res. 2025 Jan 1;483(1):80-87. doi: 10.1097/CORR.0000000000003270. Epub 2024 Oct 2.

The use of natural language processing to identify vaccine-related anaphylaxis at five health care systems in the Vaccine Safety Datalink.利用自然语言处理技术在疫苗安全数据链中的五个医疗系统中识别与疫苗相关的过敏反应。

Pharmacoepidemiol Drug Saf. 2020 Feb;29(2):182-188. doi: 10.1002/pds.4919. Epub 2019 Dec 3.

Ascertainment of Delirium Status Using Natural Language Processing From Electronic Health Records.使用电子健康记录中的自然语言处理来确定谵妄状态。

J Gerontol A Biol Sci Med Sci. 2022 Mar 3;77(3):524-530. doi: 10.1093/gerona/glaa275.

Use of Natural Language Processing to Improve Identification of Patients With Peripheral Artery Disease.利用自然语言处理提高外周动脉疾病患者的识别率。

Circ Cardiovasc Interv. 2020 Oct;13(10):e009447. doi: 10.1161/CIRCINTERVENTIONS.120.009447. Epub 2020 Oct 12.

引用本文的文献

Decoding Immunodeficiencies with Artificial Intelligence: A New Era of Precision Medicine.利用人工智能解码免疫缺陷：精准医学的新时代。

Biomedicines. 2025 Jul 28;13(8):1836. doi: 10.3390/biomedicines13081836.

Natural Language Processing framework for identifying abdominal aortic aneurysm repairs using unstructured electronic health records.使用非结构化电子健康记录识别腹主动脉瘤修复手术的自然语言处理框架。

Sci Rep. 2025 Jul 21;15(1):26388. doi: 10.1038/s41598-025-11870-6.

pyDeid: an improved, fast, flexible, and generalizable rule-based approach for deidentification of free-text medical records.pyDeid：一种用于对自由文本医疗记录进行去识别处理的经过改进的、快速、灵活且可推广的基于规则的方法。

JAMIA Open. 2025 Jan 22;8(1):ooae152. doi: 10.1093/jamiaopen/ooae152. eCollection 2025 Feb.

Current Applications and Future Perspectives of Artificial and Biomimetic Intelligence in Vascular Surgery and Peripheral Artery Disease.人工智能与仿生智能在血管外科和外周动脉疾病中的当前应用及未来展望

Biomimetics (Basel). 2024 Aug 1;9(8):465. doi: 10.3390/biomimetics9080465.

Convolutional Neural Networks to Study Contrast-Enhanced Magnetic Resonance Imaging-Based Skeletal Calf Muscle Perfusion in Peripheral Artery Disease.利用卷积神经网络研究基于对比增强磁共振成像的外周动脉疾病小腿骨骼肌灌注情况

Am J Cardiol. 2024 Jun 1;220:56-66. doi: 10.1016/j.amjcard.2024.03.035. Epub 2024 Apr 3.

Artificial Intelligence of Arterial Doppler Waveforms to Predict Major Adverse Outcomes Among Patients Evaluated for Peripheral Artery Disease.利用动脉多普勒波形人工智能预测接受外周动脉疾病评估患者的主要不良结局

J Am Heart Assoc. 2024 Feb 6;13(3):e031880. doi: 10.1161/JAHA.123.031880. Epub 2024 Jan 19.

Applying Natural Language Processing to Textual Data From Clinical Data Warehouses: Systematic Review.将自然语言处理应用于临床数据仓库中的文本数据：系统评价。

JMIR Med Inform. 2023 Dec 15;11:e42477. doi: 10.2196/42477.

Fusion Modeling: Combining Clinical and Imaging Data to Advance Cardiac Care.融合建模：结合临床和影像数据以推进心脏护理。

Circ Cardiovasc Imaging. 2023 Dec;16(12):e014533. doi: 10.1161/CIRCIMAGING.122.014533. Epub 2023 Dec 11.

A machine learning-based approach to identify peripheral artery disease using texture features from contrast-enhanced magnetic resonance imaging.基于机器学习的方法，利用对比增强磁共振成像的纹理特征来识别外周动脉疾病。

Magn Reson Imaging. 2024 Feb;106:31-42. doi: 10.1016/j.mri.2023.11.014. Epub 2023 Dec 6.

Scalable and interpretable alternative to chart review for phenotype evaluation using standardized structured data from electronic health records.利用电子健康记录中的标准化结构化数据进行表型评估的可扩展且可解释的图表审查替代方法。

J Am Med Inform Assoc. 2023 Dec 22;31(1):119-129. doi: 10.1093/jamia/ocad202.

本文引用的文献

CLINICAL PRACTICE. Peripheral Artery Disease.临床实践。外周动脉疾病

N Engl J Med. 2016 Mar 3;374(9):861-71. doi: 10.1056/NEJMcp1507631.

An information extraction framework for cohort identification using electronic health records.一种使用电子健康记录进行队列识别的信息提取框架。

AMIA Jt Summits Transl Sci Proc. 2013 Mar 18;2013:149-53. eCollection 2013.

Billing code algorithms to identify cases of peripheral artery disease from administrative data.利用计费代码算法从管理数据中识别外周动脉疾病病例。

J Am Med Inform Assoc. 2013 Dec;20(e2):e349-54. doi: 10.1136/amiajnl-2013-001827. Epub 2013 Oct 28.

Data resource profile: the Rochester Epidemiology Project (REP) medical records-linkage system.数据资源简介：罗切斯特流行病学项目（REP）医疗记录链接系统。

Int J Epidemiol. 2012 Dec;41(6):1614-24. doi: 10.1093/ije/dys195. Epub 2012 Nov 18.

A call to action: women and peripheral artery disease: a scientific statement from the American Heart Association.行动呼吁：女性与外周动脉疾病：美国心脏协会的科学声明

Circulation. 2012 Mar 20;125(11):1449-72. doi: 10.1161/CIR.0b013e31824c39ba. Epub 2012 Feb 15.

Discovering peripheral arterial disease cases from radiology notes using natural language processing.使用自然语言处理技术从放射学记录中发现外周动脉疾病病例。

AMIA Annu Symp Proc. 2010 Nov 13;2010:722-6.

ACCF/AHA/ACR/SCAI/SIR/SVM/SVN/SVS 2010 performance measures for adults with peripheral artery disease: a report of the American College of Cardiology Foundation/American Heart Association Task Force on Performance Measures, the American College of Radiology, the Society for Cardiac Angiography and Interventions, the Society for Interventional Radiology, the Society for Vascular Medicine, the Society for Vascular Nursing, and the Society for Vascular Surgery (Writing Committee to Develop Clinical Performance Measures for Peripheral Artery Disease).ACCF/AHA/ACR/SCAI/SIR/SVM/SVN/SVS 2010年外周动脉疾病成人患者的性能指标：美国心脏病学会基金会/美国心脏协会性能指标特别工作组、美国放射学会、心脏血管造影和介入学会、介入放射学会、血管医学学会、血管护理学会以及血管外科学会（外周动脉疾病临床性能指标制定写作委员会）报告

J Am Coll Cardiol. 2010 Dec 14;56(25):2147-81. doi: 10.1016/j.jacc.2010.08.606.

Use of International Classification of Diseases, Ninth Revision, Clinical Modification codes and medication use data to identify nosocomial Clostridium difficile infection.利用国际疾病分类，第九修订版，临床修正码和药物使用数据来识别医院获得性艰难梭菌感染。

Infect Control Hosp Epidemiol. 2009 Nov;30(11):1070-6. doi: 10.1086/606164.

The influence of peripheral arterial disease on outcomes: a pooled analysis of mortality in eight large randomized percutaneous coronary intervention trials.外周动脉疾病对预后的影响：八项大型随机经皮冠状动脉介入试验中死亡率的汇总分析。

J Am Coll Cardiol. 2006 Oct 17;48(8):1567-72. doi: 10.1016/j.jacc.2006.03.067. Epub 2006 Sep 26.

Treatment of peripheral arterial disease--extending "intervention" to "therapeutic choice".外周动脉疾病的治疗——将“干预”扩展至“治疗选择”

N Engl J Med. 2006 May 4;354(18):1944-7. doi: 10.1056/NEJMe068037.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验