

Automated vs. manual coding of neuroimaging reports via natural language processing, using the International Classification of Diseases, Tenth Revision.

Author information

McKinney Alexander M, Moore Jessica A, Campbell Kevin, Braga Thiago A, Rykken Jeffrey B, Jagadeesan Bharathi D, McKinney Zeke J

Affiliations

Department of Radiology, University of Miami-Miller School of Medicine, Miami, FL, USA.

University of Minnesota, St. Paul, Minnesota, USA.

Publication information

Heliyon. 2024 May 7;10(10):e30106. doi: 10.1016/j.heliyon.2024.e30106. eCollection 2024 May 30.

Abstract

OBJECTIVE

Natural language processing (NLP) can generate diagnosis codes from imaging reports. Meanwhile, International Classification of Diseases, Tenth Revision (ICD-10) codes are the United States' standard for billing and coding, enabling the tracking of disease burden and outcomes. This cross-sectional study aimed to test the feasibility of an NLP algorithm by evaluating its performance and comparing it with manual coding by radiologists and other physicians.

METHODS

Three neuroradiologists and one non-radiologist physician manually coded 200 randomly selected craniospinal CT and MRI reports drawn from a pool of more than 10,000. The NLP algorithm subdivided each report's Impression into "phrases," with multiple candidate ICD-10 matches for each phrase. Viewing only the Impression, the physician reviewers selected the single best ICD-10 code for each phrase. Codes selected by the physicians and by the algorithm were then compared for agreement.
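The phrase-splitting and code-matching steps described above can be sketched as follows. This is a minimal, hypothetical illustration, not the study's engine: the `ICD10_LOOKUP` table, the regex-based splitter, and the substring matcher are all simplified stand-ins for whatever phrase segmentation and ICD-10 indexing the actual algorithm used.

```python
import re

# Toy ICD-10 lookup table (illustrative entries only).
ICD10_LOOKUP = {
    "intracranial hemorrhage": "I62.9",  # nontraumatic intracranial hemorrhage, unspecified
    "cerebral infarction": "I63.9",      # cerebral infarction, unspecified
}

def split_impression(impression: str) -> list[str]:
    """Split an Impression section into numbered/sentence-level phrases."""
    parts = re.split(r"(?:\d+\.\s*)|(?<=\.)\s+", impression)
    return [p.strip().rstrip(".").lower() for p in parts if p.strip()]

def match_codes(phrase: str) -> list[str]:
    """Return candidate ICD-10 codes whose key terms appear in the phrase.

    A real engine would rank multiple candidates per phrase and handle
    negation (e.g. "no intracranial hemorrhage" as a pertinent negative);
    this naive substring match does neither.
    """
    return [code for term, code in ICD10_LOOKUP.items() if term in phrase]

impression = "1. Acute cerebral infarction. 2. No intracranial hemorrhage."
for phrase in split_impression(impression):
    print(phrase, "->", match_codes(phrase))
```

Note that the second phrase, a pertinent negative, matches the hemorrhage code here; the abstract indicates the actual engine tabulated such findings separately as negative codes.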

RESULTS

The algorithm extracted the reports' Impressions into 645 phrases, each with ranked ICD-10 matches. Pairwise agreement among the reviewers' selected codes was unreliable ( = ). Using unanimous reviewer agreement as "ground truth," the algorithm's sensitivity/specificity/F2 for the top 5 codes was , and for the single best code was . The engine tabulated "pertinent negatives" as negative codes for stated findings (e.g., "no intracranial hemorrhage"). The engine's matching was more specific for shortened than for full-length ICD-10 codes ( = ).

CONCLUSIONS

Manual coding by physician reviewers shows significant variability and is time-consuming, while the NLP algorithm's top 5 diagnosis codes are relatively accurate. This preliminary work demonstrates the feasibility of, and potential for, generating codes with reliability and consistency. Future work may include correlating diagnosis codes with clinical encounter codes to evaluate imaging's impact on, and relevance to, care.


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691e/11126795/c1b3d641396c/gr1a.jpg
