Applying Deep Learning Model to Predict Diagnosis Code of Medical Records.

Author information

Masud Jakir Hossain Bhuiyan, Kuo Chen-Cheng, Yeh Chih-Yang, Yang Hsuan-Chia, Lin Ming-Chin

Affiliations

Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei 11031, Taiwan.

International Center for Health Information Technology (ICHIT), College of Medical Science and Technology, Taipei Medical University, Taipei 11031, Taiwan.

Publication information

Diagnostics (Basel). 2023 Jul 6;13(13):2297. doi: 10.3390/diagnostics13132297.

DOI:10.3390/diagnostics13132297

PMID:37443689

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10340491/

Abstract

The International Classification of Diseases (ICD) is a diagnostic classification standard widely used as a reference system in healthcare and insurance. However, finding and assigning the right diagnosis code from a patient's medical records takes time and effort. In response, deep learning (DL) methods have been developed to assist physicians in the ICD coding process. We propose a deep learning model that uses clinical notes from medical records to predict ICD-10 codes. Our research used text-based medical data from the outpatient department (OPD) of a university hospital, collected from January to December 2016. The dataset comprised 21,953 medical records from five departments. Each record consisted of SOAP (subjective, objective, assessment, and plan) notes, a diagnosis code, and a drug list. The dataset was divided into two groups: 90% for training and 10% for testing. We applied a natural language processing (NLP) technique, word embedding with Word2Vec, to process the text, and a convolutional neural network (CNN) model was then built on the processed data. Three metrics (precision, recall, and F-score) were used to evaluate the CNN model. The model achieved clinically acceptable results across the five departments (precision: 0.53-0.96; recall: 0.85-0.99; F-score: 0.65-0.98). It performed best in the department of cardiology, with a precision of 0.95, a recall of 0.99, and an F-score of 0.98. Our proposed CNN model significantly improved prediction performance for an automated ICD-10 code prediction system based on prior clinical information. This CNN model could reduce the laborious task of manual coding and could assist physicians in making a better diagnosis.
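The abstract gives the data split and the evaluation metrics but not how they were computed. The Python sketch below is one plausible reading, not the authors' code: it holds out 10% of the records and computes one-vs-rest precision, recall, and F-score for a single ICD-10 code. The function names, the fixed seed, the rounding of the test-set size, and the toy labels are all assumptions made for illustration; only the record count (21,953) and the 90/10 ratio come from the abstract.

```python
import random


def split_records(records, test_fraction=0.10, seed=0):
    """Shuffle a copy of the records and hold out a fraction for testing.

    Hypothetical helper: the abstract states a 90%/10% split but not
    how records were assigned to each group.
    """
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def precision_recall_f1(true_codes, predicted_codes, target):
    """One-vs-rest precision, recall, and F1 for a single diagnosis code."""
    pairs = list(zip(true_codes, predicted_codes))
    tp = sum(1 for t, p in pairs if t == target and p == target)
    fp = sum(1 for t, p in pairs if t != target and p == target)
    fn = sum(1 for t, p in pairs if t == target and p != target)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Record IDs stand in for the 21,953 medical records in the dataset.
train_ids, test_ids = split_records(range(21953))
print(len(train_ids), len(test_ids))

# Toy labels invented for illustration only (not data from the study).
true_codes = ["I10", "I10", "I10", "E11", "E11", "J45"]
predicted_codes = ["I10", "I10", "E11", "E11", "I10", "J45"]
print(precision_recall_f1(true_codes, predicted_codes, "I10"))
```

The abstract does not say whether the reported department-level figures are averaged over codes or over records, so per-code scores like these would still need an averaging step that is left open here.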

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e5dd/10340491/3bb84f7a43f4/diagnostics-13-02297-g001.jpg

Similar articles

Applying Deep Learning Model to Predict Diagnosis Code of Medical Records.

Diagnostics (Basel). 2023 Jul 6;13(13):2297. doi: 10.3390/diagnostics13132297.

Deep-ADCA: Development and Validation of Deep Learning Model for Automated Diagnosis Code Assignment Using Clinical Notes in Electronic Medical Records.

J Pers Med. 2022 Apr 28;12(5):707. doi: 10.3390/jpm12050707.

Predicting Diagnosis Code from Medication List of an Electronic Medical Record Using Convolutional Neural Network.

Stud Health Technol Inform. 2020 Jun 16;270:1355-1356. doi: 10.3233/SHTI200439.

Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.

J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.

Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning.

JMIR Med Inform. 2021 Aug 31;9(8):e23230. doi: 10.2196/23230.

Applying Convolutional Neural Networks to Predict the ICD-9 Codes of Medical Records.

Sensors (Basel). 2020 Dec 11;20(24):7116. doi: 10.3390/s20247116.

Explainable automated coding of clinical notes using hierarchical label-wise attention networks and label embedding initialisation.

J Biomed Inform. 2021 Apr;116:103728. doi: 10.1016/j.jbi.2021.103728. Epub 2021 Mar 9.

Automatic International Classification of Diseases Coding System: Deep Contextualized Language Model With Rule-Based Approaches.

JMIR Med Inform. 2022 Jun 29;10(6):e37557. doi: 10.2196/37557.

An explainable CNN approach for medical codes prediction from clinical text.

BMC Med Inform Decis Mak. 2021 Nov 16;21(Suppl 9):256. doi: 10.1186/s12911-021-01615-6.

Natural language processing with deep learning for medical adverse event detection from free-text medical narratives: A case study of detecting total hip replacement dislocation.

Comput Biol Med. 2021 Feb;129:104140. doi: 10.1016/j.compbiomed.2020.104140. Epub 2020 Nov 24.

Cited by

Machine Learning and Natural Language Processing to Improve Classification of Atrial Septal Defects in Electronic Health Records.

Birth Defects Res. 2025 Mar;117(3):e2451. doi: 10.1002/bdr2.2451.

A Comparative Analysis of Machine-Learning Algorithms for Automated International Classification of Diseases (ICD)-10 Coding in Malaysian Death Records.

Cureus. 2025 Jan 12;17(1):e77342. doi: 10.7759/cureus.77342. eCollection 2025 Jan.

Forecasting Patient Early Readmission from Irish Hospital Discharge Records Using Conventional Machine Learning Models.

Diagnostics (Basel). 2024 Oct 29;14(21):2405. doi: 10.3390/diagnostics14212405.

A systematic evaluation of the performance of GPT-4 and PaLM2 to diagnose comorbidities in MIMIC-IV patients.

Health Care Sci. 2024 Feb 1;3(1):3-18. doi: 10.1002/hcs2.79. eCollection 2024 Feb.

Empowering Health Care Providers: A Collaborative Approach to Enhance Financial Performance and Productivity in Clinical Practice.

Neurol Clin Pract. 2024 Oct;14(5):e200314. doi: 10.1212/CPJ.0000000000200314. Epub 2024 Jun 11.

Robust diagnosis recommendation system for Primary Care Telemedicine using long short-term memory multi-class sequence classification.

Heliyon. 2024 Feb 29;10(6):e26770. doi: 10.1016/j.heliyon.2024.e26770. eCollection 2024 Mar 30.
