• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多层注意力 BiRNN 的中文临床记录自动 ICD 编码分配。

Automatic ICD code assignment of Chinese clinical notes based on multilayer attention BiRNN.

机构信息

School of Computer Science and Engineering, Central South University, Changsha 410083, China; School of Computer Science and Technology, University of South China, Hengyang 421001, China.

School of Computer Science and Engineering, Central South University, Changsha 410083, China.

出版信息

J Biomed Inform. 2019 Mar;91:103114. doi: 10.1016/j.jbi.2019.103114. Epub 2019 Feb 12.

DOI:10.1016/j.jbi.2019.103114
PMID:30768971
Abstract

International Classification of Diseases (ICD) code is an important label of electronic health record. The automatic ICD code assignment based on the narrative of clinical documents is an essential task which has drawn much attention recently. When Chinese clinical notes are the input corpus, the nature of Chinese brings some issues that need to be considered, such as the accuracy of word segmentation and the representation of single Chinese characters which contain semantics. Taking the lengthy text of patient notes and the representation of Chinese words into account, we present a multilayer attention bidirectional recurrent neural network (MA-BiRNN) model to implement the assignment of disease codes. A hierarchical approach is used to represent the feature of discharge summaries without manual feature engineering. The combination of character level embedding and word level embedding can improve the representation of words. Attention mechanism is introduced into bidirectional long short term memory networks, which helps to solve the performance dropping problem when plain recurrent neural networks encounter long text sequences. The experiment is carried out on a real-world dataset containing 7732 admission records in Chinese and 1177 unique ICD-10 labels. The proposed model achieves 0.639 and 0.766 in F1-score on full-level code and block-level code, respectively. It outperforms the baseline neural network models and achieves the lowest Hamming loss value. Ablation analysis indicates that the multilevel attention mechanism plays a decisive role in the system for dealing with Chinese clinical notes.

摘要

国际疾病分类(ICD)代码是电子健康记录的重要标签。基于临床文档的叙述自动分配 ICD 代码是一项重要任务,最近引起了广泛关注。当中文临床记录作为输入语料库时,中文的特点带来了一些需要考虑的问题,例如分词的准确性和包含语义的单个汉字的表示。考虑到患者记录的冗长文本和中文单词的表示,我们提出了一种多层注意力双向递归神经网络(MA-BiRNN)模型来实现疾病代码的分配。采用分层方法来表示出院小结的特征,无需手动特征工程。字符级嵌入和单词级嵌入的组合可以提高单词的表示能力。注意力机制被引入到双向长短期记忆网络中,有助于解决当朴素递归神经网络遇到长文本序列时性能下降的问题。实验在一个包含 7732 条中文入院记录和 1177 个独特 ICD-10 标签的真实数据集上进行。所提出的模型在全级别代码和块级别代码上的 F1 得分分别达到 0.639 和 0.766,优于基线神经网络模型,并实现了最低的汉明损失值。消融分析表明,多层次注意力机制在处理中文临床记录的系统中起着决定性的作用。

相似文献

1
Automatic ICD code assignment of Chinese clinical notes based on multilayer attention BiRNN.基于多层注意力 BiRNN 的中文临床记录自动 ICD 编码分配。
J Biomed Inform. 2019 Mar;91:103114. doi: 10.1016/j.jbi.2019.103114. Epub 2019 Feb 12.
2
Medical code prediction via capsule networks and ICD knowledge.基于胶囊网络和 ICD 知识的医疗编码预测。
BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):55. doi: 10.1186/s12911-021-01426-9.
3
An explainable CNN approach for medical codes prediction from clinical text.一种用于从临床文本预测医疗编码的可解释 CNN 方法。
BMC Med Inform Decis Mak. 2021 Nov 16;21(Suppl 9):256. doi: 10.1186/s12911-021-01615-6.
4
Medical Named Entity Extraction from Chinese Resident Admit Notes Using Character and Word Attention-Enhanced Neural Network.基于字符和词注意力增强神经网络的中文住院病案中医学命名实体抽取
Int J Environ Res Public Health. 2020 Mar 2;17(5):1614. doi: 10.3390/ijerph17051614.
5
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.
6
Detecting negation and scope in Chinese clinical notes using character and word embedding.使用字符和词嵌入检测中文临床记录中的否定和范围
Comput Methods Programs Biomed. 2017 Mar;140:53-59. doi: 10.1016/j.cmpb.2016.11.009. Epub 2016 Nov 23.
7
Hyperbolic graph convolutional neural network with contrastive learning for automated ICD coding.基于对比学习的双曲图卷积神经网络在自动化 ICD 编码中的应用。
Comput Biol Med. 2024 Jan;168:107797. doi: 10.1016/j.compbiomed.2023.107797. Epub 2023 Dec 1.
8
JLAN: medical code prediction via joint learning attention networks and denoising mechanism.JLAN:基于联合学习注意力网络和去噪机制的医疗编码预测。
BMC Bioinformatics. 2021 Dec 13;22(1):590. doi: 10.1186/s12859-021-04520-x.
9
Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning.自动ICD - 10编码与训练系统:基于监督学习的深度神经网络
JMIR Med Inform. 2021 Aug 31;9(8):e23230. doi: 10.2196/23230.
10
Explainable automated coding of clinical notes using hierarchical label-wise attention networks and label embedding initialisation.使用分层标签分类注意力网络和标签嵌入初始化来实现临床笔记的可解释自动化编码。
J Biomed Inform. 2021 Apr;116:103728. doi: 10.1016/j.jbi.2021.103728. Epub 2021 Mar 9.

引用本文的文献

1
AnEMIC: A Framework for Benchmarking ICD Coding Models.贫血:一种用于对ICD编码模型进行基准测试的框架。
Proc Conf Empir Methods Nat Lang Process. 2022 Dec;2022(SD):109-120. doi: 10.18653/v1/2022.emnlp-demos.11.
2
Systematic evaluation of common natural language processing techniques to codify clinical notes.系统评估常见的自然语言处理技术以对临床记录进行编码。
PLoS One. 2024 Mar 7;19(3):e0298892. doi: 10.1371/journal.pone.0298892. eCollection 2024.
3
Research on performance variations of classifiers with the influence of pre-processing methods for Chinese short text classification.
中文短文本分类中预处理方法对分类器性能变化的影响研究。
PLoS One. 2023 Oct 12;18(10):e0292582. doi: 10.1371/journal.pone.0292582. eCollection 2023.
4
Automatic ICD-10 coding: Deep semantic matching based on analogical reasoning.自动ICD-10编码:基于类比推理的深度语义匹配
Heliyon. 2023 Apr 19;9(4):e15570. doi: 10.1016/j.heliyon.2023.e15570. eCollection 2023 Apr.
5
Automated ICD coding for coronary heart diseases by a deep learning method.一种基于深度学习方法的冠心病自动ICD编码
Heliyon. 2023 Feb 27;9(3):e14037. doi: 10.1016/j.heliyon.2023.e14037. eCollection 2023 Mar.
6
A Curriculum Batching Strategy for Automatic ICD Coding with Deep Multi-Label Classification Models.一种用于深度多标签分类模型的自动ICD编码的课程批处理策略
Healthcare (Basel). 2022 Nov 29;10(12):2397. doi: 10.3390/healthcare10122397.
7
Comparison of different feature extraction methods for applicable automated ICD coding.不同特征提取方法在适用的自动化 ICD 编码中的比较。
BMC Med Inform Decis Mak. 2022 Jan 12;22(1):11. doi: 10.1186/s12911-022-01753-5.
8
Identification of early mild cognitive impairment using multi-modal data and graph convolutional networks.使用多模态数据和图卷积网络识别早期轻度认知障碍。
BMC Bioinformatics. 2020 Nov 18;21(Suppl 6):123. doi: 10.1186/s12859-020-3437-6.
9
Explainable Prediction of Medical Codes With Knowledge Graphs.利用知识图谱对医学编码进行可解释预测。
Front Bioeng Biotechnol. 2020 Aug 14;8:867. doi: 10.3389/fbioe.2020.00867. eCollection 2020.
10
Construction of a semi-automatic ICD-10 coding system.构建一个半自动 ICD-10 编码系统。
BMC Med Inform Decis Mak. 2020 Apr 15;20(1):67. doi: 10.1186/s12911-020-1085-4.