Multigranularity Label Prediction Model for Automatic International Classification of Diseases Coding in Clinical Text.

Affiliations

Hunan Provincial Key Laboratory on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, P.R. China.

School of Computer Science, University of South China, Hengyang, P.R. China.

Publication information

J Comput Biol. 2023 Aug;30(8):900-911. doi: 10.1089/cmb.2023.0096. Epub 2023 Jul 31.

DOI: 10.1089/cmb.2023.0096
PMID: 37523219

Abstract

The International Classification of Diseases (ICD) serves as the foundation for generating comparable global disease statistics across regions and over time. ICD coding involves assigning codes to diseases based on clinical notes, so that a patient's condition is described in a standard way. However, this process is complicated by the vast number of codes and the intricate taxonomy of ICD codes, which are hierarchically organized into various levels, including chapter, category, subcategory, and its subdivisions. Many existing studies focus solely on predicting subcategory codes, ignoring the hierarchical relationships among codes. To address this limitation, we propose a multitask learning model that trains multiple classifiers for different code levels, while also capturing the relations between coarser and finer-grained labels through a reinforcement mechanism. Our approach is evaluated on both English and Chinese benchmark datasets, and we demonstrate that our method achieves performance competitive with baseline models, particularly in terms of macro-F1 results. These findings suggest that our approach effectively leverages the hierarchical structure of ICD codes to improve disease code prediction accuracy. Analysis of the attention mechanism shows that the multigranularity attention of our model captures crucial features of the input text at different granularity levels, which can provide reasonable explanations for the prediction results.
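To make the idea of code levels concrete, the sketch below shows one way a set of gold codes for a clinical note could be expanded into coarser label sets, one per level, which is the kind of target a multitask setup with one classifier per level would need. It is only an illustration under stated assumptions: it uses ICD-10 style codes (the abstract does not say which ICD revision each dataset uses), the chapter table is deliberately partial, and the function names `category_of`, `chapter_of`, and `multigranularity_labels` are hypothetical. It does not reproduce the paper's model or its reinforcement mechanism.

```python
from typing import Dict, List, Optional, Tuple

# Partial, illustrative table of ICD-10 chapter ranges (assumption: ICD-10
# style codes). A real system would load the full official classification.
CHAPTER_RANGES: List[Tuple[str, str, str]] = [
    ("A00", "B99", "I"),
    ("C00", "D48", "II"),
    ("E00", "E90", "IV"),
    ("I00", "I99", "IX"),
    ("J00", "J99", "X"),
]


def category_of(code: str) -> str:
    """Return the three-character category, e.g. 'E11.9' -> 'E11'."""
    return code.split(".")[0][:3].upper()


def chapter_of(code: str) -> Optional[str]:
    """Return the chapter numeral, or None if the partial table lacks it."""
    cat = category_of(code)
    for low, high, chapter in CHAPTER_RANGES:
        # Categories are one letter plus two digits, so plain string
        # comparison orders them correctly within a range.
        if low <= cat <= high:
            return chapter
    return None


def multigranularity_labels(codes: List[str]) -> Dict[str, List[str]]:
    """Expand full codes into one sorted label set per granularity level."""
    chapters = {chapter_of(code) for code in codes}
    return {
        "chapter": sorted(c for c in chapters if c is not None),
        "category": sorted({category_of(code) for code in codes}),
        "full_code": sorted({code.upper() for code in codes}),
    }


if __name__ == "__main__":
    # Hypothetical gold codes for a single clinical note.
    print(multigranularity_labels(["E11.9", "I10", "E11.65"]))
```

In a setup like this, each level's label set would typically be turned into a multi-hot vector and paired with its own classifier head, so that two fine-grained codes sharing a category (here `E11.9` and `E11.65`) also share a coarser training signal.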

Similar articles

1
Multigranularity Label Prediction Model for Automatic International Classification of Diseases Coding in Clinical Text.
J Comput Biol. 2023 Aug;30(8):900-911. doi: 10.1089/cmb.2023.0096. Epub 2023 Jul 31.
2
Automated ICD-10 code assignment of nonstandard diagnoses via a two-stage framework.
Artif Intell Med. 2020 Aug;108:101939. doi: 10.1016/j.artmed.2020.101939. Epub 2020 Aug 15.
3
An explainable CNN approach for medical codes prediction from clinical text.
BMC Med Inform Decis Mak. 2021 Nov 16;21(Suppl 9):256. doi: 10.1186/s12911-021-01615-6.
4
Creating a computer assisted ICD coding system: Performance metric choice and use of the ICD hierarchy.
J Biomed Inform. 2024 Apr;152:104617. doi: 10.1016/j.jbi.2024.104617. Epub 2024 Mar 1.
5
Explainable automated coding of clinical notes using hierarchical label-wise attention networks and label embedding initialisation.
J Biomed Inform. 2021 Apr;116:103728. doi: 10.1016/j.jbi.2021.103728. Epub 2021 Mar 9.
6
A Pseudo Label-Wise Attention Network for Automatic ICD Coding.
IEEE J Biomed Health Inform. 2022 Oct;26(10):5201-5212. doi: 10.1109/JBHI.2022.3193291. Epub 2022 Oct 5.
7
Automatic International Classification of Diseases Coding via Note-Code Interaction Network with Denoising Mechanism.
J Comput Biol. 2023 Aug;30(8):912-925. doi: 10.1089/cmb.2023.0079.
8
Automatic ICD code assignment of Chinese clinical notes based on multilayer attention BiRNN.
J Biomed Inform. 2019 Mar;91:103114. doi: 10.1016/j.jbi.2019.103114. Epub 2019 Feb 12.
9
Development of a Method for Automatic Matching of Unstructured Medical Data to ICD-10 Codes.
Stud Health Technol Inform. 2024 May 23;314:93-97. doi: 10.3233/SHTI240065.
10
Towards automated clinical coding.
Int J Med Inform. 2018 Dec;120:50-61. doi: 10.1016/j.ijmedinf.2018.09.021. Epub 2018 Oct 2.

Cited by

1
Adoption of network and plan-do-check-action in the international classification of disease 10 coding.
World J Clin Cases. 2024 Jul 6;12(19):3734-3743. doi: 10.12998/wjcc.v12.i19.3734.