• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

增强临床文本的 ICD-10 编码分配:基于总结的方法。

Enhanced ICD-10 code assignment of clinical texts: A summarization-based approach.

机构信息

College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, 310027 Hangzhou, Zhejiang Province, China.

Department of Information, Hainan Hospital of Chinese PLA General Hospital, Haitang Bay, 572013 Sanya, Hainan Province, China.

出版信息

Artif Intell Med. 2024 Oct;156:102967. doi: 10.1016/j.artmed.2024.102967. Epub 2024 Aug 20.

DOI:10.1016/j.artmed.2024.102967
PMID:39208710
Abstract

BACKGROUND

Assigning International Classification of Diseases (ICD) codes to clinical texts is a common and crucial practice in patient classification, hospital management, and further statistics analysis. Current auto-coding methods mainly transfer this task to a multi-label classification problem. Such solutions are suffering from high-dimensional mapping space and excessive redundant information in long clinical texts. To alleviate such a situation, we introduce text summarization methods to the ICD coding regime and apply text matching to select ICD codes.

METHOD

We focus on the tenth revision of the ICD (ICD-10) coding and design a novel summarization-based approach (SuM) with an end-to-end strategy to efficiently assign ICD-10 code to clinical texts. In this approach, a knowledge-guided pointer network is purposed to distill and summarize key information in clinical texts precisely. Then a matching model with matching-aggregation architecture follows to align the summary result with code, tuning the one-vs-all scenario to one-vs-one matching so that the large-label-space obstacle laid in classification approaches would be avoided.

RESULT

The 12,788 ICD-10 coded discharge summaries from a Chinese hospital were collected to evaluate the proposed approach. Compared with existing methods, the purposed model achieves the greatest coding results with Micro AUC of 0.9548, MRR@10 of 0.7977, Precision@10 of 0.0944, and Recall@10 of 0.9439 for the TOP-50 Dataset. Results on the FULL-Dataset remain consistent. Also, the proposed knowledge encoder and applied end-to-end strategy are proven to facilitate the whole model to gain efficacy in selecting the most suitable code.

CONCLUSION

The proposed automatic ICD-10 code assignment approach via text summarization can effectively capture critical messages in long clinical texts and improve the performance of ICD-10 coding of clinical texts.

摘要

背景

将国际疾病分类(ICD)代码分配给临床文本是患者分类、医院管理和进一步统计分析的常见且关键的实践。当前的自动编码方法主要将此任务转换为多标签分类问题。此类解决方案在长临床文本中面临高维映射空间和过多冗余信息的问题。为了缓解这种情况,我们将文本摘要方法引入 ICD 编码方案,并应用文本匹配来选择 ICD 代码。

方法

我们专注于 ICD-10 编码的第十版,设计了一种基于摘要的新方法(SuM),采用端到端策略,有效地将 ICD-10 代码分配给临床文本。在这种方法中,提出了一种知识引导的指针网络,精确地提取和总结临床文本中的关键信息。然后,紧随其后的是一个具有匹配-聚合架构的匹配模型,将摘要结果与代码对齐,将一对一的场景调整为一对一的匹配,从而避免了分类方法中存在的大标签空间障碍。

结果

从一家中国医院收集了 12788 个 ICD-10 编码的出院小结来评估所提出的方法。与现有方法相比,所提出的模型在 TOP-50 数据集上取得了最大的编码结果,微 AUC 为 0.9548,MRR@10 为 0.7977,Precision@10 为 0.0944,Recall@10 为 0.9439,对于 TOP-50 数据集。FULL-Dataset 的结果仍然一致。此外,所提出的知识编码器和应用的端到端策略被证明有助于整个模型有效地选择最合适的代码。

结论

通过文本摘要自动分配 ICD-10 代码的方法可以有效地捕获长临床文本中的关键信息,并提高临床文本 ICD-10 编码的性能。

相似文献

1
Enhanced ICD-10 code assignment of clinical texts: A summarization-based approach.增强临床文本的 ICD-10 编码分配:基于总结的方法。
Artif Intell Med. 2024 Oct;156:102967. doi: 10.1016/j.artmed.2024.102967. Epub 2024 Aug 20.
2
Automated ICD-10 code assignment of nonstandard diagnoses via a two-stage framework.通过两阶段框架对非标准诊断进行自动ICD-10编码分配
Artif Intell Med. 2020 Aug;108:101939. doi: 10.1016/j.artmed.2020.101939. Epub 2020 Aug 15.
3
Hyperbolic graph convolutional neural network with contrastive learning for automated ICD coding.基于对比学习的双曲图卷积神经网络在自动化 ICD 编码中的应用。
Comput Biol Med. 2024 Jan;168:107797. doi: 10.1016/j.compbiomed.2023.107797. Epub 2023 Dec 1.
4
Leveraging Semantics in WordNet to Facilitate the Computer-Assisted Coding of ICD-11.利用 WordNet 中的语义来促进 ICD-11 的计算机辅助编码。
IEEE J Biomed Health Inform. 2020 May;24(5):1469-1476. doi: 10.1109/JBHI.2019.2949567. Epub 2019 Oct 25.
5
An empirical evaluation of supervised learning approaches in assigning diagnosis codes to electronic medical records.对监督学习方法在为电子病历分配诊断代码中的实证评估。
Artif Intell Med. 2015 Oct;65(2):155-66. doi: 10.1016/j.artmed.2015.04.007. Epub 2015 May 15.
6
Automatic International Classification of Diseases Coding System: Deep Contextualized Language Model With Rule-Based Approaches.自动国际疾病分类编码系统:基于规则方法的深度情境化语言模型
JMIR Med Inform. 2022 Jun 29;10(6):e37557. doi: 10.2196/37557.
7
Improving Quality of ICD-10 (International Statistical Classification of Diseases, Tenth Revision) Coding Using AI: Protocol for a Crossover Randomized Controlled Trial.利用人工智能提高国际疾病分类第十次修订版(ICD-10)编码质量:一项交叉随机对照试验方案
JMIR Res Protoc. 2024 Mar 12;13:e54593. doi: 10.2196/54593.
8
A Pseudo Label-Wise Attention Network for Automatic ICD Coding.基于伪标签注意力网络的 ICD 自动编码方法。
IEEE J Biomed Health Inform. 2022 Oct;26(10):5201-5212. doi: 10.1109/JBHI.2022.3193291. Epub 2022 Oct 5.
9
Comparison of different feature extraction methods for applicable automated ICD coding.不同特征提取方法在适用的自动化 ICD 编码中的比较。
BMC Med Inform Decis Mak. 2022 Jan 12;22(1):11. doi: 10.1186/s12911-022-01753-5.
10
Automatic ICD code assignment of Chinese clinical notes based on multilayer attention BiRNN.基于多层注意力 BiRNN 的中文临床记录自动 ICD 编码分配。
J Biomed Inform. 2019 Mar;91:103114. doi: 10.1016/j.jbi.2019.103114. Epub 2019 Feb 12.