• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

可解释的深度学习将诊断文本映射到 ICD-10 代码。

Interpretable deep learning to map diagnostic texts to ICD-10 codes.

机构信息

Department of Languages and Computer Systems. IXA Research Group: http://ixa.eus. University of the Basque Country (UPV-EHU), Leioa, Spain.

出版信息

Int J Med Inform. 2019 Sep;129:49-59. doi: 10.1016/j.ijmedinf.2019.05.015. Epub 2019 May 22.

DOI:10.1016/j.ijmedinf.2019.05.015
PMID:31445289
Abstract

BACKGROUND

Automatic extraction of morbid disease or conditions contained in Death Certificates is a critical process, useful for billing, epidemiological studies and comparison across countries. The fact that these clinical documents are written in regular natural language makes the automatic coding process difficult because, often, spontaneous terms diverge strongly from standard reference terminology such as the International Classification of Diseases (ICD).

OBJECTIVE

Our aim is to propose a general and multilingual approach to render Diagnostic Terms into the standard framework provided by the ICD. We have evaluated our proposal on a set of clinical texts written in French, Hungarian and Italian.

METHODS

ICD-10 encoding is a multi-class classification problem with an extensive (thousands) number of classes. After considering several approaches, we tackle our objective as a sequence-to-sequence task. According to current trends, we opted to use neural networks. We tested different types of neural architectures on three datasets in which Diagnostic Terms (DTs) have their ICD-10 codes associated.

RESULTS AND CONCLUSIONS

Our results give a new state-of-the art on multilingual ICD-10 coding, outperforming several alternative approaches, and showing the feasibility of automatic ICD-10 prediction obtaining an F-measure of 0.838, 0.963 and 0.952 for French, Hungarian and Italian, respectively. Additionally, the results are interpretable, providing experts with supporting evidence when confronted with coding decisions, as the model is able to show the alignments between the original text and each output code.

摘要

背景

从死亡证明中自动提取包含的病态疾病或情况是一个关键过程,对于计费、流行病学研究和国家间比较都非常有用。这些临床文档是用常规自然语言书写的,这使得自动编码过程变得困难,因为自发术语通常与国际疾病分类(ICD)等标准参考术语有很大的差异。

目的

我们的目标是提出一种通用的多语言方法,将诊断术语转换为 ICD 提供的标准框架。我们已经在一组用法语、匈牙利语和意大利语书写的临床文本上评估了我们的提案。

方法

ICD-10 编码是一个多类分类问题,有数千个类。在考虑了几种方法之后,我们将目标视为序列到序列任务。根据当前的趋势,我们选择使用神经网络。我们在三个数据集上测试了不同类型的神经网络架构,其中每个数据集都将诊断术语(DT)与其 ICD-10 代码相关联。

结果与结论

我们的结果在多语言 ICD-10 编码方面取得了新的最新水平,优于几种替代方法,并展示了自动 ICD-10 预测的可行性,在法语、匈牙利语和意大利语上的 F 度量分别为 0.838、0.963 和 0.952。此外,结果是可解释的,为专家在面对编码决策时提供了支持证据,因为模型能够显示原始文本和每个输出代码之间的对齐方式。

相似文献

1
Interpretable deep learning to map diagnostic texts to ICD-10 codes.可解释的深度学习将诊断文本映射到 ICD-10 代码。
Int J Med Inform. 2019 Sep;129:49-59. doi: 10.1016/j.ijmedinf.2019.05.015. Epub 2019 May 22.
2
An empirical evaluation of deep learning for ICD-9 code assignment using MIMIC-III clinical notes.基于 MIMIC-III 临床记录的深度学习方法在 ICD-9 编码任务中的实证评估
Comput Methods Programs Biomed. 2019 Aug;177:141-153. doi: 10.1016/j.cmpb.2019.05.024. Epub 2019 May 25.
3
Supervised Learning for the ICD-10 Coding of French Clinical Narratives.用于法国临床叙述的ICD - 10编码的监督学习
Stud Health Technol Inform. 2020 Jun 16;270:427-431. doi: 10.3233/SHTI200196.
4
Deep neural models for ICD-10 coding of death certificates and autopsy reports in free-text.深度学习模型在自由文本中进行 ICD-10 死亡证明和尸检报告编码。
J Biomed Inform. 2018 Apr;80:64-77. doi: 10.1016/j.jbi.2018.02.011. Epub 2018 Feb 26.
5
A Deep Learning Framework for Automated ICD-10 Coding.一种用于自动ICD - 10编码的深度学习框架。
Stud Health Technol Inform. 2021 May 27;281:347-351. doi: 10.3233/SHTI210178.
6
Creating a computer assisted ICD coding system: Performance metric choice and use of the ICD hierarchy.创建计算机辅助 ICD 编码系统:性能指标的选择和 ICD 层次结构的使用。
J Biomed Inform. 2024 Apr;152:104617. doi: 10.1016/j.jbi.2024.104617. Epub 2024 Mar 1.
7
Machine Learning Approaches on Diagnostic Term Encoding With the ICD for Clinical Documentation.基于 ICD 对临床文档进行诊断术语编码的机器学习方法。
IEEE J Biomed Health Inform. 2018 Jul;22(4):1323-1329. doi: 10.1109/JBHI.2017.2743824. Epub 2017 Aug 24.
8
Boosting ICD multi-label classification of health records with contextual embeddings and label-granularity.利用上下文嵌入和标签粒度增强 ICD 多标签健康记录分类。
Comput Methods Programs Biomed. 2020 May;188:105264. doi: 10.1016/j.cmpb.2019.105264. Epub 2019 Dec 10.
9
A cross-lingual approach to automatic ICD-10 coding of death certificates by exploring machine translation.通过探索机器翻译实现死亡证明 ICD-10 自动编码的跨语言方法。
J Biomed Inform. 2019 Jun;94:103207. doi: 10.1016/j.jbi.2019.103207. Epub 2019 May 8.
10
Automated ICD-9 Coding via A Deep Learning Approach.基于深度学习的自动化 ICD-9 编码。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1193-1202. doi: 10.1109/TCBB.2018.2817488. Epub 2018 Mar 20.

引用本文的文献

1
Developing and Validating an Automatic Support System for Tumor Coding in Pathology Reports in Spanish.开发并验证一个用于西班牙语病理报告中肿瘤编码的自动支持系统。
JCO Clin Cancer Inform. 2025 Feb;9:e2400124. doi: 10.1200/CCI.24.00124. Epub 2025 Feb 24.
2
Automating surgical procedure extraction for society of surgeons adult cardiac surgery registry using pretrained language models.使用预训练语言模型实现外科医生协会成人心脏手术登记处手术程序提取的自动化。
JAMIA Open. 2024 Jul 24;7(3):ooae054. doi: 10.1093/jamiaopen/ooae054. eCollection 2024 Oct.
3
Artificial Intelligence and Healthcare: A Journey through History, Present Innovations, and Future Possibilities.
人工智能与医疗保健:一段贯穿历史、当前创新及未来可能性的历程。
Life (Basel). 2024 Apr 26;14(5):557. doi: 10.3390/life14050557.
4
Multi-label Few-shot ICD Coding as Autoregressive Generation with Prompt.基于提示的自回归生成式多标签少样本ICD编码
Proc AAAI Conf Artif Intell. 2023 Jun 26;37(4):5366-5374. doi: 10.1609/aaai.v37i4.25668.
5
Knowledge Injected Prompt Based Fine-tuning for Multi-label Few-shot ICD Coding.基于知识注入提示的多标签少样本ICD编码微调
Proc Conf Empir Methods Nat Lang Process. 2022 Dec;2022:1767-1781.
6
Automatic Identification of Patients With Unexplained Left Ventricular Hypertrophy in Electronic Health Record Data to Improve Targeted Treatment and Family Screening.在电子健康记录数据中自动识别不明原因左心室肥厚患者以改善靶向治疗和家庭筛查
Front Cardiovasc Med. 2022 Apr 15;9:768847. doi: 10.3389/fcvm.2022.768847. eCollection 2022.
7
Neural Translation and Automated Recognition of ICD-10 Medical Entities From Natural Language: Model Development and Performance Assessment.从自然语言中对ICD - 10医学实体进行神经翻译和自动识别:模型开发与性能评估
JMIR Med Inform. 2022 Apr 11;10(4):e26353. doi: 10.2196/26353.
8
Can Natural Language Processing and Artificial Intelligence Automate The Generation of Billing Codes From Operative Note Dictations?自然语言处理和人工智能能否根据手术记录口述自动生成计费代码?
Global Spine J. 2023 Sep;13(7):1946-1955. doi: 10.1177/21925682211062831. Epub 2022 Feb 28.
9
Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and Health.对研究不足的医学概念领域进行自动编码:将身体活动报告与《国际功能、残疾和健康分类》相联系。
Front Digit Health. 2021 Mar;3. doi: 10.3389/fdgth.2021.620828. Epub 2021 Mar 10.
10
Automatic multilabel detection of ICD10 codes in Dutch cardiology discharge letters using neural networks.使用神经网络自动多标签检测荷兰心脏病学出院小结中的ICD10编码
NPJ Digit Med. 2021 Feb 26;4(1):37. doi: 10.1038/s41746-021-00404-9.