Deep neural models for ICD-10 coding of death certificates and autopsy reports in free-text.

Affiliations

INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Portugal.

Direção-Geral da Saúde, Portugal.

Publication information

J Biomed Inform. 2018 Apr;80:64-77. doi: 10.1016/j.jbi.2018.02.011. Epub 2018 Feb 26.

DOI:10.1016/j.jbi.2018.02.011

PMID:29496630

Abstract

We address the assignment of ICD-10 codes for causes of death by analyzing free-text descriptions in death certificates, together with the associated autopsy reports and clinical bulletins, from the Portuguese Ministry of Health. We leverage a deep neural network that combines word embeddings, recurrent units, and neural attention, for the generation of intermediate representations of the textual contents. The neural network also explores the hierarchical nature of the input data, by building representations from the sequences of words within individual fields, which are then combined according to the sequences of fields that compose the inputs. Moreover, we explore innovative mechanisms for initializing the weights of the final nodes of the network, leveraging co-occurrences between classes together with the hierarchical structure of ICD-10. Experimental results attest to the contribution of the different neural network components. Our best model achieves accuracy scores over 89%, 81%, and 76%, respectively for ICD-10 chapters, blocks, and full-codes. Through examples, we also show that our method can produce interpretable results, useful for public health surveillance.
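The hierarchical encoding described in the abstract (words pooled into field representations, fields pooled into a document representation, each step weighted by attention) can be illustrated with a minimal sketch. This is not the authors' implementation: it omits the recurrent units entirely, uses random vectors in place of trained word embeddings, and every name in it (`attention_pool`, `encode_document`, `word_context`, `field_context`, the example tokens) is hypothetical. It only shows how two levels of attention yield weights that can be inspected per word and per field.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(vectors, context):
    """Attention-weighted average of `vectors`.

    Each vector is scored by its dot product with `context` (an assumed,
    simplified scoring function); the scores are normalized with softmax.
    """
    scores = [sum(a * b for a, b in zip(v, context)) for v in vectors]
    weights = softmax(scores)
    dim = len(vectors[0])
    pooled = [sum(w * v[d] for w, v in zip(weights, vectors)) for d in range(dim)]
    return pooled, weights


def encode_document(fields, embeddings, word_context, field_context):
    """Two-level encoding: words -> field vectors -> document vector."""
    field_vectors, word_weights = [], []
    for tokens in fields:
        vecs = [embeddings[t] for t in tokens]
        pooled, weights = attention_pool(vecs, word_context)
        field_vectors.append(pooled)
        word_weights.append(weights)
    doc_vector, field_weights = attention_pool(field_vectors, field_context)
    return doc_vector, word_weights, field_weights


# Hypothetical input: three free-text fields of one death certificate.
random.seed(0)
DIM = 4
fields = [["pneumonia"], ["sepsis", "shock"], ["diabetes", "mellitus"]]
vocab = sorted({t for f in fields for t in f})
# Random stand-ins for trained embeddings and attention context vectors.
embeddings = {t: [random.uniform(-1, 1) for _ in range(DIM)] for t in vocab}
word_context = [random.uniform(-1, 1) for _ in range(DIM)]
field_context = [random.uniform(-1, 1) for _ in range(DIM)]

doc_vector, word_weights, field_weights = encode_document(
    fields, embeddings, word_context, field_context
)
```

In a trained model, the document vector would feed the output layers for chapters, blocks, and full codes, and the word-level and field-level weights would indicate which parts of the text contributed most to a prediction, which is one plausible reading of the interpretability mentioned in the abstract.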

Similar articles

Transformer-based models for ICD-10 coding of death certificates with Portuguese text.

J Biomed Inform. 2022 Dec;136:104232. doi: 10.1016/j.jbi.2022.104232. Epub 2022 Oct 25.

Towards automated clinical coding.

Int J Med Inform. 2018 Dec;120:50-61. doi: 10.1016/j.ijmedinf.2018.09.021. Epub 2018 Oct 2.

Combining deep neural networks, a rule-based expert system and targeted manual coding for ICD-10 coding causes of death of French death certificates from 2018 to 2019.

Int J Med Inform. 2024 Aug;188:105462. doi: 10.1016/j.ijmedinf.2024.105462. Epub 2024 Apr 26.

Boosting ICD multi-label classification of health records with contextual embeddings and label-granularity.

Comput Methods Programs Biomed. 2020 May;188:105264. doi: 10.1016/j.cmpb.2019.105264. Epub 2019 Dec 10.

Explainable automated coding of clinical notes using hierarchical label-wise attention networks and label embedding initialisation.

J Biomed Inform. 2021 Apr;116:103728. doi: 10.1016/j.jbi.2021.103728. Epub 2021 Mar 9.

DRCNN: A deep recurrent convolutional neural network with transfer learning through pre-trained embeddings for automated ICD coding.

Methods. 2022 Sep;205:97-105. doi: 10.1016/j.ymeth.2022.06.004. Epub 2022 Jul 1.

Automated ICD-9 Coding via A Deep Learning Approach.

IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1193-1202. doi: 10.1109/TCBB.2018.2817488. Epub 2018 Mar 20.

Interpretable deep learning to map diagnostic texts to ICD-10 codes.

Int J Med Inform. 2019 Sep;129:49-59. doi: 10.1016/j.ijmedinf.2019.05.015. Epub 2019 May 22.

Creating a computer assisted ICD coding system: Performance metric choice and use of the ICD hierarchy.

J Biomed Inform. 2024 Apr;152:104617. doi: 10.1016/j.jbi.2024.104617. Epub 2024 Mar 1.

Cited by

Real-Time Classification of Causes of Death Using AI: Sensitivity Analysis.

JMIR AI. 2023 Nov 22;2:e40965. doi: 10.2196/40965.

From explainable to interpretable deep learning for natural language processing in healthcare: How far from reality?

Comput Struct Biotechnol J. 2024 May 9;24:362-373. doi: 10.1016/j.csbj.2024.05.004. eCollection 2024 Dec.

Using Natural Language Processing to Predict Fatal Drug Overdose From Autopsy Narrative Text: Algorithm Development and Validation Study.

JMIR Public Health Surveill. 2023 May 19;9:e45246. doi: 10.2196/45246.

A method for rapid machine learning development for data mining with doctor-in-the-loop.

PLoS One. 2023 May 10;18(5):e0284965. doi: 10.1371/journal.pone.0284965. eCollection 2023.

Automatic ICD-10 coding: Deep semantic matching based on analogical reasoning.

Heliyon. 2023 Apr 19;9(4):e15570. doi: 10.1016/j.heliyon.2023.e15570. eCollection 2023 Apr.

Automated extraction of information of lung cancer staging from unstructured reports of PET-CT interpretation: natural language processing with deep-learning.

BMC Med Inform Decis Mak. 2022 Sep 1;22(1):229. doi: 10.1186/s12911-022-01975-7.

DLKN-MLC: A Disease Prediction Model via Multi-Label Learning.

Int J Environ Res Public Health. 2022 Aug 8;19(15):9771. doi: 10.3390/ijerph19159771.

Judicial consequences in Spain for the completion of the medical death certificate.

Int J Legal Med. 2022 Jan;136(1):365-372. doi: 10.1007/s00414-021-02733-6. Epub 2021 Oct 26.

Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and Health.

Front Digit Health. 2021 Mar;3. doi: 10.3389/fdgth.2021.620828. Epub 2021 Mar 10.

Automatic multilabel detection of ICD10 codes in Dutch cardiology discharge letters using neural networks.

NPJ Digit Med. 2021 Feb 26;4(1):37. doi: 10.1038/s41746-021-00404-9.
