Multi-label Few-shot ICD Coding as Autoregressive Generation with Prompt.

Authors

Yang Zhichao, Kwon Sunjae, Yao Zonghai, Yu Hong

Affiliations

College of Information and Computer Sciences, University of Massachusetts Amherst.

Department of Computer Science, University of Massachusetts Lowell.

Publication

Proc AAAI Conf Artif Intell. 2023 Jun 26;37(4):5366-5374. doi: 10.1609/aaai.v37i4.25668.

Abstract

Automatic International Classification of Diseases (ICD) coding aims to assign multiple ICD codes to a medical note averaging 3,000+ tokens. The task is challenging due to the high-dimensional multi-label space (155,000+ candidate ICD codes) and the long-tail challenge: many ICD codes are infrequently assigned, yet these infrequent codes are clinically important. This study addresses the long-tail challenge by transforming the multi-label classification task into an autoregressive generation task. Specifically, we first introduce a novel pretraining objective that generates free-text diagnoses and procedures following the SOAP structure, the medical logic physicians use for note documentation. Second, instead of directly predicting in the high-dimensional space of ICD codes, our model generates lower-dimensional text descriptions, from which the ICD codes are then inferred. Third, we design a novel prompt template for multi-label classification. We evaluate our Generation with Prompt (GP) model on the full code-assignment benchmark (MIMIC-III-full) and the few-shot ICD code-assignment benchmark (MIMIC-III-few). Experiments on MIMIC-III-few show that our model achieves a macro F1 of 30.2, substantially outperforming the previous MIMIC-III-full SOTA model (macro F1 4.3) and a model designed specifically for the few-/zero-shot setting (macro F1 18.7). Finally, we design a novel ensemble learner, a cross-attention reranker with prompts, to integrate the previous SOTA predictions with our best few-shot coding predictions. Experiments on MIMIC-III-full show that the ensemble substantially improves both macro and micro F1, from 10.4 to 14.6 and from 58.2 to 59.1, respectively.
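The macro/micro F1 gap the abstract reports reflects how the two metrics average over labels: macro F1 weights every code equally (so rare, long-tail codes dominate), while micro F1 pools counts across codes (so frequent codes dominate). A minimal sketch, using hypothetical toy ICD codes and pure Python rather than any evaluation code from the paper:

```python
from collections import Counter

def f1(tp, fp, fn):
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def macro_micro_f1(gold, pred, labels):
    """gold/pred: one set of assigned codes per note. Returns (macro, micro) F1."""
    counts = {lab: Counter() for lab in labels}
    for g, p in zip(gold, pred):
        for lab in labels:
            if lab in p and lab in g:
                counts[lab]["tp"] += 1
            elif lab in p:
                counts[lab]["fp"] += 1
            elif lab in g:
                counts[lab]["fn"] += 1
    # Macro: average the per-label F1 scores (every code counts equally).
    macro = sum(f1(c["tp"], c["fp"], c["fn"]) for c in counts.values()) / len(labels)
    # Micro: pool TP/FP/FN over all labels, then compute one F1.
    total = Counter()
    for c in counts.values():
        total.update(c)
    micro = f1(total["tp"], total["fp"], total["fn"])
    return macro, micro

# Toy long-tail case: frequent code "401.9" is predicted perfectly,
# rare code "V10.3" is missed entirely.
gold = [{"401.9"}, {"401.9"}, {"401.9", "V10.3"}]
pred = [{"401.9"}, {"401.9"}, {"401.9"}]
macro, micro = macro_micro_f1(gold, pred, labels=["401.9", "V10.3"])
# macro = 0.5 (the missed rare code contributes an F1 of 0),
# while micro = 6/7 ≈ 0.857 (dominated by the frequent code).
```

Missing a single rare code halves macro F1 here but barely dents micro F1, which is why long-tail methods are judged primarily on macro F1.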

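The abstract's second idea, generating a free-text description and then inferring the code from it, can be sketched as a description-to-code lookup. The codes, descriptions, and `difflib` string matcher below are illustrative stand-ins: a real system would use the full ICD vocabulary and a learned matcher, not fuzzy string matching.

```python
import difflib

# Hypothetical fragment of an ICD-9 code -> description table.
ICD_DESCRIPTIONS = {
    "401.9": "unspecified essential hypertension",
    "250.00": "diabetes mellitus without mention of complication",
    "428.0": "congestive heart failure unspecified",
}

def infer_code(generated_text):
    """Map one generated free-text diagnosis to the code whose
    official description it most closely matches."""
    descriptions = list(ICD_DESCRIPTIONS.values())
    # cutoff=0.0 so the closest description is always returned.
    best = difflib.get_close_matches(generated_text.lower(), descriptions,
                                     n=1, cutoff=0.0)[0]
    return next(code for code, d in ICD_DESCRIPTIONS.items() if d == best)

infer_code("essential hypertension, unspecified")  # -> "401.9"
```

The point of the indirection is dimensionality: the model only has to produce fluent clinical text, a space it already covers from pretraining, instead of scoring 155,000+ discrete labels directly.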

Similar Articles

A Pseudo Label-Wise Attention Network for Automatic ICD Coding.
IEEE J Biomed Health Inform. 2022 Oct;26(10):5201-5212. doi: 10.1109/JBHI.2022.3193291. Epub 2022 Oct 5.

Can GPT-3.5 generate and code discharge summaries?
J Am Med Inform Assoc. 2024 Oct 1;31(10):2284-2293. doi: 10.1093/jamia/ocae132.

Cited By

References

Clinical Prompt Learning With Frozen Language Models.
IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16453-16463. doi: 10.1109/TNNLS.2023.3294633. Epub 2024 Oct 29.

Generating Accurate Electronic Health Assessment from Medical Graph.
Proc Conf Empir Methods Nat Lang Process. 2020 Nov;2020:3764-3773. doi: 10.18653/v1/2020.findings-emnlp.336.

Interpretable deep learning to map diagnostic texts to ICD-10 codes.
Int J Med Inform. 2019 Sep;129:49-59. doi: 10.1016/j.ijmedinf.2019.05.015. Epub 2019 May 22.
