From zero to hero: Harnessing transformers for biomedical named entity recognition in zero- and few-shot contexts.

Affiliations

Institute for Artificial Intelligence Research and Development of Serbia, Fruškogorska 1, Novi Sad, 21000, Serbia.

Institute for Artificial Intelligence Research and Development of Serbia, Fruškogorska 1, Novi Sad, 21000, Serbia; Bayer A.G., Research and Development, Mullerstrasse 173, Berlin, 13342, Germany.

Publication information

Artif Intell Med. 2024 Oct;156:102970. doi: 10.1016/j.artmed.2024.102970. Epub 2024 Aug 24.

DOI:10.1016/j.artmed.2024.102970

PMID:39197375

Abstract

Supervised named entity recognition (NER) in the biomedical domain depends on large sets of texts annotated with the given named entities. Creating such datasets can be time-consuming and expensive, and extracting new entities requires additional annotation tasks and retraining the model. This paper proposes a method for zero- and few-shot NER in the biomedical domain to address these challenges. The method is based on transforming the task of multi-class token classification into binary token classification and pre-training on a large number of datasets and biomedical entities, which allows the model to learn semantic relations between the given and potentially novel named entity labels. We have achieved average F1 scores of 35.44% for zero-shot NER, 50.10% for one-shot NER, 69.94% for 10-shot NER, and 79.51% for 100-shot NER on 9 diverse evaluated biomedical entities with a fine-tuned PubMedBERT-based model. The results demonstrate the effectiveness of the proposed method for recognizing new biomedical entities with no or a limited number of examples: it outperforms previous transformer-based methods and is comparable to GPT3-based models while using models with over 1000 times fewer parameters. We make the models and developed code publicly available.
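The reformulation from multi-class to binary token classification described in the abstract can be illustrated with a minimal sketch. It assumes BIO-style tags as input and assumes that the entity label is supplied to the model alongside the sentence; the abstract does not specify the exact input encoding, and the function name `to_binary_examples` and the toy sentence are hypothetical, not taken from the authors' code.

```python
from typing import List, Tuple

BinaryExample = Tuple[str, List[str], List[int]]


def to_binary_examples(
    tokens: List[str],
    tags: List[str],
    entity_types: List[str],
) -> List[BinaryExample]:
    """Turn one multi-class tagged sentence into one binary example per entity type.

    Each example pairs an entity label with the sentence and a 0/1 label per
    token: 1 if the token belongs to that entity type, 0 otherwise.
    Assumption: tags follow the BIO scheme ("B-Drug", "I-Drug", "O").
    """
    examples: List[BinaryExample] = []
    for entity in entity_types:
        # Strip the "B-"/"I-" prefix so only the entity class is compared.
        binary = [1 if tag.split("-", 1)[-1] == entity else 0 for tag in tags]
        examples.append((entity, tokens, binary))
    return examples


# Hypothetical toy sentence; "Gene" does not occur, so its labels are all 0.
tokens = ["Aspirin", "reduces", "fever"]
tags = ["B-Drug", "O", "B-Symptom"]
examples = to_binary_examples(tokens, tags, ["Drug", "Symptom", "Gene"])
for entity, _, binary in examples:
    print(entity, binary)
```

Because the entity label becomes part of the input rather than a fixed output class, the same binary classifier can in principle be queried with a label it never saw during training, which is what makes the zero-shot setting possible.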

Similar articles

Improving biomedical Named Entity Recognition with additional external contexts.

J Biomed Inform. 2024 Aug;156:104674. doi: 10.1016/j.jbi.2024.104674. Epub 2024 Jun 11.

Evaluating Medical Entity Recognition in Health Care: Entity Model Quantitative Study.

JMIR Med Inform. 2024 Oct 17;12:e59782. doi: 10.2196/59782.

Advancing entity recognition in biomedicine via instruction tuning of large language models.

Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae163.

Transformers-sklearn: a toolkit for medical language understanding with transformer-based models.

BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):90. doi: 10.1186/s12911-021-01459-0.

Vocabulary Matters: An Annotation Pipeline and Four Deep Learning Algorithms for Enzyme Named Entity Recognition.

J Proteome Res. 2024 Jun 7;23(6):1915-1925. doi: 10.1021/acs.jproteome.3c00367. Epub 2024 May 11.

A Fine-Tuned Bidirectional Encoder Representations From Transformers Model for Food Named-Entity Recognition: Algorithm Development and Validation.

J Med Internet Res. 2021 Aug 9;23(8):e28229. doi: 10.2196/28229.

A comparison of few-shot and traditional named entity recognition models for medical text.

Proc (IEEE Int Conf Healthc Inform). 2022 Jun;2022:84-89. doi: 10.1109/ichi54592.2022.00024. Epub 2022 Sep 8.

Sample Size Considerations for Fine-Tuning Large Language Models for Named Entity Recognition Tasks: Methodological Study.

JMIR AI. 2024 May 16;3:e52095. doi: 10.2196/52095.

Multi-head CRF classifier for biomedical multi-class named entity recognition on Spanish clinical notes.

Database (Oxford). 2024 Jul 30;2024. doi: 10.1093/database/baae068.

Cited by

GRU-SCANET: unleashing the power of GRU-based sinusoidal capture network for precision-driven named entity recognition.

Bioinform Adv. 2025 Jun 16;5(1):vbaf096. doi: 10.1093/bioadv/vbaf096. eCollection 2025.

[Transformation of free-text radiology reports into structured data].

Radiologie (Heidelb). 2025 Apr;65(4):249-256. doi: 10.1007/s00117-025-01422-4. Epub 2025 Feb 11.

Biomedical named entity recognition using improved green anaconda-assisted Bi-GRU-based hierarchical ResNet model.

BMC Bioinformatics. 2025 Jan 30;26(1):34. doi: 10.1186/s12859-024-06008-w.
