Ge Yao, Guo Yuting, Yang Yuan-Chi, Al-Garadi Mohammed Ali, Sarker Abeed
Department of Biomedical Informatics School of Medicine, Emory University Atlanta, GA.
Proc (IEEE Int Conf Healthc Inform). 2022 Jun;2022:84-89. doi: 10.1109/ichi54592.2022.00024. Epub 2022 Sep 8.
Many research problems involving medical texts have limited amounts of annotated data available (e.g., expressions of rare diseases). Traditional supervised machine learning algorithms, particularly those based on deep neural networks, require large volumes of annotated data, and they underperform when only small amounts of labeled data are available. Few-shot learning (FSL) is a category of machine learning models designed to solve problems for which only small annotated datasets are available. However, no current study compares the performance of FSL models with that of traditional models (e.g., conditional random fields) on medical text at different training set sizes. In this paper, we attempt to fill this gap by comparing multiple FSL models with traditional models on the task of named entity recognition (NER) from medical texts. Using five health-related annotated NER datasets, we benchmarked three traditional NER models based on BERT: BERT-Linear Classifier (BLC), BERT-CRF (BC), and SANER; and three FSL NER models: StructShot & NNShot, Few-Shot Slot Tagging (FS-ST), and ProtoNER. Our benchmarking results show that almost all models, whether traditional or FSL, perform significantly below the state of the art when trained on small amounts of data. In our NER experiments, F-scores were very low with small training sets, typically below 30%. FSL models reported to perform well on non-medical texts substantially underperformed their reported best results on medical texts. Our experiments also suggest that FSL methods tend to perform worse on datasets drawn from noisy sources of medical text, such as social media (which includes misspellings and colloquial expressions), than on less noisy sources such as medical literature.
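For context, NER benchmarks such as those described above typically report entity-level F-scores: a predicted entity counts as correct only if its type and full span exactly match a gold entity. The sketch below (hypothetical helper names; standard BIO tagging scheme assumed) illustrates how such a score is computed:

```python
# Minimal sketch of entity-level F1 for NER over BIO-tagged sequences.
# An entity is a (type, start, end) span; credit requires an exact match.
# Helper names are illustrative, not from any of the benchmarked systems.

def extract_spans(tags):
    """Collect (type, start, end) spans from a BIO tag sequence."""
    spans, start, etype = [], None, None
    for i, tag in enumerate(tags + ["O"]):  # "O" sentinel flushes the last span
        if tag.startswith("B-") or tag == "O":
            if start is not None:
                spans.append((etype, start, i))
                start, etype = None, None
            if tag.startswith("B-"):
                start, etype = i, tag[2:]
        elif tag.startswith("I-") and etype == tag[2:]:
            continue  # still inside the current entity
        else:  # ill-formed I- tag: close the open span, start a new one
            if start is not None:
                spans.append((etype, start, i))
            start, etype = i, tag[2:]
    return set(spans)

def ner_f1(gold_tags, pred_tags):
    """Entity-level F1: exact span-and-type matches between gold and prediction."""
    gold, pred = extract_spans(gold_tags), extract_spans(pred_tags)
    tp = len(gold & pred)
    prec = tp / len(pred) if pred else 0.0
    rec = tp / len(gold) if gold else 0.0
    return 2 * prec * rec / (prec + rec) if prec + rec else 0.0
```

This strict exact-match criterion is one reason small-training-set F-scores drop so sharply: a model that finds an entity but misjudges its boundary by one token receives no credit.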
Our experiments demonstrate that the current state-of-the-art FSL systems are not yet suitable for effective NER in medical natural language processing tasks, and further research needs to be carried out to improve their performances. Creation of specialized, standardized datasets replicating real-world scenarios may help to move this category of methods forward.