Few-shot learning for medical text: A review of advances, trends, and opportunities.

Affiliations

Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, GA, United States of America.

Department of Biomedical Informatics, Vanderbilt University Medical Center, Vanderbilt University, Nashville, TN, United States of America.

Publication information

J Biomed Inform. 2023 Aug;144:104458. doi: 10.1016/j.jbi.2023.104458. Epub 2023 Jul 23.

DOI:10.1016/j.jbi.2023.104458
PMID:37488023
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10940971/
Abstract

BACKGROUND

Few-shot learning (FSL) is a class of machine learning methods that require small numbers of labeled instances for training. With many medical topics having limited annotated text-based data in practical settings, FSL-based natural language processing (NLP) holds substantial promise. We aimed to conduct a review to explore the current state of FSL methods for medical NLP.

METHODS

We searched for articles published between January 2016 and October 2022 using PubMed/Medline, Embase, ACL Anthology, and IEEE Xplore Digital Library. We also searched the preprint servers (e.g., arXiv, medRxiv, and bioRxiv) via Google Scholar to identify the latest relevant methods. We included all articles that involved FSL and any form of medical text. We abstracted articles based on the data source, target task, training set size, primary method(s)/approach(es), and evaluation metric(s).

RESULTS

Fifty-one articles met our inclusion criteria, all published after 2018 and most since 2020 (42/51; 82%). Concept extraction/named entity recognition was the most frequently addressed task (21/51; 41%), followed by text classification (16/51; 31%). Thirty-two (61%) articles reconstructed existing datasets to fit few-shot scenarios, and MIMIC-III was the most frequently used dataset (10/51; 20%). 77% of the articles attempted to incorporate prior knowledge to augment the small datasets available for training. Common methods included FSL with attention mechanisms (20/51; 39%), prototypical networks (11/51; 22%), meta-learning (7/51; 14%), and prompt-based learning methods, the latter being particularly popular since 2021. Benchmarking experiments demonstrated relative underperformance of FSL methods on biomedical NLP tasks.

CONCLUSION

Despite the potential for FSL in biomedical NLP, progress has been limited. This may be attributed to the rarity of specialized data, lack of standardized evaluation criteria, and the underperformance of FSL methods on biomedical topics. The creation of publicly-available specialized datasets for biomedical FSL may aid method development by facilitating comparative analyses.
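
For readers unfamiliar with the prototypical networks mentioned in the results, the following is a minimal sketch of the classification step only, not the method of any reviewed article. It assumes text has already been mapped to embedding vectors; in a real prototypical network those embeddings come from a trained neural encoder, whereas here the vectors, the labels `medication` and `symptom`, and the helper names `class_prototypes` and `classify` are all made up for illustration.

```python
from collections import defaultdict
from math import dist


def class_prototypes(support):
    """Average the embedding vectors of each class's few labeled examples."""
    grouped = defaultdict(list)
    for label, vector in support:
        grouped[label].append(vector)
    return {
        label: [sum(dim) / len(vectors) for dim in zip(*vectors)]
        for label, vectors in grouped.items()
    }


def classify(query, prototypes):
    """Assign the query to the class whose prototype is nearest (Euclidean)."""
    return min(prototypes, key=lambda label: dist(query, prototypes[label]))


# Hypothetical 2-way, 2-shot episode: two classes, two labeled examples each.
# The 2-D vectors stand in for encoder outputs and are invented for this sketch.
support_set = [
    ("medication", [0.9, 0.1]),
    ("medication", [0.8, 0.2]),
    ("symptom", [0.1, 0.9]),
    ("symptom", [0.2, 0.7]),
]
prototypes = class_prototypes(support_set)
prediction = classify([0.75, 0.3], prototypes)
print(prediction)
```

With only two labeled examples per class, the query is assigned to whichever class mean lies closest, which is why the approach suits settings where annotated medical text is scarce.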

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a42d/10940971/c31fe0473c32/nihms-1921525-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a42d/10940971/e97c70cfddbf/nihms-1921525-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a42d/10940971/330012382986/nihms-1921525-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a42d/10940971/7f501b57474f/nihms-1921525-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a42d/10940971/d609c0fb7103/nihms-1921525-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a42d/10940971/409fa3089681/nihms-1921525-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a42d/10940971/ebffff0e869b/nihms-1921525-f0007.jpg
