• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

比较大型语言模型与先前深度学习模型在药物不良反应命名实体识别上的表现。

Comparing a Large Language Model with Previous Deep Learning Models on Named Entity Recognition of Adverse Drug Events.

机构信息

Public health and medical information unit, Saint Etienne University Hospital, France.

Laboratoire Inserm, SAINBIOSE, U1059, dysfonction vasculaire et hémostase, université Jean-Monnet, Saint-Étienne, France.

出版信息

Stud Health Technol Inform. 2024 Aug 22;316:781-785. doi: 10.3233/SHTI240528.

DOI:10.3233/SHTI240528
PMID:39176909
Abstract

The ability to fine-tune pre-trained deep learning models to learn how to process a downstream task using a large training set allow to significantly improve performances of named entity recognition. Large language models are recent models based on the Transformers architecture that may be conditioned on a new task with in-context learning, by providing a series of instructions or prompt. These models only require few examples and such approach is defined as few shot learning. Our objective was to compare performances of named entity recognition of adverse drug events between state of the art deep learning models fine-tuned on Pubmed abstracts and a large language model using few-shot learning. Hussain et al's state of the art model (PMID: 34422092) significantly outperformed the ChatGPT-3.5 model (F1-Score: 97.6% vs 86.0%). Few-shot learning is a convenient way to perform named entity recognition when training examples are rare, but performances are still inferior to those of a deep learning model fine-tuned with several training examples. Perspectives are to evaluate few-shot prompting with GPT-4 and perform fine-tuning on GPT-3.5.

摘要

使用大型训练集微调预先训练的深度学习模型以学习如何处理下游任务的能力,可显著提高命名实体识别的性能。大型语言模型是基于 Transformer 架构的最新模型,可通过提供一系列指令或提示,通过上下文学习对新任务进行条件处理。这些模型仅需要少量示例,这种方法被定义为少样本学习。我们的目标是比较在 Pubmed 摘要上微调的最先进的深度学习模型和使用少样本学习的大型语言模型在药物不良事件命名实体识别方面的性能。Hussain 等人的最先进模型(PMID:34422092)显著优于 ChatGPT-3.5 模型(F1 得分:97.6%对 86.0%)。当训练示例很少时,少样本学习是执行命名实体识别的一种便捷方法,但性能仍不如经过几个训练示例微调的深度学习模型。未来的研究方向是评估 GPT-4 的少样本提示并在 GPT-3.5 上进行微调。

相似文献

1
Comparing a Large Language Model with Previous Deep Learning Models on Named Entity Recognition of Adverse Drug Events.比较大型语言模型与先前深度学习模型在药物不良反应命名实体识别上的表现。
Stud Health Technol Inform. 2024 Aug 22;316:781-785. doi: 10.3233/SHTI240528.
2
Improving large language models for clinical named entity recognition via prompt engineering.通过提示工程改进临床命名实体识别的大型语言模型。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1812-1820. doi: 10.1093/jamia/ocad259.
3
Sample Size Considerations for Fine-Tuning Large Language Models for Named Entity Recognition Tasks: Methodological Study.用于命名实体识别任务的大语言模型微调的样本量考量:方法学研究
JMIR AI. 2024 May 16;3:e52095. doi: 10.2196/52095.
4
Deep learning approaches for extracting adverse events and indications of dietary supplements from clinical text.深度学习方法从临床文本中提取膳食补充剂的不良事件和适应证。
J Am Med Inform Assoc. 2021 Mar 1;28(3):569-577. doi: 10.1093/jamia/ocaa218.
5
A large language model-based generative natural language processing framework fine-tuned on clinical notes accurately extracts headache frequency from electronic health records.基于大型语言模型的生成式自然语言处理框架,在临床笔记上进行了微调,能够从电子健康记录中准确提取头痛频率。
Headache. 2024 Apr;64(4):400-409. doi: 10.1111/head.14702. Epub 2024 Mar 25.
6
From zero to hero: Harnessing transformers for biomedical named entity recognition in zero- and few-shot contexts.从零到英雄:利用变压器在零样本和少样本上下文中进行生物医学命名实体识别。
Artif Intell Med. 2024 Oct;156:102970. doi: 10.1016/j.artmed.2024.102970. Epub 2024 Aug 24.
7
Named Entity Recognition in Pubmed Abstracts for Pharmacovigilance Using Deep Learning.基于深度学习的药物警戒中 PubMed 文摘命名实体识别。
Stud Health Technol Inform. 2022 May 25;294:878-879. doi: 10.3233/SHTI220615.
8
Artificial intelligence-powered pharmacovigilance: A review of machine and deep learning in clinical text-based adverse drug event detection for benchmark datasets.人工智能驱动的药物警戒:基于机器学习和深度学习的临床文本药物不良事件检测基准数据集综述。
J Biomed Inform. 2024 Apr;152:104621. doi: 10.1016/j.jbi.2024.104621. Epub 2024 Mar 5.
9
Few-Shot Learning for Clinical Natural Language Processing Using Siamese Neural Networks: Algorithm Development and Validation Study.使用暹罗神经网络的临床自然语言处理少样本学习:算法开发与验证研究
JMIR AI. 2023 May 4;2:e44293. doi: 10.2196/44293.
10
Language model based on deep learning network for biomedical named entity recognition.基于深度学习网络的生物医学命名实体识别语言模型。
Methods. 2024 Jun;226:71-77. doi: 10.1016/j.ymeth.2024.04.013. Epub 2024 Apr 17.

引用本文的文献

1
A scoping review on generative AI and large language models in mitigating medication related harm.关于生成式人工智能和大语言模型在减轻药物相关危害方面的范围综述。
NPJ Digit Med. 2025 Mar 28;8(1):182. doi: 10.1038/s41746-025-01565-7.