• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChatGPT在食品命名实体识别与链接方面的零样本评估。

Zero-shot evaluation of ChatGPT for food named-entity recognition and linking.

作者信息

Ogrinc Matevž, Koroušić Seljak Barbara, Eftimov Tome

机构信息

Jožef Stefan International Postgraduate School, Ljubljana, Slovenia.

Department of Computer Systems, Jožef Stefan Institute, Ljubljana, Slovenia.

出版信息

Front Nutr. 2024 Aug 13;11:1429259. doi: 10.3389/fnut.2024.1429259. eCollection 2024.

DOI:10.3389/fnut.2024.1429259
PMID:39290564
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11406469/
Abstract

INTRODUCTION

Recognizing and extracting key information from textual data plays an important role in intelligent systems by maintaining up-to-date knowledge, reinforcing informed decision-making, question-answering, and more. It is especially apparent in the food domain, where critical information guides the decisions of nutritionists and clinicians. The information extraction process involves two natural language processing tasks named entity recognition-NER and named entity linking-NEL. With the emergence of large language models (LLMs), especially ChatGPT, many areas began incorporating its knowledge to reduce workloads or simplify tasks. In the field of food, however, we noticed an opportunity to involve ChatGPT in NER and NEL.

METHODS

To assess ChatGPT's capabilities, we have evaluated its two versions, ChatGPT-3.5 and ChatGPT-4, focusing on their performance across both NER and NEL tasks, emphasizing food-related data. To benchmark our results in the food domain, we also investigated its capabilities in a more broadly investigated biomedical domain. By evaluating its zero-shot capabilities, we were able to ascertain the strengths and weaknesses of the two versions of ChatGPT.

RESULTS

Despite being able to show promising results in NER compared to other models. When tasked with linking entities to their identifiers from semantic models ChatGPT's effectiveness falls drastically.

DISCUSSION

While the integration of ChatGPT holds potential across various fields, it is crucial to approach its use with caution, particularly in relying on its responses for critical decisions in food and bio-medicine.

摘要

引言

从文本数据中识别和提取关键信息在智能系统中起着重要作用,它能保持知识更新、加强明智决策、问答等。这在食品领域尤为明显,关键信息指导着营养学家和临床医生的决策。信息提取过程涉及两个自然语言处理任务,即命名实体识别(NER)和命名实体链接(NEL)。随着大语言模型(LLMs)的出现,尤其是ChatGPT,许多领域开始纳入其知识以减轻工作量或简化任务。然而,在食品领域,我们发现了让ChatGPT参与NER和NEL的机会。

方法

为了评估ChatGPT的能力,我们评估了它的两个版本,ChatGPT-3.5和ChatGPT-4,重点关注它们在NER和NEL任务中的表现,强调与食品相关的数据。为了在食品领域对我们的结果进行基准测试,我们还研究了它在更广泛研究的生物医学领域的能力。通过评估其零样本能力,我们能够确定ChatGPT两个版本的优缺点。

结果

尽管与其他模型相比,ChatGPT在NER方面能够显示出有希望的结果。但当任务是将实体与其语义模型中的标识符链接起来时,ChatGPT的有效性会大幅下降。

讨论

虽然ChatGPT的整合在各个领域都有潜力,但谨慎使用它至关重要,特别是在依靠其回答做出食品和生物医学关键决策时。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/9be205ceefea/fnut-11-1429259-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/8692365fff4d/fnut-11-1429259-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/9868f7bf2042/fnut-11-1429259-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/25414702df76/fnut-11-1429259-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/9dcc854d16e5/fnut-11-1429259-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/b9c188708fcd/fnut-11-1429259-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/9be205ceefea/fnut-11-1429259-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/8692365fff4d/fnut-11-1429259-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/9868f7bf2042/fnut-11-1429259-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/25414702df76/fnut-11-1429259-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/9dcc854d16e5/fnut-11-1429259-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/b9c188708fcd/fnut-11-1429259-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a4a/11406469/9be205ceefea/fnut-11-1429259-g0006.jpg

相似文献

1
Zero-shot evaluation of ChatGPT for food named-entity recognition and linking.ChatGPT在食品命名实体识别与链接方面的零样本评估。
Front Nutr. 2024 Aug 13;11:1429259. doi: 10.3389/fnut.2024.1429259. eCollection 2024.
2
From Answers to Insights: Unveiling the Strengths and Limitations of ChatGPT and Biomedical Knowledge Graphs.从答案到见解:揭示ChatGPT与生物医学知识图谱的优势与局限
Res Sq. 2023 Aug 1:rs.3.rs-3185632. doi: 10.21203/rs.3.rs-3185632/v1.
3
From zero to hero: Harnessing transformers for biomedical named entity recognition in zero- and few-shot contexts.从零到英雄:利用变压器在零样本和少样本上下文中进行生物医学命名实体识别。
Artif Intell Med. 2024 Oct;156:102970. doi: 10.1016/j.artmed.2024.102970. Epub 2024 Aug 24.
4
Performance of ChatGPT on the Chinese Postgraduate Examination for Clinical Medicine: Survey Study.ChatGPT 在临床医学研究生入学考试中的表现:调查研究。
JMIR Med Educ. 2024 Feb 9;10:e48514. doi: 10.2196/48514.
5
Zero-Shot LLMs for Named Entity Recognition: Targeting Cardiac Function Indicators in German Clinical Texts.零样本语言模型在命名实体识别中的应用:以德国临床文本中的心脏功能指标为例。
Stud Health Technol Inform. 2024 Aug 30;317:228-234. doi: 10.3233/SHTI240861.
6
An extensive benchmark study on biomedical text generation and mining with ChatGPT.一项关于使用ChatGPT进行生物医学文本生成和挖掘的广泛基准研究。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad557.
7
Text summarization with ChatGPT for drug labeling documents.利用 ChatGPT 进行药物标签文件的文本摘要。
Drug Discov Today. 2024 Jun;29(6):104018. doi: 10.1016/j.drudis.2024.104018. Epub 2024 May 7.
8
Improving large language models for clinical named entity recognition via prompt engineering.通过提示工程改进临床命名实体识别的大型语言模型。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1812-1820. doi: 10.1093/jamia/ocad259.
9
How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment.ChatGPT在美国医师执照考试(USMLE)中的表现如何?大语言模型对医学教育和知识评估的影响。
JMIR Med Educ. 2023 Feb 8;9:e45312. doi: 10.2196/45312.
10
Beyond Tokens: Fair Evaluation of French Large Language Models for Clinical Named Entity Recognition.超越标记:对法语大型语言模型进行临床命名实体识别的公平评估。
Stud Health Technol Inform. 2024 Aug 22;316:666-670. doi: 10.3233/SHTI240502.

引用本文的文献

1
Large language models in clinical nutrition: an overview of its applications, capabilities, limitations, and potential future prospects.临床营养中的大语言模型:其应用、能力、局限性及潜在未来前景概述
Front Nutr. 2025 Aug 7;12:1635682. doi: 10.3389/fnut.2025.1635682. eCollection 2025.

本文引用的文献

1
"Food Is Medicine" Strategies for Nutrition Security and Cardiometabolic Health Equity: JACC State-of-the-Art Review.《食物是良药》:保障营养安全和实现心血管代谢健康公平的策略:美国心脏病学会最新综述。
J Am Coll Cardiol. 2024 Feb 27;83(8):843-864. doi: 10.1016/j.jacc.2023.12.023.
2
Improving large language models for clinical named entity recognition via prompt engineering.通过提示工程改进临床命名实体识别的大型语言模型。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1812-1820. doi: 10.1093/jamia/ocad259.
3
Opportunities and challenges for ChatGPT and large language models in biomedicine and health.
ChatGPT 和大型语言模型在生物医学和健康领域的机遇与挑战。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad493.
4
An extensive benchmark study on biomedical text generation and mining with ChatGPT.一项关于使用ChatGPT进行生物医学文本生成和挖掘的广泛基准研究。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad557.
5
Generative Artificial Intelligence and ChatGPT.生成式人工智能与ChatGPT
J Perianesth Nurs. 2023 Jun;38(3):519-522. doi: 10.1016/j.jopan.2023.04.001. Epub 2023 Apr 20.
6
CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources.自助餐厅 SA 语料库:在不同的食物语义资源中进行标注的科学摘要。
Database (Oxford). 2022 Dec 16;2022. doi: 10.1093/database/baac107.
7
CafeteriaFCD Corpus: Food Consumption Data Annotated with Regard to Different Food Semantic Resources.自助餐厅FCD语料库:关于不同食物语义资源标注的食物消费数据。
Foods. 2022 Sep 2;11(17):2684. doi: 10.3390/foods11172684.
8
A Fine-Tuned Bidirectional Encoder Representations From Transformers Model for Food Named-Entity Recognition: Algorithm Development and Validation.基于 Transformer 的双向编码器表示模型的精细调整在食品命名实体识别中的应用:算法开发与验证。
J Med Internet Res. 2021 Aug 9;23(8):e28229. doi: 10.2196/28229.
9
FoodBase corpus: a new resource of annotated food entities.FoodBase 语料库:一个新的带注释食物实体资源。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz121.
10
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.