Prompting large language models to extract chemical‒disease relation precisely and comprehensively at the document level: an evaluation study.

Authors

Chen Mei, Zhang Tingting, Wang Shibin

Affiliations

Key Laboratory of Ethnic Language Intelligent Analysis and Security Governance of MOE, Minzu University of China, Beijing 100081, China.

School of Information Engineering, Minzu University of China, Beijing 100081, China.

Publication information

PLoS One. 2025 Apr 8;20(4):e0320123. doi: 10.1371/journal.pone.0320123. eCollection 2025.

DOI: 10.1371/journal.pone.0320123
PMID: 40198724
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11978106/
Abstract

Because annotated data are scarce, current deep learning methods struggle with document-level chemical-disease relation extraction: they find it difficult to perform precise extraction, which identifies relation types, and comprehensive extraction, which identifies relation-related factors. This study tests the ability of three large language models (LLMs), GPT3.5, GPT4.0, and Claude-opus, to perform precise and comprehensive document-level chemical-disease relation extraction on a self-constructed dataset. First, based on the task characteristics, the study designs six workflows for precise extraction and five workflows for comprehensive extraction using prompt engineering strategies, and analyzes the characteristics of the extraction process through the performance differences between workflows. Second, it analyzes content bias in LLM extraction by examining how effectively different workflows extract different types of content. Finally, it analyzes the characteristics of the errors in the examples that the LLMs extract incorrectly. The experimental results show that: (1) the LLMs demonstrate good extraction capabilities, achieving highest F1 scores of 87% and 73% on precise extraction and comprehensive extraction, respectively; (2) in the extraction process, the LLMs exhibit a certain degree of stubbornness, and prompt engineering strategies have limited effect; (3) in terms of extraction content, the LLMs show a content bias, with a stronger ability to identify positive relations such as induction and acceleration; (4) extraction errors stem essentially from the LLMs' misunderstanding of implicit meanings in biomedical texts. This study provides practical workflows for precise and comprehensive extraction of document-level chemical-disease relations and also indicates that optimizing training data is the key to building more efficient and accurate extraction methods in the future.
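
The abstract reports results as F1 scores. As a rough illustration of how such a score can be computed for relation extraction, the sketch below scores predicted relations against gold annotations by exact match on (chemical, disease, relation type) triples. The exact-match criterion, the function name `precision_recall_f1`, and the example triples are all assumptions made for illustration; the abstract does not specify the paper's scoring protocol or data.

```python
from typing import Iterable, Tuple

# Hypothetical representation: (chemical, disease, relation_type).
Triple = Tuple[str, str, str]


def precision_recall_f1(
    predicted: Iterable[Triple], gold: Iterable[Triple]
) -> Tuple[float, float, float]:
    """Score predicted triples against gold triples by exact match.

    Assumption: a prediction counts as correct only if the chemical,
    the disease, and the relation type all match a gold triple.
    """
    pred_set, gold_set = set(predicted), set(gold)
    true_positives = len(pred_set & gold_set)
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Made-up example data, not taken from the paper's dataset.
gold = [
    ("cisplatin", "nephrotoxicity", "induces"),
    ("aspirin", "headache", "treats"),
    ("lithium", "tremor", "induces"),
]
predicted = [
    ("cisplatin", "nephrotoxicity", "induces"),
    ("aspirin", "headache", "induces"),  # wrong relation type
    ("lithium", "tremor", "induces"),
]

precision, recall, f1 = precision_recall_f1(predicted, gold)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Under this strict matching, a prediction with the right entity pair but the wrong relation type counts as both a false positive and a false negative, which is one reason type-aware ("precise") extraction is harder to score well on than pair-only extraction.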

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65c9/11978106/485d63e0e35b/pone.0320123.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65c9/11978106/8bab24f930fd/pone.0320123.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65c9/11978106/a00061349954/pone.0320123.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65c9/11978106/540bb67a8a10/pone.0320123.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65c9/11978106/e5258491826a/pone.0320123.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65c9/11978106/92edc473079a/pone.0320123.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65c9/11978106/9a6a0e0c76e0/pone.0320123.g007.jpg
