Miao Chen, Zhang Zhenghao, Chen Jiamin, Rebibo Daniel, Wu Haoran, Fung Sin-Hang, Cheng Alfred Sze-Lok, Tsui Stephen Kwok-Wing, Sinha Sanju, Cao Qin, Yip Kevin Y
School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong.
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong.
Comput Struct Biotechnol J. 2025 Jul 24;27:3299-3306. doi: 10.1016/j.csbj.2025.07.042. eCollection 2025.
While large language models (LLMs) have shown promising capabilities in biomedical applications, measuring their reliability in knowledge extraction remains a challenge. We developed a benchmark to compare LLMs on 11 literature knowledge extraction tasks that are foundational to automatic knowledgebase development, with or without task-specific examples supplied. We found large variation in performance across the LLMs, depending on the level of technical specialization, the difficulty of the tasks, the scattering of the original information, and the format and terminology standardization requirements. We also found that asking the LLMs to provide the source text behind their answers is useful for overcoming some key challenges, but that specifying this requirement in the prompt is difficult.