Artificial Intelligence-assisted Biomedical Literature Knowledge Synthesis to Support Decision-making in Precision Oncology.

Authors

He Ting, Kreimeyer Kory, Najjar Mimi, Spiker Jonathan, Fatteh Maria, Anagnostou Valsamo, Botsis Taxiarchis

Affiliations

Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD.

Division of Quantitative Sciences, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD.

Publication information

AMIA Annu Symp Proc. 2025 May 22;2024:513-522. eCollection 2024.

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC12099343/

Abstract

Delivering effective targeted therapies requires comprehensive analysis of tumor molecular profiles and their matching with clinical phenotypes, interpreted in the context of existing knowledge described in the biomedical literature, registries, and knowledge bases. We evaluated the performance of natural language processing (NLP) approaches in supporting knowledge retrieval and synthesis from the biomedical literature. We tested PubTator 3.0, Bidirectional Encoder Representations from Transformers (BERT), and Large Language Models (LLMs), and evaluated their ability to support named entity recognition (NER) and relation extraction (RE) from biomedical texts. PubTator 3.0 and the BioBERT model performed best in the NER task (best F1-scores of 0.93 and 0.89, respectively), while BioBERT outperformed all other solutions in the RE task (best F1-score of 0.79). BioBERT also performed best in the specific use case it was applied to, recognizing nearly all entity mentions and most of the relations. Our findings support the use of AI-assisted approaches in facilitating precision oncology decision-making.
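
The abstract reports NER and RE performance as F1-scores. As a reminder of what that metric measures, the following is a minimal sketch that assumes exact-match, entity-level scoring; the abstract does not state the matching criterion the authors used, and the `Entity` tuple layout, the function name, and the sample annotations are all hypothetical.

```python
from typing import Set, Tuple

# Assumed representation: (start offset, end offset, entity type).
Entity = Tuple[int, int, str]


def precision_recall_f1(gold: Set[Entity], predicted: Set[Entity]) -> Tuple[float, float, float]:
    """Exact-match precision, recall, and F1 between gold and predicted entities."""
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0, 0.0, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical annotations for one passage (not data from the study).
gold = {(0, 4, "Gene"), (10, 15, "Variant"), (20, 30, "Disease"), (35, 44, "Chemical")}
predicted = {(0, 4, "Gene"), (10, 15, "Variant"), (20, 30, "Chemical")}

precision, recall, f1 = precision_recall_f1(gold, predicted)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

In this toy case the third prediction has the right span but the wrong type, so under exact matching it counts as an error. The same function can score relations if each relation is encoded as a hashable tuple, for example (head entity, tail entity, relation type).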
