• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用大型预训练语言模型进行少样本药物对协同作用预测的CancerGPT

CancerGPT for few shot drug pair synergy prediction using large pretrained language models.

作者信息

Li Tianhao, Shetty Sandesh, Kamath Advaith, Jaiswal Ajay, Jiang Xiaoqian, Ding Ying, Kim Yejin

机构信息

School of Information, University of Texas at Austin, Austin, TX, USA.

Manning College of Information and Computer Sciences, University of Massachusetts Amherst, Amherst, MA, USA.

出版信息

NPJ Digit Med. 2024 Feb 19;7(1):40. doi: 10.1038/s41746-024-01024-9.

DOI:10.1038/s41746-024-01024-9
PMID:38374445
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10876664/
Abstract

Large language models (LLMs) have been shown to have significant potential in few-shot learning across various fields, even with minimal training data. However, their ability to generalize to unseen tasks in more complex fields, such as biology and medicine has yet to be fully evaluated. LLMs can offer a promising alternative approach for biological inference, particularly in cases where structured data and sample size are limited, by extracting prior knowledge from text corpora. Here we report our proposed few-shot learning approach, which uses LLMs to predict the synergy of drug pairs in rare tissues that lack structured data and features. Our experiments, which involved seven rare tissues from different cancer types, demonstrate that the LLM-based prediction model achieves significant accuracy with very few or zero samples. Our proposed model, the CancerGPT (with ~ 124M parameters), is comparable to the larger fine-tuned GPT-3 model (with ~ 175B parameters). Our research contributes to tackling drug pair synergy prediction in rare tissues with limited data, and also advancing the use of LLMs for biological and medical inference tasks.

摘要

大语言模型(LLMs)已被证明在跨领域的少样本学习中具有巨大潜力,即使训练数据极少。然而,它们在更复杂领域(如生物学和医学)中推广到未见任务的能力尚未得到充分评估。大语言模型可以通过从文本语料库中提取先验知识,为生物推理提供一种有前景的替代方法,特别是在结构化数据和样本量有限的情况下。在此,我们报告我们提出的少样本学习方法,该方法使用大语言模型来预测缺乏结构化数据和特征的罕见组织中药物对的协同作用。我们的实验涉及来自不同癌症类型的七种罕见组织,结果表明基于大语言模型的预测模型在极少样本或零样本的情况下就能达到显著的准确率。我们提出的模型CancerGPT(约1.24亿个参数)与更大的微调GPT-3模型(约1750亿个参数)相当。我们的研究有助于解决数据有限的罕见组织中的药物对协同作用预测问题,也推动了大语言模型在生物和医学推理任务中的应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/6c94e1409205/41746_2024_1024_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/31117ef7d039/41746_2024_1024_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/4543fa808ea5/41746_2024_1024_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/0fbf4177fa5d/41746_2024_1024_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/b582ed16ba21/41746_2024_1024_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/a6f3b86e5866/41746_2024_1024_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/6c94e1409205/41746_2024_1024_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/31117ef7d039/41746_2024_1024_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/4543fa808ea5/41746_2024_1024_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/0fbf4177fa5d/41746_2024_1024_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/b582ed16ba21/41746_2024_1024_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/a6f3b86e5866/41746_2024_1024_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a45/10876664/6c94e1409205/41746_2024_1024_Fig6_HTML.jpg

相似文献

1
CancerGPT for few shot drug pair synergy prediction using large pretrained language models.使用大型预训练语言模型进行少样本药物对协同作用预测的CancerGPT
NPJ Digit Med. 2024 Feb 19;7(1):40. doi: 10.1038/s41746-024-01024-9.
2
CancerGPT: Few-shot Drug Pair Synergy Prediction using Large Pre-trained Language Models.CancerGPT:使用大型预训练语言模型进行少样本药物对协同作用预测。
ArXiv. 2023 Apr 18:arXiv:2304.10946v1.
3
An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing: Algorithm Development and Validation Study.零样本临床自然语言处理中大型语言模型提示策略的实证评估:算法开发与验证研究
JMIR Med Inform. 2024 Apr 8;12:e55318. doi: 10.2196/55318.
4
Evaluating large language models for health-related text classification tasks with public social media data.利用公共社交媒体数据评估用于健康相关文本分类任务的大型语言模型。
J Am Med Inform Assoc. 2024 Oct 1;31(10):2181-2189. doi: 10.1093/jamia/ocae210.
5
A comprehensive evaluation of large Language models on benchmark biomedical text processing tasks.对基准生物医学文本处理任务中大型语言模型的全面评估。
Comput Biol Med. 2024 Mar;171:108189. doi: 10.1016/j.compbiomed.2024.108189. Epub 2024 Feb 20.
6
Learning to Make Rare and Complex Diagnoses With Generative AI Assistance: Qualitative Study of Popular Large Language Models.利用生成式人工智能辅助学习罕见且复杂的诊断:对流行的大型语言模型的定性研究。
JMIR Med Educ. 2024 Feb 13;10:e51391. doi: 10.2196/51391.
7
Exploring Large Language Models for Detecting Online Vaccine Reactions.探索大型语言模型以检测在线疫苗反应。
Stud Health Technol Inform. 2024 Sep 24;318:30-35. doi: 10.3233/SHTI240887.
8
Automated Extraction of Patient-Centered Outcomes After Breast Cancer Treatment: An Open-Source Large Language Model-Based Toolkit.基于开源大语言模型的乳腺癌治疗后患者为中心结局自动提取工具包。
JCO Clin Cancer Inform. 2024 Aug;8:e2300258. doi: 10.1200/CCI.23.00258.
9
Unlocking the Secrets Behind Advanced Artificial Intelligence Language Models in Deidentifying Chinese-English Mixed Clinical Text: Development and Validation Study.揭开高级人工智能语言模型在去识别汉英混合临床文本背后的秘密:开发与验证研究。
J Med Internet Res. 2024 Jan 25;26:e48443. doi: 10.2196/48443.
10
Harnessing large language models' zero-shot and few-shot learning capabilities for regulatory research.利用大型语言模型的零样本和少样本学习能力进行监管研究。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae354.

引用本文的文献

1
Mathematical models and computational approaches in CAR-T therapeutics.嵌合抗原受体T细胞(CAR-T)疗法中的数学模型与计算方法。
Front Immunol. 2025 Aug 1;16:1581210. doi: 10.3389/fimmu.2025.1581210. eCollection 2025.
2
Protein Spatial Structure Meets Artificial Intelligence: Revolutionizing Drug Synergy-Antagonism in Precision Medicine.蛋白质空间结构与人工智能相遇:革新精准医学中的药物协同 - 拮抗作用
Adv Sci (Weinh). 2025 Sep;12(33):e07764. doi: 10.1002/advs.202507764. Epub 2025 Aug 7.
3
Development and evaluation of large-language models (LLMs) for oncology: A scoping review.

本文引用的文献

1
Foundation models for generalist medical artificial intelligence.通用型医学人工智能的基础模型。
Nature. 2023 Apr;616(7956):259-265. doi: 10.1038/s41586-023-05881-4. Epub 2023 Apr 12.
2
The debate over understanding in AI's large language models.人工智能大型语言模型中的理解之争。
Proc Natl Acad Sci U S A. 2023 Mar 28;120(13):e2215907120. doi: 10.1073/pnas.2215907120. Epub 2023 Mar 21.
3
CCSynergy: an integrative deep-learning framework enabling context-aware prediction of anti-cancer drug synergy.CCSynergy:一种集成深度学习框架,能够实现基于上下文的抗癌药物协同作用预测。
用于肿瘤学的大语言模型的开发与评估:一项范围综述。
PLOS Digit Health. 2025 Aug 7;4(8):e0000980. doi: 10.1371/journal.pdig.0000980. eCollection 2025 Aug.
4
CRISPR-GPT for agentic automation of gene-editing experiments.用于基因编辑实验自主自动化的CRISPR-GPT
Nat Biomed Eng. 2025 Jul 30. doi: 10.1038/s41551-025-01463-z.
5
Prediction of pathogenic mutations in human transmembrane proteins and their associated diseases via utilizing pre-trained Bio-LLMs.利用预训练的生物语言模型预测人类跨膜蛋白中的致病突变及其相关疾病。
Commun Biol. 2025 Jul 15;8(1):1050. doi: 10.1038/s42003-025-08452-7.
6
Systematic benchmarking of large Language models in programmed cell death-oriented gastric cancer research: a comparative analysis of DeepSeek‑V3, DeepSeek‑R1, and Claude 3.5.程序性细胞死亡导向的胃癌研究中大型语言模型的系统基准测试:DeepSeek-V3、DeepSeek-R1和Claude 3.5的比较分析
Discov Oncol. 2025 Jul 1;16(1):1227. doi: 10.1007/s12672-025-02911-7.
7
Large language models in oncology: a review.肿瘤学中的大语言模型:综述
BMJ Oncol. 2025 May 15;4(1):e000759. doi: 10.1136/bmjonc-2025-000759. eCollection 2025.
8
Enhancing Pulmonary Disease Prediction Using Large Language Models With Feature Summarization and Hybrid Retrieval-Augmented Generation: Multicenter Methodological Study Based on Radiology Report.使用具有特征总结和混合检索增强生成功能的大语言模型增强肺部疾病预测:基于放射学报告的多中心方法学研究
J Med Internet Res. 2025 Jun 11;27:e72638. doi: 10.2196/72638.
9
Large Language Models in Genomics-A Perspective on Personalized Medicine.基因组学中的大语言模型——个性化医疗视角
Bioengineering (Basel). 2025 Apr 23;12(5):440. doi: 10.3390/bioengineering12050440.
10
Optimizing Large Language Models for Discharge Prediction: Best Practices in Leveraging Electronic Health Record Audit Logs.优化用于出院预测的大语言模型:利用电子健康记录审计日志的最佳实践
AMIA Annu Symp Proc. 2025 May 22;2024:1323-1331. eCollection 2024.
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac588.
4
Multi-way relation-enhanced hypergraph representation learning for anti-cancer drug synergy prediction.多向关系增强超图表示学习在抗癌药物协同作用预测中的应用。
Bioinformatics. 2022 Oct 14;38(20):4782-4789. doi: 10.1093/bioinformatics/btac579.
5
SynPathy: Predicting Drug Synergy through Drug-Associated Pathways Using Deep Learning.SynPathy:利用深度学习通过药物相关通路预测药物协同作用。
Mol Cancer Res. 2022 May 4;20(5):762-769. doi: 10.1158/1541-7786.MCR-21-0735.
6
MatchMaker: A Deep Learning Framework for Drug Synergy Prediction.MatchMaker:一种用于药物协同作用预测的深度学习框架。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2334-2344. doi: 10.1109/TCBB.2021.3086702. Epub 2022 Aug 8.
7
TranSynergy: Mechanism-driven interpretable deep neural network for the synergistic prediction and pathway deconvolution of drug combinations.TranSynergy:用于药物组合协同预测和途径解卷积的基于机制的可解释深度神经网络。
PLoS Comput Biol. 2021 Feb 12;17(2):e1008653. doi: 10.1371/journal.pcbi.1008653. eCollection 2021 Feb.
8
Global computational alignment of tumor and cell line transcriptional profiles.肿瘤和细胞系转录谱的全局计算比对。
Nat Commun. 2021 Jan 4;12(1):22. doi: 10.1038/s41467-020-20294-x.
9
Anticancer drug synergy prediction in understudied tissues using transfer learning.利用迁移学习预测研究较少的组织中的抗癌药物协同作用。
J Am Med Inform Assoc. 2021 Jan 15;28(1):42-51. doi: 10.1093/jamia/ocaa212.
10
DTF: Deep Tensor Factorization for predicting anticancer drug synergy.DTF:用于预测抗癌药物协同作用的深度张量分解。
Bioinformatics. 2020 Aug 15;36(16):4483-4489. doi: 10.1093/bioinformatics/btaa287.