Large Language Model Agent for Modular Task Execution in Drug Discovery.

Authors

Ock Janghoon, Meda Radheesh Sharma, Badrinarayanan Srivathsan, Aluru Neha S, Chandrasekhar Achuth, Barati Farimani Amir

Affiliations

Department of Chemical Engineering, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, Pennsylvania 15213, United States.

Department of Chemical and Biomolecular Engineering, University of Nebraska-Lincoln, Lincoln, Nebraska 68588, United States.

Publication information

J Chem Inf Model. 2026 Feb 23;66(4):2055-2068. doi: 10.1021/acs.jcim.5c02454. Epub 2026 Feb 9.

DOI:10.1021/acs.jcim.5c02454
PMID:41662220
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12933718/
Abstract

We present a modular framework powered by large language models (LLMs) that automates and streamlines key tasks across the early-stage computational drug discovery pipeline. By combining LLM reasoning with domain-specific tools, the framework performs biomedical data retrieval, literature-grounded question answering via retrieval-augmented generation, molecular generation, multiproperty prediction, property-aware molecular refinement, and 3D protein-ligand structure generation. The agent autonomously retrieves relevant biomolecular information, including FASTA sequences, SMILES representations, and literature, and answers mechanistic questions with improved contextual accuracy compared to standard LLMs. It then generates chemically diverse seed molecules and predicts 75 properties, including ADMET-related and general physicochemical descriptors, which guide iterative molecular refinement. Across two refinement rounds, the number of molecules with QED > 0.6 increased from 34 to 55. The number of molecules satisfying empirical drug-likeness filters also rose; for example, compliance with the Ghose filter increased from 32 to 55 within a pool of 100 molecules. The framework also employed Boltz-2 to generate 3D protein-ligand complexes and provide rapid binding affinity estimates for candidate compounds. These results demonstrate that the approach effectively supports molecular screening, prioritization, and structure evaluation. Its modular design enables flexible integration of evolving tools and models, providing a scalable foundation for AI-assisted therapeutic discovery.
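
The screening counts reported in the abstract (molecules with QED > 0.6, molecules passing the Ghose filter) can be illustrated with a minimal sketch. It assumes the descriptors have already been computed by some property-prediction tool. The names `Descriptors`, `passes_ghose`, and `screen` are hypothetical and do not come from the paper. The Ghose ranges are the commonly cited ones (molecular weight 160 to 480, logP -0.4 to 5.6, molar refractivity 40 to 130, atom count 20 to 70), not values stated in the abstract, and the authors' implementation may differ.

```python
from dataclasses import dataclass
from typing import Dict, Iterable


@dataclass(frozen=True)
class Descriptors:
    """Precomputed descriptors for one molecule (hypothetical container)."""
    mol_weight: float          # molecular weight in Da
    log_p: float               # octanol-water partition coefficient
    molar_refractivity: float  # molar refractivity
    atom_count: int            # total atom count (counting convention is an assumption)
    qed: float                 # quantitative estimate of drug-likeness, in [0, 1]


def passes_ghose(d: Descriptors) -> bool:
    """Return True if all four commonly cited Ghose ranges are satisfied."""
    return (
        160.0 <= d.mol_weight <= 480.0
        and -0.4 <= d.log_p <= 5.6
        and 40.0 <= d.molar_refractivity <= 130.0
        and 20 <= d.atom_count <= 70
    )


def screen(pool: Iterable[Descriptors], qed_threshold: float = 0.6) -> Dict[str, int]:
    """Count how many molecules in the pool pass each criterion."""
    molecules = list(pool)
    return {
        "n": len(molecules),
        "qed_pass": sum(d.qed > qed_threshold for d in molecules),
        "ghose_pass": sum(passes_ghose(d) for d in molecules),
    }
```

Calling `screen` on the pool before and after each refinement round would give before/after counts of the kind quoted in the abstract. The descriptor values themselves must come from an external tool, which this sketch does not cover.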


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/335e/12933718/4761de1d2f3a/ci5c02454_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/335e/12933718/700992724b06/ci5c02454_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/335e/12933718/e7fcec72a8c3/ci5c02454_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/335e/12933718/45af0cffd925/ci5c02454_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/335e/12933718/48764a5c12c0/ci5c02454_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/335e/12933718/8a7f68d66e9c/ci5c02454_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/335e/12933718/cf7726d6a654/ci5c02454_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/335e/12933718/e42be2b941ad/ci5c02454_0007.jpg
