• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ProtAgents:蛋白质发现大型语言模型,结合物理和机器学习的多智能体协作。

ProtAgents: protein discovery large language model multi-agent collaborations combining physics and machine learning.

作者信息

Ghafarollahi Alireza, Buehler Markus J

机构信息

Laboratory for Atomistic and Molecular Mechanics (LAMM), Massachusetts Institute of Technology 77 Massachusetts Ave. Cambridge MA 02139 USA

Center for Computational Science and Engineering, Schwarzman College of Computing, Massachusetts Institute of Technology 77 Massachusetts Ave. Cambridge MA 02139 USA.

出版信息

Digit Discov. 2024 May 17;3(7):1389-1409. doi: 10.1039/d4dd00013g. eCollection 2024 Jul 10.

DOI:10.1039/d4dd00013g
PMID:38993729
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11235180/
Abstract

Designing proteins beyond those found in nature holds significant promise for advancements in both scientific and engineering applications. Current methodologies for protein design often rely on AI-based models, such as surrogate models that address end-to-end problems by linking protein structure to material properties or . However, these models frequently focus on specific material objectives or structural properties, limiting their flexibility when incorporating out-of-domain knowledge into the design process or comprehensive data analysis is required. In this study, we introduce ProtAgents, a platform for protein design based on Large Language Models (LLMs), where multiple AI agents with distinct capabilities collaboratively address complex tasks within a dynamic environment. The versatility in agent development allows for expertise in diverse domains, including knowledge retrieval, protein structure analysis, physics-based simulations, and results analysis. The dynamic collaboration between agents, empowered by LLMs, provides a versatile approach to tackling protein design and analysis problems, as demonstrated through diverse examples in this study. The problems of interest encompass designing new proteins, analyzing protein structures and obtaining new first-principles data - natural vibrational frequencies - physics simulations. The concerted effort of the system allows for powerful automated and synergistic design of proteins with targeted mechanical properties. The flexibility in designing the agents, on one hand, and their capacity in autonomous collaboration through the dynamic LLM-based multi-agent environment on the other hand, unleashes great potentials of LLMs in addressing multi-objective materials problems and opens up new avenues for autonomous materials discovery and design.

摘要

设计自然界中不存在的蛋白质在科学和工程应用的进步方面具有重大前景。当前的蛋白质设计方法通常依赖基于人工智能的模型,例如通过将蛋白质结构与材料特性联系起来解决端到端问题的替代模型。然而,这些模型经常专注于特定的材料目标或结构特性,在将域外知识纳入设计过程或需要进行全面数据分析时,限制了它们的灵活性。在本研究中,我们引入了ProtAgents,这是一个基于大语言模型(LLMs)的蛋白质设计平台,其中多个具有不同能力的人工智能代理在动态环境中协作解决复杂任务。代理开发的多功能性允许在包括知识检索、蛋白质结构分析、基于物理的模拟和结果分析等不同领域拥有专业知识。由大语言模型赋能的代理之间的动态协作提供了一种通用方法来解决蛋白质设计和分析问题,正如本研究中的各种示例所示。感兴趣的问题包括设计新蛋白质、分析蛋白质结构以及获得新的第一性原理数据——自然振动频率——进行物理模拟。系统的协同努力允许对具有目标机械性能的蛋白质进行强大的自动化和协同设计。一方面,设计代理的灵活性,另一方面,它们通过基于大语言模型的动态多代理环境进行自主协作的能力,释放了大语言模型在解决多目标材料问题方面的巨大潜力,并为自主材料发现和设计开辟了新途径。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/78d607ee6974/d4dd00013g-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/be21e50215cc/d4dd00013g-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/11dbe83e6d57/d4dd00013g-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/addbac4a4df2/d4dd00013g-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/2de21aa939d1/d4dd00013g-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/0b8d4a0749ee/d4dd00013g-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/78d607ee6974/d4dd00013g-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/be21e50215cc/d4dd00013g-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/11dbe83e6d57/d4dd00013g-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/addbac4a4df2/d4dd00013g-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/2de21aa939d1/d4dd00013g-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/0b8d4a0749ee/d4dd00013g-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e74c/11235180/78d607ee6974/d4dd00013g-f6.jpg

相似文献

1
ProtAgents: protein discovery large language model multi-agent collaborations combining physics and machine learning.ProtAgents:蛋白质发现大型语言模型,结合物理和机器学习的多智能体协作。
Digit Discov. 2024 May 17;3(7):1389-1409. doi: 10.1039/d4dd00013g. eCollection 2024 Jul 10.
2
Generative Retrieval-Augmented Ontologic Graph and Multiagent Strategies for Interpretive Large Language Model-Based Materials Design.基于生成式检索增强本体图和多智能体策略的基于解释性大语言模型的材料设计
ACS Eng Au. 2024 Jan 12;4(2):241-277. doi: 10.1021/acsengineeringau.3c00058. eCollection 2024 Apr 17.
3
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
4
Learning to Make Rare and Complex Diagnoses With Generative AI Assistance: Qualitative Study of Popular Large Language Models.利用生成式人工智能辅助学习罕见且复杂的诊断:对流行的大型语言模型的定性研究。
JMIR Med Educ. 2024 Feb 13;10:e51391. doi: 10.2196/51391.
5
An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing: Algorithm Development and Validation Study.零样本临床自然语言处理中大型语言模型提示策略的实证评估:算法开发与验证研究
JMIR Med Inform. 2024 Apr 8;12:e55318. doi: 10.2196/55318.
6
Augmenting large language models with chemistry tools.用化学工具增强大语言模型。
Nat Mach Intell. 2024;6(5):525-535. doi: 10.1038/s42256-024-00832-8. Epub 2024 May 8.
7
Integrating human expertise & automated methods for a dynamic and multi-parametric evaluation of large language models' feasibility in clinical decision-making.整合人类专业知识和自动化方法,对大型语言模型在临床决策中的可行性进行动态和多参数评估。
Int J Med Inform. 2024 Aug;188:105501. doi: 10.1016/j.ijmedinf.2024.105501. Epub 2024 May 26.
8
Assessing the Alignment of Large Language Models With Human Values for Mental Health Integration: Cross-Sectional Study Using Schwartz's Theory of Basic Values.评估大型语言模型与人类心理健康整合价值观的一致性:使用施瓦茨基本价值观理论的横断面研究。
JMIR Ment Health. 2024 Apr 9;11:e55988. doi: 10.2196/55988.
9
A Systematic Review of ChatGPT and Other Conversational Large Language Models in Healthcare.ChatGPT及其他对话式大语言模型在医疗保健领域的系统评价
medRxiv. 2024 Apr 27:2024.04.26.24306390. doi: 10.1101/2024.04.26.24306390.
10
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

引用本文的文献

1
A vision of human-AI collaboration for enhanced biological collection curation and research.关于人类与人工智能合作以加强生物样本管理和研究的愿景。
Bioscience. 2025 Mar 28;75(6):457-471. doi: 10.1093/biosci/biaf021. eCollection 2025 Jun.
2
A conceptual framework for human-AI collaborative genome annotation.人类与人工智能协作进行基因组注释的概念框架。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf377.
3
A Survey of Pretrained Protein Language Models.预训练蛋白质语言模型综述

本文引用的文献

1
Augmenting large language models with chemistry tools.用化学工具增强大语言模型。
Nat Mach Intell. 2024;6(5):525-535. doi: 10.1038/s42256-024-00832-8. Epub 2024 May 8.
2
Generative Retrieval-Augmented Ontologic Graph and Multiagent Strategies for Interpretive Large Language Model-Based Materials Design.基于生成式检索增强本体图和多智能体策略的基于解释性大语言模型的材料设计
ACS Eng Au. 2024 Jan 12;4(2):241-277. doi: 10.1021/acsengineeringau.3c00058. eCollection 2024 Apr 17.
3
ForceGen: End-to-end de novo protein generation based on nonlinear mechanical unfolding responses using a language diffusion model.
Methods Mol Biol. 2025;2941:1-29. doi: 10.1007/978-1-0716-4623-6_1.
4
Learning the rules of peptide self-assembly through data mining with large language models.通过使用大语言模型进行数据挖掘来学习肽自组装的规则。
Sci Adv. 2025 Mar 28;11(13):eadv1971. doi: 10.1126/sciadv.adv1971. Epub 2025 Mar 26.
5
Molecular analysis and design using generative artificial intelligence multi-agent modeling.使用生成式人工智能多智能体建模的分子分析与设计
Mol Syst Des Eng. 2025 Jan 24;10(4):314-337. doi: 10.1039/d4me00174e. eCollection 2025 Mar 31.
6
Automating alloy design and discovery with physics-aware multimodal multiagent AI.利用物理感知多模态多智能体人工智能实现合金设计与发现的自动化。
Proc Natl Acad Sci U S A. 2025 Jan 28;122(4):e2414074122. doi: 10.1073/pnas.2414074122. Epub 2025 Jan 24.
7
Deep learning and generative artificial intelligence in aging research and healthy longevity medicine.深度学习与生成式人工智能在衰老研究和健康长寿医学中的应用
Aging (Albany NY). 2025 Jan 16;17(1):251-275. doi: 10.18632/aging.206190.
8
A review of large language models and autonomous agents in chemistry.化学领域中大型语言模型与自主智能体的综述。
Chem Sci. 2024 Dec 9;16(6):2514-2572. doi: 10.1039/d4sc03921a. eCollection 2025 Feb 5.
9
The Application of Machine Learning on Antibody Discovery and Optimization.机器学习在抗体发现与优化中的应用
Molecules. 2024 Dec 16;29(24):5923. doi: 10.3390/molecules29245923.
10
SciAgents: Automating Scientific Discovery Through Bioinspired Multi-Agent Intelligent Graph Reasoning.科学智能体:通过受生物启发的多智能体智能图推理实现科学发现自动化
Adv Mater. 2025 Jun;37(22):e2413523. doi: 10.1002/adma.202413523. Epub 2024 Dec 18.
ForceGen:基于语言扩散模型的非线性机械展开响应的从头开始的蛋白质从头生成。
Sci Adv. 2024 Feb 9;10(6):eadl4000. doi: 10.1126/sciadv.adl4000. Epub 2024 Feb 7.
4
Autonomous chemical research with large language models.大语言模型驱动的自主化学研究。
Nature. 2023 Dec;624(7992):570-578. doi: 10.1038/s41586-023-06792-0. Epub 2023 Dec 20.
5
14 examples of how LLMs can transform materials science and chemistry: a reflection on a large language model hackathon.大语言模型如何改变材料科学与化学的14个实例:对一场大语言模型黑客马拉松的思考
Digit Discov. 2023 Aug 8;2(5):1233-1250. doi: 10.1039/d3dd00113j. eCollection 2023 Oct 9.
6
Illuminating protein space with a programmable generative model.用可编程生成模型照亮蛋白质空间。
Nature. 2023 Nov;623(7989):1070-1078. doi: 10.1038/s41586-023-06728-8. Epub 2023 Nov 15.
7
Generative design of proteins based on secondary structure constraints using an attention-based diffusion model.基于二级结构约束,使用基于注意力的扩散模型进行蛋白质的生成式设计。
Chem. 2023 Jul 13;9(7):1828-1849. doi: 10.1016/j.chempr.2023.03.020. Epub 2023 Apr 20.
8
De novo design of protein structure and function with RFdiffusion.利用 RFdiffusion 从头设计蛋白质结构和功能。
Nature. 2023 Aug;620(7976):1089-1100. doi: 10.1038/s41586-023-06415-8. Epub 2023 Jul 11.
9
Fine-tuning large neural language models for biomedical natural language processing.针对生物医学自然语言处理对大型神经语言模型进行微调。
Patterns (N Y). 2023 Apr 14;4(4):100729. doi: 10.1016/j.patter.2023.100729.
10
End-to-End Protein Normal Mode Frequency Predictions Using Language and Graph Models and Application to Sonification.使用语言和图形模型进行端到端蛋白质正常模式频率预测及其在可听化中的应用。
ACS Nano. 2022 Dec 27;16(12):20656-20670. doi: 10.1021/acsnano.2c07681. Epub 2022 Nov 23.