• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质重新设计中的人工智能与第一性原理方法:权宜之计的结合?

Artificial intelligence and first-principle methods in protein redesign: A marriage of convenience?

作者信息

Cianferoni Damiano, Vizarraga David, Fernández-Escamilla Ana María, Fita Ignacio, Hamdani Rahma, Reche Raul, Delgado Javier, Serrano Luis

机构信息

Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Barcelona, Spain.

Universitat de Barcelona (UB), Barcelona, Spain.

出版信息

Protein Sci. 2025 Aug;34(8):e70210. doi: 10.1002/pro.70210.

DOI:10.1002/pro.70210
PMID:40671352
Abstract

Since AlphaFold2's rise, many deep learning methods for protein design have emerged. Here, we validate widely used and recognized tools, compare them with first-principle methods, and explore their combinations, focusing on their effectiveness in protein redesign and potential for therapeutic repurposing. We address two challenges: evaluating tools and combinations ability to detect the effects of multiple concurrent mutations in protein variants, and leveraging large-scale datasets to compare modeling-free methods, namely force fields, which handle point mutations well with limited backbone rearrangement, and inverse folding tools, which excel at native sequence recovery but may struggle with non-natural proteins. Debuting TriCombine, a tool that identifies residue triangles in input structures, matches them to a structural database, and scores mutants based on substitution frequencies, we shortlisted candidates, modeled them with FoldX, and generated 16 SH3 mutants carrying up to 9 concurrent substitutions. The dataset was expanded to include 36 mutants and 11 crystal structures (7 newly solved), along with a parallel set of multiple non-concurrent mutants from three additional proteins. For broader validation, we analyzed 160,000 four-site GB1 mutants and 163,555 (single and double) variants across 179 natural and de novo domains. We show that combining AI-based modeling tools with force field scoring functions yields the most reliable results. Inverse folding tools perform very well but lose accuracy on less-represented proteins. First-principle force fields like FoldX remain the most accurate for point mutations. All methods perform worse when applied to unsolved de novo models, underscoring the need for hybrid strategies in robust protein design.

摘要

自AlphaFold2兴起以来,出现了许多用于蛋白质设计的深度学习方法。在此,我们对广泛使用和认可的工具进行验证,将它们与第一性原理方法进行比较,并探索它们的组合方式,重点关注它们在蛋白质重新设计中的有效性以及治疗性重新利用的潜力。我们解决了两个挑战:评估工具及其组合检测蛋白质变体中多个并发突变影响的能力,以及利用大规模数据集比较无模型方法,即力场(能很好地处理点突变且主链重排有限)和逆折叠工具(在天然序列恢复方面表现出色,但可能难以处理非天然蛋白质)。我们推出了TriCombine工具,该工具可识别输入结构中的残基三角形,将它们与结构数据库进行匹配,并根据替换频率对突变体进行评分。我们筛选出候选者,用FoldX对它们进行建模,并生成了16个携带多达9个并发替换的SH3突变体。数据集得到扩展,包括36个突变体和11个晶体结构(7个新解析的),以及来自另外三种蛋白质的一组平行的多个非并发突变体。为了进行更广泛的验证,我们分析了160,000个四位点GB1突变体以及179个天然和从头设计结构域中的163,555个(单突变和双突变)变体。我们表明,将基于人工智能的建模工具与力场评分函数相结合可产生最可靠的结果。逆折叠工具表现非常出色,但在代表性较差的蛋白质上会失去准确性。像FoldX这样的第一性原理力场在点突变方面仍然是最准确的。当应用于未解析的从头设计模型时,所有方法的性能都会变差,这突出了在稳健的蛋白质设计中采用混合策略的必要性。

相似文献

1
Artificial intelligence and first-principle methods in protein redesign: A marriage of convenience?蛋白质重新设计中的人工智能与第一性原理方法:权宜之计的结合?
Protein Sci. 2025 Aug;34(8):e70210. doi: 10.1002/pro.70210.
2
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
3
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
4
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
5
Drugs for preventing postoperative nausea and vomiting in adults after general anaesthesia: a network meta-analysis.成人全身麻醉后预防术后恶心呕吐的药物:网状Meta分析
Cochrane Database Syst Rev. 2020 Oct 19;10(10):CD012859. doi: 10.1002/14651858.CD012859.pub2.
6
The quantity, quality and findings of network meta-analyses evaluating the effectiveness of GLP-1 RAs for weight loss: a scoping review.评估胰高血糖素样肽-1受体激动剂(GLP-1 RAs)减肥效果的网状Meta分析的数量、质量及结果:一项范围综述
Health Technol Assess. 2025 Jun 25:1-73. doi: 10.3310/SKHT8119.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
8
Artificial intelligence for detecting keratoconus.人工智能在圆锥角膜检测中的应用。
Cochrane Database Syst Rev. 2023 Nov 15;11(11):CD014911. doi: 10.1002/14651858.CD014911.pub2.
9
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.

引用本文的文献

1
Predicting interacting hotspots for nanobodies' binding using triplets of residues.利用残基三联体预测纳米抗体结合的相互作用热点。
Protein Sci. 2025 Aug;34(8):e70220. doi: 10.1002/pro.70220.

本文引用的文献

1
FoldX force field revisited, an improved version.重新审视的FoldX力场,一个改进版本。
Bioinformatics. 2025 Feb 4;41(2). doi: 10.1093/bioinformatics/btaf064.
2
Unsupervised evolution of protein and antibody complexes with a structure-informed language model.无监督的蛋白质和抗体复合物的进化与结构信息语言模型。
Science. 2024 Jul 5;385(6704):46-53. doi: 10.1126/science.adk8946. Epub 2024 Jul 4.
3
OpenFold: retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization.OpenFold:重新训练 AlphaFold2 可深入了解其学习机制和泛化能力。
Nat Methods. 2024 Aug;21(8):1514-1524. doi: 10.1038/s41592-024-02272-z. Epub 2024 May 14.
4
Opportunities and challenges in design and optimization of protein function.蛋白质功能设计与优化的机遇与挑战。
Nat Rev Mol Cell Biol. 2024 Aug;25(8):639-653. doi: 10.1038/s41580-024-00718-y. Epub 2024 Apr 2.
5
Protein design using structure-based residue preferences.基于结构的残基偏好的蛋白质设计。
Nat Commun. 2024 Feb 22;15(1):1639. doi: 10.1038/s41467-024-45621-4.
6
Chroma is a generative model for protein design.Chroma是一种用于蛋白质设计的生成模型。
Nat Methods. 2024 Jan;21(1):10. doi: 10.1038/s41592-023-02155-9.
7
Mega-scale experimental analysis of protein folding stability in biology and design.大规模实验分析生物学和设计中的蛋白质折叠稳定性。
Nature. 2023 Aug;620(7973):434-444. doi: 10.1038/s41586-023-06328-6. Epub 2023 Jul 19.
8
De novo design of protein structure and function with RFdiffusion.利用 RFdiffusion 从头设计蛋白质结构和功能。
Nature. 2023 Aug;620(7976):1089-1100. doi: 10.1038/s41586-023-06415-8. Epub 2023 Jul 11.
9
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
10
Robust deep learning-based protein sequence design using ProteinMPNN.使用 ProteinMPNN 进行健壮的基于深度学习的蛋白质序列设计。
Science. 2022 Oct 7;378(6615):49-56. doi: 10.1126/science.add2187. Epub 2022 Sep 15.