• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用光重量化图去噪神经网络进行蛋白质工程。

Protein Engineering with Lightweight Graph Denoising Neural Networks.

机构信息

Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China.

Shanghai National Center for Applied Mathematics (SJTU Center), Shanghai 200240, China.

出版信息

J Chem Inf Model. 2024 May 13;64(9):3650-3661. doi: 10.1021/acs.jcim.4c00036. Epub 2024 Apr 17.

DOI:10.1021/acs.jcim.4c00036
PMID:38630581
Abstract

Protein engineering faces challenges in finding optimal mutants from a massive pool of candidate mutants. In this study, we introduce a deep-learning-based data-efficient fitness prediction tool to steer protein engineering. Our methodology establishes a lightweight graph neural network scheme for protein structures, which efficiently analyzes the microenvironment of amino acids in wild-type proteins and reconstructs the distribution of the amino acid sequences that are more likely to pass natural selection. This distribution serves as a general guidance for scoring proteins toward arbitrary properties on any order of mutations. Our proposed solution undergoes extensive wet-lab experimental validation spanning diverse physicochemical properties of various proteins, including fluorescence intensity, antigen-antibody affinity, thermostability, and DNA cleavage activity. More than 40% of ProtLGN-designed single-site mutants outperform their wild-type counterparts across all studied proteins and targeted properties. More importantly, our model can bypass the negative epistatic effect to combine single mutation sites and form deep mutants with up to seven mutation sites in a single round, whose physicochemical properties are significantly improved. This observation provides compelling evidence of the structure-based model's potential to guide deep mutations in protein engineering. Overall, our approach emerges as a versatile tool for protein engineering, benefiting both the computational and bioengineering communities.

摘要

蛋白质工程在从大量候选突变体中寻找最佳突变体方面面临挑战。在这项研究中,我们引入了一种基于深度学习的数据高效适应性预测工具来指导蛋白质工程。我们的方法建立了一个轻量级的图神经网络方案,用于蛋白质结构,该方案可以有效地分析野生型蛋白质中氨基酸的微环境,并重建更有可能通过自然选择的氨基酸序列分布。该分布为在任意突变数量级上对蛋白质进行任意属性评分提供了一般指导。我们提出的解决方案经过广泛的湿实验室实验验证,涵盖了各种蛋白质的多种物理化学性质,包括荧光强度、抗原-抗体亲和力、热稳定性和 DNA 切割活性。在所有研究的蛋白质和目标特性中,超过 40%的 ProtLGN 设计的单点突变体优于其野生型对应物。更重要的是,我们的模型可以绕过负上位效应,在单个回合中组合单个突变位点,并形成多达七个突变位点的深度突变体,其物理化学性质得到显著改善。这一观察结果为基于结构的模型在蛋白质工程中指导深度突变提供了有力的证据。总的来说,我们的方法是一种多功能的蛋白质工程工具,对计算和生物工程社区都有好处。

相似文献

1
Protein Engineering with Lightweight Graph Denoising Neural Networks.用光重量化图去噪神经网络进行蛋白质工程。
J Chem Inf Model. 2024 May 13;64(9):3650-3661. doi: 10.1021/acs.jcim.4c00036. Epub 2024 Apr 17.
2
Semantical and geometrical protein encoding toward enhanced bioactivity and thermostability.面向增强生物活性和热稳定性的语义和几何蛋白质编码
Elife. 2025 May 2;13:RP98033. doi: 10.7554/eLife.98033.
3
Direct prediction of profiles of sequences compatible with a protein structure by neural networks with fragment-based local and energy-based nonlocal profiles.通过具有基于片段的局部特征和基于能量的非局部特征的神经网络直接预测与蛋白质结构兼容的序列特征。
Proteins. 2014 Oct;82(10):2565-73. doi: 10.1002/prot.24620. Epub 2014 Jun 19.
4
Flattening the curve-How to get better results with small deep-mutational-scanning datasets.拉平曲线——如何从小规模深度突变扫描数据集获得更好的结果。
Proteins. 2024 Jul;92(7):886-902. doi: 10.1002/prot.26686. Epub 2024 Mar 19.
5
Discovery of Novel Gain-of-Function Mutations Guided by Structure-Based Deep Learning.基于结构的深度学习指导下的新型功能获得性突变的发现。
ACS Synth Biol. 2020 Nov 20;9(11):2927-2935. doi: 10.1021/acssynbio.0c00345. Epub 2020 Oct 16.
6
A general temperature-guided language model to design proteins of enhanced stability and activity.一种通用的温度引导的语言模型,用于设计稳定性和活性增强的蛋白质。
Sci Adv. 2024 Nov 29;10(48):eadr2641. doi: 10.1126/sciadv.adr2641. Epub 2024 Nov 27.
7
Protein multi-level structure feature-integrated deep learning method for mutational effect prediction.基于蛋白质多层次结构特征的深度学习基因突变效应预测方法。
Biotechnol J. 2024 Aug;19(8):e2400203. doi: 10.1002/biot.202400203.
8
MVGNN-PPIS: A novel multi-view graph neural network for protein-protein interaction sites prediction based on Alphafold3-predicted structures and transfer learning.MVGNN-PPIS:一种基于Alphafold3预测结构和迁移学习的用于蛋白质-蛋白质相互作用位点预测的新型多视图图神经网络。
Int J Biol Macromol. 2025 Apr;300:140096. doi: 10.1016/j.ijbiomac.2025.140096. Epub 2025 Jan 21.
9
Enhancing efficiency of protein language models with minimal wet-lab data through few-shot learning.通过少样本学习利用最少的湿实验数据提高蛋白质语言模型的效率。
Nat Commun. 2024 Jul 2;15(1):5566. doi: 10.1038/s41467-024-49798-6.
10
Stability Oracle: a structure-based graph-transformer framework for identifying stabilizing mutations.稳定性预测器:一种基于结构的图变换框架,用于识别稳定化突变。
Nat Commun. 2024 Jul 23;15(1):6170. doi: 10.1038/s41467-024-49780-2.

引用本文的文献

1
Sequence-only prediction of binding affinity changes: a robust and interpretable model for antibody engineering.结合亲和力变化的仅序列预测:一种用于抗体工程的稳健且可解释的模型。
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf446.
2
From high-throughput evaluation to wet-lab studies: advancing mutation effect prediction with a retrieval-enhanced model.从高通量评估到湿实验室研究:利用检索增强模型推进突变效应预测
Bioinformatics. 2025 Jul 1;41(Supplement_1):i401-i409. doi: 10.1093/bioinformatics/btaf189.
3
Multiplexed, scalable analog recording of gene regulation dynamics over weeks using intracellular protein tapes.
使用细胞内蛋白质磁带对基因调控动力学进行数周的多路复用、可扩展模拟记录。
bioRxiv. 2025 May 10:2025.05.10.653182. doi: 10.1101/2025.05.10.653182.
4
CPE-Pro: A Structure-Sensitive Deep Learning Method for Protein Representation and Origin Evaluation.CPE-Pro:一种用于蛋白质表征和起源评估的结构敏感深度学习方法。
Interdiscip Sci. 2025 Jun 8. doi: 10.1007/s12539-025-00732-4.
5
Semantical and geometrical protein encoding toward enhanced bioactivity and thermostability.面向增强生物活性和热稳定性的语义和几何蛋白质编码
Elife. 2025 May 2;13:RP98033. doi: 10.7554/eLife.98033.
6
Discovery, design, and engineering of enzymes based on molecular retrobiosynthesis.基于分子逆向生物合成的酶的发现、设计与工程
mLife. 2025 Mar 28;4(2):107-125. doi: 10.1002/mlf2.70009. eCollection 2025 Apr.
7
ProCeSa: Contrast-Enhanced Structure-Aware Network for Thermostability Prediction with Protein Language Models.ProCeSa:用于蛋白质语言模型热稳定性预测的对比增强结构感知网络。
J Chem Inf Model. 2025 Mar 10;65(5):2304-2313. doi: 10.1021/acs.jcim.4c01752. Epub 2025 Feb 23.
8
Protein engineering in the deep learning era.深度学习时代的蛋白质工程。
mLife. 2024 Dec 26;3(4):477-491. doi: 10.1002/mlf2.12157. eCollection 2024 Dec.
9
Integration of molecular coarse-grained model into geometric representation learning framework for protein-protein complex property prediction.将分子粗粒度模型集成到几何表示学习框架中,用于预测蛋白质-蛋白质复合物性质。
Nat Commun. 2024 Nov 7;15(1):9629. doi: 10.1038/s41467-024-53583-w.
10
GGN-GO: geometric graph networks for predicting protein function by multi-scale structure features.GGN-GO:基于多尺度结构特征预测蛋白质功能的几何图网络。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae559.