• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用结构感知图卷积网络预测和设计蛋白酶酶特异性。

Prediction and design of protease enzyme specificity using a structure-aware graph convolutional network.

机构信息

Institute for Quantitative Biomedicine, Rutgers-The State University of New Jersey, Piscataway, NJ 08854.

Department of Chemistry and Chemical Biology, Rutgers-The State University of New Jersey, Piscataway, NJ 08854.

出版信息

Proc Natl Acad Sci U S A. 2023 Sep 26;120(39):e2303590120. doi: 10.1073/pnas.2303590120. Epub 2023 Sep 20.

DOI:10.1073/pnas.2303590120
PMID:37729196
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10523478/
Abstract

Site-specific proteolysis by the enzymatic cleavage of small linear sequence motifs is a key posttranslational modification involved in physiology and disease. The ability to robustly and rapidly predict protease-substrate specificity would also enable targeted proteolytic cleavage by designed proteases. Current methods for predicting protease specificity are limited to sequence pattern recognition in experimentally derived cleavage data obtained for libraries of potential substrates and generated separately for each protease variant. We reasoned that a more semantically rich and robust model of protease specificity could be developed by incorporating the energetics of molecular interactions between protease and substrates into machine learning workflows. We present Protein Graph Convolutional Network (PGCN), which develops a physically grounded, structure-based molecular interaction graph representation that describes molecular topology and interaction energetics to predict enzyme specificity. We show that PGCN accurately predicts the specificity landscapes of several variants of two model proteases. Node and edge ablation tests identified key graph elements for specificity prediction, some of which are consistent with known biochemical constraints for protease:substrate recognition. We used a pretrained PGCN model to guide the design of protease libraries for cleaving two noncanonical substrates, and found good agreement with experimental cleavage results. Importantly, the model can accurately assess designs featuring diversity at positions not present in the training data. The described methodology should enable the structure-based prediction of specificity landscapes of a wide variety of proteases and the construction of tailor-made protease editors for site-selectively and irreversibly modifying chosen target proteins.

摘要

位点特异性蛋白水解是一种涉及生理和疾病的关键翻译后修饰,通过酶对小线性序列基序的切割来实现。能够强大且快速地预测蛋白酶-底物特异性也将能够通过设计的蛋白酶进行靶向蛋白水解切割。目前预测蛋白酶特异性的方法仅限于从实验获得的潜在底物库的切割数据中进行序列模式识别,并且这些数据是为每个蛋白酶变体分别生成的。我们认为,可以通过将蛋白酶和底物之间分子相互作用的能量纳入机器学习工作流程,开发出更具语义丰富性和稳健性的蛋白酶特异性模型。我们提出了蛋白质图卷积网络(PGCN),它开发了一种基于物理的、基于结构的分子相互作用图表示形式,用于描述分子拓扑和相互作用能量,以预测酶特异性。我们表明,PGCN 可以准确预测两种模型蛋白酶的几个变体的特异性景观。节点和边消融测试确定了特异性预测的关键图元素,其中一些与蛋白酶:底物识别的已知生化约束一致。我们使用预训练的 PGCN 模型来指导针对两种非典型底物进行切割的蛋白酶文库的设计,并发现与实验切割结果具有良好的一致性。重要的是,该模型可以准确评估在训练数据中不存在的位置具有多样性的设计。所描述的方法学应该能够实现各种蛋白酶特异性景观的基于结构的预测,并构建用于选择性和不可逆地修饰选定靶蛋白的定制蛋白酶编辑器。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/ed0e50aad375/pnas.2303590120fig07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/3ef8f35a09f8/pnas.2303590120fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/c1f6096a5118/pnas.2303590120fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/e71eb8f16ce5/pnas.2303590120fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/0ab8eb836c46/pnas.2303590120fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/69c6e3c45f87/pnas.2303590120fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/378eabbe885f/pnas.2303590120fig06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/ed0e50aad375/pnas.2303590120fig07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/3ef8f35a09f8/pnas.2303590120fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/c1f6096a5118/pnas.2303590120fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/e71eb8f16ce5/pnas.2303590120fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/0ab8eb836c46/pnas.2303590120fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/69c6e3c45f87/pnas.2303590120fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/378eabbe885f/pnas.2303590120fig06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7146/10523478/ed0e50aad375/pnas.2303590120fig07.jpg

相似文献

1
Prediction and design of protease enzyme specificity using a structure-aware graph convolutional network.利用结构感知图卷积网络预测和设计蛋白酶酶特异性。
Proc Natl Acad Sci U S A. 2023 Sep 26;120(39):e2303590120. doi: 10.1073/pnas.2303590120. Epub 2023 Sep 20.
2
Prediction and Design of Protease Enzyme Specificity Using a Structure-Aware Graph Convolutional Network.使用结构感知图卷积网络预测和设计蛋白酶特异性
bioRxiv. 2023 Feb 16:2023.02.16.528728. doi: 10.1101/2023.02.16.528728.
3
Data-driven supervised learning of a viral protease specificity landscape from deep sequencing and molecular simulations.基于深度测序和分子模拟的病毒蛋白酶特异性全景数据驱动的有监督学习。
Proc Natl Acad Sci U S A. 2019 Jan 2;116(1):168-176. doi: 10.1073/pnas.1805256116. Epub 2018 Dec 26.
4
iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites.iProt-Sub:一个全面的软件包,用于准确地映射和预测蛋白酶特异性底物和切割位点。
Brief Bioinform. 2019 Mar 25;20(2):638-658. doi: 10.1093/bib/bby028.
5
Large-Scale Structure-Based Prediction and Identification of Novel Protease Substrates Using Computational Protein Design.使用计算蛋白质设计基于大规模结构的新型蛋白酶底物预测与鉴定
J Mol Biol. 2017 Jan 20;429(2):220-236. doi: 10.1016/j.jmb.2016.11.031. Epub 2016 Dec 6.
6
PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites.PROSPER:一种基于综合特征的蛋白酶底物切割位点预测工具。
PLoS One. 2012;7(11):e50300. doi: 10.1371/journal.pone.0050300. Epub 2012 Nov 29.
7
Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods.二十年来蛋白酶特异性底物和切割位点预测的生物信息学研究:对现有方法的全面回顾和基准测试。
Brief Bioinform. 2019 Nov 27;20(6):2150-2166. doi: 10.1093/bib/bby077.
8
Profiling the Extended Cleavage Specificity of the House Dust Mite Protease Allergens Der p 1, Der p 3 and Der p 6 for the Prediction of New Cell Surface Protein Substrates.分析屋尘螨蛋白酶过敏原Der p 1、Der p 3和Der p 6的扩展切割特异性以预测新的细胞表面蛋白底物
Int J Mol Sci. 2017 Jun 27;18(7):1373. doi: 10.3390/ijms18071373.
9
Procleave: Predicting Protease-specific Substrate Cleavage Sites by Combining Sequence and Structural Information.Procleave:通过结合序列和结构信息预测蛋白酶特异性底物切割位点。
Genomics Proteomics Bioinformatics. 2020 Feb;18(1):52-64. doi: 10.1016/j.gpb.2019.08.002. Epub 2020 May 12.
10
Engineering of TEV protease variants with redesigned substrate specificity.具有重新设计的底物特异性的 TEV 蛋白酶变体的工程改造。
Biotechnol J. 2023 Nov;18(11):e2200625. doi: 10.1002/biot.202200625. Epub 2023 Jul 28.

引用本文的文献

1
Data-driven protease engineering by DNA-recording and epistasis-aware machine learning.通过DNA记录和上位性感知机器学习进行数据驱动的蛋白酶工程。
Nat Commun. 2025 Jul 1;16(1):5466. doi: 10.1038/s41467-025-60622-7.
2
Protease engineering: Approaches, tools, and emerging trends.蛋白酶工程:方法、工具及新趋势
Biotechnol Adv. 2025 Sep;82:108602. doi: 10.1016/j.biotechadv.2025.108602. Epub 2025 May 12.
3
Physics-based modeling in the new era of enzyme engineering.酶工程新时代基于物理学的建模

本文引用的文献

1
Peptide-binding specificity prediction using fine-tuned protein structure prediction networks.使用经过微调的蛋白质结构预测网络进行肽结合特异性预测。
Proc Natl Acad Sci U S A. 2023 Feb 28;120(9):e2216697120. doi: 10.1073/pnas.2216697120. Epub 2023 Feb 21.
2
Mechanism-based traps enable protease and hydrolase substrate discovery.基于机制的陷阱可用于蛋白酶和水解酶底物的发现。
Nature. 2022 Feb;602(7898):701-707. doi: 10.1038/s41586-022-04414-9. Epub 2022 Feb 16.
3
Harnessing protein folding neural networks for peptide-protein docking.
Nat Comput Sci. 2025 Apr;5(4):279-291. doi: 10.1038/s43588-025-00788-8. Epub 2025 Apr 24.
4
Substrate prediction for RiPP biosynthetic enzymes masked language modeling and transfer learning.RiPP生物合成酶的底物预测:掩码语言建模与迁移学习
Digit Discov. 2024 Dec 2;4(2):343-354. doi: 10.1039/d4dd00170b. eCollection 2025 Feb 12.
5
Substrate Prediction for RiPP Biosynthetic Enzymes via Masked Language Modeling and Transfer Learning.通过掩码语言建模和迁移学习对核糖体合成和翻译后修饰肽生物合成酶的底物预测
ArXiv. 2024 Feb 23:arXiv:2402.15181v1.
6
Advances in ligand-specific biosensing for structurally similar molecules.配体特异性生物传感技术在结构相似分子分析中的进展。
Cell Syst. 2023 Dec 20;14(12):1024-1043. doi: 10.1016/j.cels.2023.10.009.
利用蛋白质折叠神经网络进行肽-蛋白对接。
Nat Commun. 2022 Jan 10;13(1):176. doi: 10.1038/s41467-021-27838-9.
4
Making the cut with protease engineering.用蛋白酶工程实现精准切割。
Cell Chem Biol. 2022 Feb 17;29(2):177-190. doi: 10.1016/j.chembiol.2021.12.001. Epub 2021 Dec 17.
5
Deorphanizing Caspase-3 and Caspase-9 Substrates In and Out of Apoptosis with Deep Substrate Profiling.通过深度底物谱分析对细胞凋亡内外的 Caspase-3 和 Caspase-9 底物进行孤儿化。
ACS Chem Biol. 2021 Nov 19;16(11):2280-2296. doi: 10.1021/acschembio.1c00456. Epub 2021 Sep 23.
6
Characterising proteolysis during SARS-CoV-2 infection identifies viral cleavage sites and cellular targets with therapeutic potential.阐明 SARS-CoV-2 感染过程中的蛋白水解作用可鉴定具有治疗潜力的病毒裂解位点和细胞靶标。
Nat Commun. 2021 Sep 21;12(1):5553. doi: 10.1038/s41467-021-25796-w.
7
Biochemical Tools for Tracking Proteolysis.追踪蛋白水解的生化工具。
J Proteome Res. 2021 Dec 3;20(12):5264-5279. doi: 10.1021/acs.jproteome.1c00289. Epub 2021 Sep 7.
8
Machine learning differentiates enzymatic and non-enzymatic metals in proteins.机器学习区分蛋白质中的酶促金属和非酶促金属。
Nat Commun. 2021 Jun 17;12(1):3712. doi: 10.1038/s41467-021-24070-3.
9
Targeting the Main Protease of SARS-CoV-2: From the Establishment of High Throughput Screening to the Design of Tailored Inhibitors.靶向 SARS-CoV-2 的主要蛋白酶:从高通量筛选的建立到定制抑制剂的设计。
Angew Chem Int Ed Engl. 2021 Apr 26;60(18):10423-10429. doi: 10.1002/anie.202016961. Epub 2021 Mar 24.
10
Phage-assisted evolution of botulinum neurotoxin proteases with reprogrammed specificity.噬菌体辅助的肉毒神经毒素蛋白酶的进化具有重新编程的特异性。
Science. 2021 Feb 19;371(6531):803-810. doi: 10.1126/science.abf5972.