• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iProt-Sub:一个全面的软件包,用于准确地映射和预测蛋白酶特异性底物和切割位点。

iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites.

机构信息

Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia.

Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia and ARC Centre of Excellence in Advanced Molecular Imaging, Monash University, Melbourne, VIC 3800, Australia.

出版信息

Brief Bioinform. 2019 Mar 25;20(2):638-658. doi: 10.1093/bib/bby028.

DOI:10.1093/bib/bby028
PMID:29897410
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6556904/
Abstract

Regulation of proteolysis plays a critical role in a myriad of important cellular processes. The key to better understanding the mechanisms that control this process is to identify the specific substrates that each protease targets. To address this, we have developed iProt-Sub, a powerful bioinformatics tool for the accurate prediction of protease-specific substrates and their cleavage sites. Importantly, iProt-Sub represents a significantly advanced version of its successful predecessor, PROSPER. It provides optimized cleavage site prediction models with better prediction performance and coverage for more species-specific proteases (4 major protease families and 38 different proteases). iProt-Sub integrates heterogeneous sequence and structural features and uses a two-step feature selection procedure to further remove redundant and irrelevant features in an effort to improve the cleavage site prediction accuracy. Features used by iProt-Sub are encoded by 11 different sequence encoding schemes, including local amino acid sequence profile, secondary structure, solvent accessibility and native disorder, which will allow a more accurate representation of the protease specificity of approximately 38 proteases and training of the prediction models. Benchmarking experiments using cross-validation and independent tests showed that iProt-Sub is able to achieve a better performance than several existing generic tools. We anticipate that iProt-Sub will be a powerful tool for proteome-wide prediction of protease-specific substrates and their cleavage sites, and will facilitate hypothesis-driven functional interrogation of protease-specific substrate cleavage and proteolytic events.

摘要

蛋白质水解的调控在许多重要的细胞过程中起着关键作用。更好地理解控制这一过程的机制的关键是确定每种蛋白酶的特定底物。为了解决这个问题,我们开发了 iProt-Sub,这是一种强大的生物信息学工具,用于准确预测蛋白酶特异性底物及其切割位点。重要的是,iProt-Sub 是其前身 PROSPER 的一个显著改进版本。它为更多物种特异性的蛋白酶(4 种主要的蛋白酶家族和 38 种不同的蛋白酶)提供了优化的切割位点预测模型,具有更好的预测性能和覆盖范围。iProt-Sub 集成了异构序列和结构特征,并使用两步特征选择过程来进一步去除冗余和不相关的特征,以提高切割位点预测的准确性。iProt-Sub 使用的特征由 11 种不同的序列编码方案编码,包括局部氨基酸序列轮廓、二级结构、溶剂可及性和天然无序性,这将允许更准确地表示大约 38 种蛋白酶的蛋白酶特异性,并训练预测模型。使用交叉验证和独立测试的基准实验表明,iProt-Sub 能够实现比几个现有通用工具更好的性能。我们预计 iProt-Sub 将成为一种强大的工具,用于全蛋白质组范围内预测蛋白酶特异性底物及其切割位点,并促进基于假设的对蛋白酶特异性底物切割和蛋白水解事件的功能研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/7866e2c50b16/bby028f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/363c0b5a45cd/bby028f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/490539e44bc4/bby028f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/1e67b59afc2d/bby028f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/4091936a0e68/bby028f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/3677d077c0aa/bby028f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/d25cac945dc5/bby028f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/b092bd96b81d/bby028f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/e15dbc4def4e/bby028f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/930836e675b4/bby028f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/7866e2c50b16/bby028f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/363c0b5a45cd/bby028f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/490539e44bc4/bby028f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/1e67b59afc2d/bby028f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/4091936a0e68/bby028f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/3677d077c0aa/bby028f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/d25cac945dc5/bby028f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/b092bd96b81d/bby028f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/e15dbc4def4e/bby028f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/930836e675b4/bby028f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ded5/6556904/7866e2c50b16/bby028f10.jpg

相似文献

1
iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites.iProt-Sub:一个全面的软件包,用于准确地映射和预测蛋白酶特异性底物和切割位点。
Brief Bioinform. 2019 Mar 25;20(2):638-658. doi: 10.1093/bib/bby028.
2
PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites.PROSPER:一种基于综合特征的蛋白酶底物切割位点预测工具。
PLoS One. 2012;7(11):e50300. doi: 10.1371/journal.pone.0050300. Epub 2012 Nov 29.
3
Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods.二十年来蛋白酶特异性底物和切割位点预测的生物信息学研究:对现有方法的全面回顾和基准测试。
Brief Bioinform. 2019 Nov 27;20(6):2150-2166. doi: 10.1093/bib/bby077.
4
Procleave: Predicting Protease-specific Substrate Cleavage Sites by Combining Sequence and Structural Information.Procleave:通过结合序列和结构信息预测蛋白酶特异性底物切割位点。
Genomics Proteomics Bioinformatics. 2020 Feb;18(1):52-64. doi: 10.1016/j.gpb.2019.08.002. Epub 2020 May 12.
5
PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy.PROSPERous:提高准确性的 90 种蛋白酶底物切割位点的高通量预测。
Bioinformatics. 2018 Feb 15;34(4):684-687. doi: 10.1093/bioinformatics/btx670.
6
ProsperousPlus: a one-stop and comprehensive platform for accurate protease-specific substrate cleavage prediction and machine-learning model construction.ProsperousPlus:一个一站式的、全面的平台,用于准确的蛋白酶特异性底物切割预测和机器学习模型构建。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad372.
7
Predicting Proteolysis in Complex Proteomes Using Deep Learning.利用深度学习预测复杂蛋白质组中的蛋白水解。
Int J Mol Sci. 2021 Mar 17;22(6):3071. doi: 10.3390/ijms22063071.
8
Bioinformatic approaches for predicting substrates of proteases.预测蛋白酶底物的生物信息学方法。
J Bioinform Comput Biol. 2011 Feb;9(1):149-78. doi: 10.1142/s0219720011005288.
9
Cascleave 2.0, a new approach for predicting caspase and granzyme cleavage targets.Cascleave 2.0,一种新的预测半胱天冬酶和颗粒酶切割靶点的方法。
Bioinformatics. 2014 Jan 1;30(1):71-80. doi: 10.1093/bioinformatics/btt603. Epub 2013 Oct 21.
10
Prediction of HIV-1 protease cleavage site using a combination of sequence, structural, and physicochemical features.利用序列、结构和物理化学特征相结合的方法预测HIV-1蛋白酶切割位点。
BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):478. doi: 10.1186/s12859-016-1337-6.

引用本文的文献

1
Progress and trends on machine learning in proteomics during 1997-2024: a bibliometric analysis.1997 - 2024年蛋白质组学中机器学习的进展与趋势:文献计量分析
Front Med (Lausanne). 2025 Aug 15;12:1594442. doi: 10.3389/fmed.2025.1594442. eCollection 2025.
2
A deeper evaluation of cytokeratin fragment 21-1 as a lung cancer tumor marker and comparison of different assays.细胞角蛋白片段21-1作为肺癌肿瘤标志物的深入评估及不同检测方法的比较。
Biosens Bioelectron X. 2025 May;23. doi: 10.1016/j.biosx.2025.100593. Epub 2025 Feb 17.
3
Downregulation of PIK3IP1/TrIP on T cells is controlled by TCR signal strength, PKC, and metalloprotease-mediated cleavage.

本文引用的文献

1
iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences.iFeature:一个用于从蛋白质和肽序列中提取和选择特征的 Python 包和网络服务器。
Bioinformatics. 2018 Jul 15;34(14):2499-2502. doi: 10.1093/bioinformatics/bty140.
2
iDNA6mA-PseKNC: Identifying DNA N-methyladenosine sites by incorporating nucleotide physicochemical properties into PseKNC.iDNA6mA-PseKNC:通过将核苷酸理化性质纳入 PseKNC 来鉴定 DNA N6-甲基腺苷位点。
Genomics. 2019 Jan;111(1):96-102. doi: 10.1016/j.ygeno.2018.01.005. Epub 2018 Jan 31.
3
A Novel Modeling in Mathematical Biology for Classification of Signal Peptides.
T细胞上PIK3IP1/TrIP的下调受TCR信号强度、蛋白激酶C和金属蛋白酶介导的裂解作用控制。
J Biol Chem. 2024 Dec;300(12):107930. doi: 10.1016/j.jbc.2024.107930. Epub 2024 Oct 24.
4
Downregulation of PIK3IP1/TrIP on T cells is controlled by TCR signal strength, PKC and metalloprotease-mediated cleavage.T细胞上PIK3IP1/TrIP的下调受TCR信号强度、蛋白激酶C和金属蛋白酶介导的切割作用控制。
bioRxiv. 2024 Oct 21:2024.04.29.591680. doi: 10.1101/2024.04.29.591680.
5
iACVP-MR: Accurate Identification of Anti-coronavirus Peptide based on Multiple Features Information and Recurrent Neural Network.iACVP-MR:基于多特征信息和循环神经网络的抗冠状病毒肽精确识别
Curr Med Chem. 2025;32(10):2055-2067. doi: 10.2174/0109298673277663240101111507.
6
ADCdb: the database of antibody-drug conjugates.ADCdb:抗体偶联药物数据库。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1097-D1109. doi: 10.1093/nar/gkad831.
7
Prediction and design of protease enzyme specificity using a structure-aware graph convolutional network.利用结构感知图卷积网络预测和设计蛋白酶酶特异性。
Proc Natl Acad Sci U S A. 2023 Sep 26;120(39):e2303590120. doi: 10.1073/pnas.2303590120. Epub 2023 Sep 20.
8
Proteomic data and structure analysis combined reveal interplay of structural rigidity and flexibility on selectivity of cysteine cathepsins.蛋白质组学数据和结构分析联合揭示了结构刚性和柔性对半胱氨酸组织蛋白酶选择性的相互作用。
Commun Biol. 2023 Apr 24;6(1):450. doi: 10.1038/s42003-023-04772-8.
9
N-Terminomic Changes in Neurons During Excitotoxicity Reveal Proteolytic Events Associated With Synaptic Dysfunctions and Potential Targets for Neuroprotection.神经元兴奋毒性过程中 N 端肽段的变化揭示了与突触功能障碍相关的蛋白水解事件及神经保护的潜在靶点。
Mol Cell Proteomics. 2023 May;22(5):100543. doi: 10.1016/j.mcpro.2023.100543. Epub 2023 Apr 6.
10
Prediction and Design of Protease Enzyme Specificity Using a Structure-Aware Graph Convolutional Network.使用结构感知图卷积网络预测和设计蛋白酶特异性
bioRxiv. 2023 Feb 16:2023.02.16.528728. doi: 10.1101/2023.02.16.528728.
一种用于信号肽分类的数学生物学新模型。
Sci Rep. 2018 Jan 18;8(1):1039. doi: 10.1038/s41598-018-19491-y.
4
BioSeq-Analysis: a platform for DNA, RNA and protein sequence analysis based on machine learning approaches.生物序列分析:一个基于机器学习方法的 DNA、RNA 和蛋白质序列分析平台。
Brief Bioinform. 2019 Jul 19;20(4):1280-1294. doi: 10.1093/bib/bbx165.
5
B-factor profile prediction for RNA flexibility using support vector machines.基于支持向量机的 RNA 柔性 B 因子预测。
J Comput Chem. 2018 Mar 30;39(8):407-411. doi: 10.1002/jcc.25124. Epub 2017 Nov 21.
6
Prediction of protein subcellular localization with oversampling approach and Chou's general PseAAC.基于过采样方法和周式广义伪氨基酸组成预测蛋白质亚细胞定位
J Theor Biol. 2018 Jan 21;437:239-250. doi: 10.1016/j.jtbi.2017.10.030. Epub 2017 Oct 31.
7
PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy.PROSPERous:提高准确性的 90 种蛋白酶底物切割位点的高通量预测。
Bioinformatics. 2018 Feb 15;34(4):684-687. doi: 10.1093/bioinformatics/btx670.
8
pLoc-mAnimal: predict subcellular localization of animal proteins with both single and multiple sites.pLoc-mAnimal:预测具有单一位点和多个位点的动物蛋白质的亚细胞定位。
Bioinformatics. 2017 Nov 15;33(22):3524-3531. doi: 10.1093/bioinformatics/btx476.
9
iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC.iPromoter-2L:一种双层预测器,通过基于多窗口的 PseKNC 来识别启动子及其类型。
Bioinformatics. 2018 Jan 1;34(1):33-40. doi: 10.1093/bioinformatics/btx579.
10
PhosphoPredict: A bioinformatics tool for prediction of human kinase-specific phosphorylation substrates and sites by integrating heterogeneous feature selection.磷酸化预测:一种通过整合异构特征选择来预测人类激酶特异性磷酸化底物和位点的生物信息学工具。
Sci Rep. 2017 Jul 31;7(1):6862. doi: 10.1038/s41598-017-07199-4.