预测蛋白质对蛋白水解加工的结构易感性。

Predicting Structural Susceptibility of Proteins to Proteolytic Processing.

机构信息

Skolkovo Institute of Science and Technology, Moscow 121205, Russia.

A.A. Kharkevich Institute for Information Transmission Problems, Moscow 127051, Russia.

出版信息

Int J Mol Sci. 2023 Jun 28;24(13):10761. doi: 10.3390/ijms241310761.

DOI:10.3390/ijms241310761

PMID:37445939

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10342023/

Abstract

The importance of 3D protein structure in proteolytic processing is well known. However, despite the plethora of existing methods for predicting proteolytic sites, only a few of them utilize the structural features of potential substrates as predictors. Moreover, to our knowledge, there is currently no method available for predicting the structural susceptibility of protein regions to proteolysis. We developed such a method using data from CutDB, a database that contains experimentally verified proteolytic events. For prediction, we utilized structural features that have been shown to influence proteolysis in earlier studies, such as solvent accessibility, secondary structure, and temperature factor. Additionally, we introduced new structural features, including length of protruded loops and flexibility of protein termini. To maximize the prediction quality of the method, we carefully curated the training set, selected an appropriate machine learning method, and sampled negative examples to determine the optimal positive-to-negative class size ratio. We demonstrated that combining our method with models of protease primary specificity can outperform existing bioinformatics methods for the prediction of proteolytic sites. We also discussed the possibility of utilizing this method for bioinformatics prediction of other post-translational modifications.

摘要

三维蛋白质结构在蛋白水解加工中的重要性是众所周知的。然而，尽管有大量现有的预测蛋白水解位点的方法，但只有少数方法将潜在底物的结构特征用作预测因子。此外，据我们所知，目前还没有方法可用于预测蛋白质区域对蛋白水解的结构易感性。我们使用 CutDB 数据库中的实验验证的蛋白水解事件数据开发了这样一种方法。对于预测，我们利用了在早期研究中已被证明会影响蛋白水解的结构特征，如溶剂可及性、二级结构和温度因子。此外，我们引入了新的结构特征，包括突出环的长度和蛋白末端的柔韧性。为了最大限度地提高该方法的预测质量，我们仔细编辑了训练集，选择了合适的机器学习方法，并对负例进行了采样，以确定最佳的正例与负例类别大小比。我们证明，将我们的方法与蛋白酶主要特异性模型结合使用，可以优于现有的生物信息学方法，用于预测蛋白水解位点。我们还讨论了利用该方法进行其他翻译后修饰的生物信息学预测的可能性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f27e/10342023/b8f420c5bf6c/ijms-24-10761-g001.jpg

相似文献

Predicting Structural Susceptibility of Proteins to Proteolytic Processing.预测蛋白质对蛋白水解加工的结构易感性。

Int J Mol Sci. 2023 Jun 28;24(13):10761. doi: 10.3390/ijms241310761.

Sequence-derived structural features driving proteolytic processing.序列衍生的结构特征驱动蛋白水解加工。

Proteomics. 2014 Jan;14(1):42-50. doi: 10.1002/pmic.201300416. Epub 2013 Dec 11.

Structural determinants of limited proteolysis.有限蛋白水解的结构决定因素。

J Proteome Res. 2011 Aug 5;10(8):3642-51. doi: 10.1021/pr200271w. Epub 2011 Jul 8.

PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites.PROSPER：一种基于综合特征的蛋白酶底物切割位点预测工具。

PLoS One. 2012;7(11):e50300. doi: 10.1371/journal.pone.0050300. Epub 2012 Nov 29.

iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites.iProt-Sub：一个全面的软件包，用于准确地映射和预测蛋白酶特异性底物和切割位点。

Brief Bioinform. 2019 Mar 25;20(2):638-658. doi: 10.1093/bib/bby028.

Procleave: Predicting Protease-specific Substrate Cleavage Sites by Combining Sequence and Structural Information.Procleave：通过结合序列和结构信息预测蛋白酶特异性底物切割位点。

Genomics Proteomics Bioinformatics. 2020 Feb;18(1):52-64. doi: 10.1016/j.gpb.2019.08.002. Epub 2020 May 12.

Protein TAILS: when termini tell tales of proteolysis and function.蛋白质尾巴：末端讲述蛋白水解和功能的故事。

Curr Opin Chem Biol. 2013 Feb;17(1):73-82. doi: 10.1016/j.cbpa.2012.11.025. Epub 2013 Jan 6.

TopFIND 2.0--linking protein termini with proteolytic processing and modifications altering protein function.TopFIND 2.0——将蛋白质末端与改变蛋白质功能的蛋白水解加工和修饰联系起来。

Nucleic Acids Res. 2012 Jan;40(Database issue):D351-61. doi: 10.1093/nar/gkr1025. Epub 2011 Nov 18.

Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods.二十年来蛋白酶特异性底物和切割位点预测的生物信息学研究：对现有方法的全面回顾和基准测试。

Brief Bioinform. 2019 Nov 27;20(6):2150-2166. doi: 10.1093/bib/bby077.

N- and C-terminal degradomics: new approaches to reveal biological roles for plant proteases from substrate identification.N-和 C-末端降解组学：从底物鉴定揭示植物蛋白酶生物学功能的新方法。

Physiol Plant. 2012 May;145(1):5-17. doi: 10.1111/j.1399-3054.2011.01536.x. Epub 2011 Dec 7.

引用本文的文献

Identification of pancreatic cancer-specific protease substrates for protease-dependent targeted delivery.鉴定用于蛋白酶依赖性靶向递送的胰腺癌特异性蛋白酶底物。

Oncogenesis. 2024 Nov 20;13(1):40. doi: 10.1038/s41389-024-00542-1.

Genome-wide bioinformatics analysis of human protease capacity for proteolytic cleavage of the SARS-CoV-2 spike glycoprotein.对人类蛋白酶对 SARS-CoV-2 刺突糖蛋白进行蛋白水解切割的能力进行全基因组生物信息学分析。

Microbiol Spectr. 2024 Feb 6;12(2):e0353023. doi: 10.1128/spectrum.03530-23. Epub 2024 Jan 8.

本文引用的文献

Deciphering protein post-translational modifications using chemical biology tools.使用化学生物学工具解析蛋白质翻译后修饰

Nat Rev Chem. 2020 Dec;4(12):674-695. doi: 10.1038/s41570-020-00223-8. Epub 2020 Oct 6.

AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.AlphaFold 蛋白质结构数据库：用高精度模型极大地扩展蛋白质序列空间的结构覆盖范围。

Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.

Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。

Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.

Genomics Proteomics Bioinformatics. 2020 Feb;18(1):52-64. doi: 10.1016/j.gpb.2019.08.002. Epub 2020 May 12.

DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.DeepCleave：用于半胱天冬酶和基质金属蛋白酶底物及切割位点的深度学习预测器。

Bioinformatics. 2020 Feb 15;36(4):1057-1065. doi: 10.1093/bioinformatics/btz721.

Protein Data Bank: the single global archive for 3D macromolecular structure data.蛋白质数据库：用于存储大分子三维结构数据的全球单一档案库。

Nucleic Acids Res. 2019 Jan 8;47(D1):D520-D528. doi: 10.1093/nar/gky949.

Brief Bioinform. 2019 Nov 27;20(6):2150-2166. doi: 10.1093/bib/bby077.

The ABCs of PTMs.翻译：蛋白质翻译后修饰概述。

Nat Chem Biol. 2018 Feb 14;14(3):188-192. doi: 10.1038/nchembio.2572.

Proteolytic Cleavage-Mechanisms, Function, and "Omic" Approaches for a Near-Ubiquitous Posttranslational Modification.蛋白水解切割机制、功能和用于广泛存在的翻译后修饰的“组学”方法。

Chem Rev. 2018 Feb 14;118(3):1137-1168. doi: 10.1021/acs.chemrev.7b00120. Epub 2017 Dec 21.

The MEROPS database of proteolytic enzymes, their substrates and inhibitors in 2017 and a comparison with peptidases in the PANTHER database.MEROPS 数据库收录了 2017 年的蛋白水解酶、其底物和抑制剂，以及与 PANTHER 数据库中肽酶的比较。

Nucleic Acids Res. 2018 Jan 4;46(D1):D624-D632. doi: 10.1093/nar/gkx1134.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

预测蛋白质对蛋白水解加工的结构易感性。

Predicting Structural Susceptibility of Proteins to Proteolytic Processing.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献