College of Information Engineering, Northwest A&F University, Shaanxi 712100, China.
South Australian immunoGENomics Cancer Institute (SAiGENCI), Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, SA 5005, Australia.
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad372.
Proteases contribute to a broad spectrum of cellular functions. Given a relatively limited amount of experimental data, developing accurate sequence-based predictors of substrate cleavage sites facilitates a better understanding of protease functions and substrate specificity. While many protease-specific predictors of substrate cleavage sites were developed, these efforts are outpaced by the growth of the protease substrate cleavage data. In particular, since data for 100+ protease types are available and this number continues to grow, it becomes impractical to publish predictors for new protease types, and instead it might be better to provide a computational platform that helps users to quickly and efficiently build predictors that address their specific needs. To this end, we conceptualized, developed, tested and released a versatile bioinformatics platform, ProsperousPlus, that empowers users, even those with no programming or little bioinformatics background, to build fast and accurate predictors of substrate cleavage sites. ProsperousPlus facilitates the use of the rapidly accumulating substrate cleavage data to train, empirically assess and deploy predictive models for user-selected substrate types. Benchmarking tests on test datasets show that our platform produces predictors that on average exceed the predictive performance of current state-of-the-art approaches. ProsperousPlus is available as a webserver and a stand-alone software package at http://prosperousplus.unimelb-biotools.cloud.edu.au/.
蛋白酶在广泛的细胞功能中发挥作用。鉴于相对有限的实验数据量,开发基于序列的准确预测子,有助于更好地理解蛋白酶的功能和底物特异性。虽然已经开发了许多针对特定蛋白酶的底物切割位点预测子,但这些努力仍然落后于蛋白酶底物切割数据的增长。特别是,由于有 100 多种以上的蛋白酶类型的数据可用,并且这个数字还在不断增加,因此为新的蛋白酶类型发布预测子变得不切实际,而提供一个计算平台来帮助用户快速有效地构建满足其特定需求的预测子可能会更好。为此,我们构想、开发、测试并发布了一个多功能的生物信息学平台 ProsperousPlus,使即使没有编程或很少有生物信息学背景的用户也能够构建快速而准确的底物切割位点预测子。ProsperousPlus 有助于利用快速积累的底物切割数据,为用户选择的底物类型训练、经验评估和部署预测模型。在测试数据集上的基准测试表明,我们的平台生成的预测子平均超过了当前最先进方法的预测性能。ProsperousPlus 可作为一个网络服务器和独立的软件包在 http://prosperousplus.unimelb-biotools.cloud.edu.au/ 上使用。