Protease substrate site predictors derived from machine learning on multilevel substrate phage display data.

Authors

Chen Ching-Tai, Yang Ei-Wen, Hsu Hung-Ju, Sun Yi-Kun, Hsu Wen-Lian, Yang An-Suei

Affiliation

Institute of Information Science, Academia Sinica, Taipei 115, Taiwan.

Publication information

Bioinformatics. 2008 Dec 1;24(23):2691-7. doi: 10.1093/bioinformatics/btn538. Epub 2008 Oct 29.

DOI:10.1093/bioinformatics/btn538

PMID:18974075

Abstract

MOTIVATION

Regulatory proteases modulate proteomic dynamics with a spectrum of specificities against substrate proteins. Predicting the substrate sites of these proteases across a proteome would facilitate understanding of their biological functions. High-throughput experiments can generate datasets suitable for machine learning to capture the complex relationships between substrate sequences and enzymatic specificities. However, the ability to predict protease substrate sites by integrating machine learning algorithms with such experimental methodology has yet to be demonstrated.

RESULTS

Factor Xa, a key regulatory protease in the blood coagulation system, was used as a model system, for which effective substrate site predictors were developed and benchmarked. The predictors were derived from bootstrap aggregation (machine learning) algorithms trained with data obtained from multilevel substrate phage display experiments. This combination of experimental sampling and computational learning of substrate specificities can be generalized to proteases whose active forms are available for in vitro experiments.
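As a rough illustration of what bootstrap aggregation over peptide data can look like, the sketch below trains several simple position-wise log-odds scorers on bootstrap resamples of labeled hexapeptides and averages their votes. Everything in it is an assumption made for illustration: the abstract does not specify the base learner, the sequence encoding, or the window length, and the toy peptides are invented (cleanly separable by construction), not taken from the phage display data. The function names `train_log_odds`, `train_bagged`, and `bagged_vote` are hypothetical.

```python
import math
import random
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def train_log_odds(peptides, labels, pseudocount=1.0):
    """Base learner (assumed): per-position log-odds, cleaved vs. uncleaved."""
    length = len(peptides[0])
    counts = {1: [Counter() for _ in range(length)],
              0: [Counter() for _ in range(length)]}
    totals = Counter(labels)
    for peptide, label in zip(peptides, labels):
        for i, aa in enumerate(peptide):
            counts[label][i][aa] += 1
    # Pseudocounts keep every probability non-zero, even if a bootstrap
    # resample happens to contain only one class.
    denom_pos = totals[1] + pseudocount * len(AMINO_ACIDS)
    denom_neg = totals[0] + pseudocount * len(AMINO_ACIDS)
    weights = []
    for i in range(length):
        weights.append({
            aa: math.log(((counts[1][i][aa] + pseudocount) / denom_pos)
                         / ((counts[0][i][aa] + pseudocount) / denom_neg))
            for aa in AMINO_ACIDS
        })
    return weights


def predict_one(weights, peptide):
    """Return 1 (predicted substrate site) if the summed log-odds is positive."""
    score = sum(w[aa] for w, aa in zip(weights, peptide))
    return 1 if score > 0 else 0


def train_bagged(peptides, labels, n_models=25, seed=0):
    """Bootstrap aggregation: fit one base learner per resample (with replacement)."""
    rng = random.Random(seed)
    n = len(peptides)
    models = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(train_log_odds([peptides[i] for i in idx],
                                     [labels[i] for i in idx]))
    return models


def bagged_vote(models, peptide):
    """Fraction of base learners that vote 'substrate site' for this peptide."""
    return sum(predict_one(m, peptide) for m in models) / len(models)


# Invented toy data: positives carry an I-(E/D)-G-R-like motif, negatives do not.
positives = ["IEGRTA", "IDGRSV", "IEGRGL", "LEGRSA", "IDGRTL"]
negatives = ["ALPDQV", "KTPDWE", "VNPDMA", "AAPDTD", "QWPDFY"]
peptides = positives + negatives
labels = [1] * len(positives) + [0] * len(negatives)

models = train_bagged(peptides, labels)
for query in ("IEGRSL", "SKPDHC"):
    print(query, bagged_vote(models, query))
```

Averaging votes over resampled training sets is the defining step of bagging; in practice the vote fraction could be thresholded or used as a ranking score when scanning a proteome with a sliding window.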

AVAILABILITY

http://asqa.iis.sinica.edu.tw/fXaWeb/

Similar articles

Substrate phage display for protease substrate sequence characterization: bovine factor Xa as a model system.

Methods Mol Biol. 2014;1088:107-24. doi: 10.1007/978-1-62703-673-3_8.

Predicting serpin/protease interactions.

Methods Enzymol. 2011;501:237-73. doi: 10.1016/B978-0-12-385950-1.00012-2.

Protease proteomics: revealing protease in vivo functions using systems biology approaches.

Mol Aspects Med. 2008 Oct;29(5):339-58. doi: 10.1016/j.mam.2008.04.003. Epub 2008 May 1.

Methods for mapping protease specificity.

Curr Opin Chem Biol. 2007 Feb;11(1):46-51. doi: 10.1016/j.cbpa.2006.11.021. Epub 2006 Dec 6.

GANNPhos: a new phosphorylation site predictor based on a genetic algorithm integrated neural network.

Protein Eng Des Sel. 2007 Aug;20(8):405-12. doi: 10.1093/protein/gzm035. Epub 2007 Jul 24.

A machine learning evaluation of an artificial immune system.

Evol Comput. 2005 Summer;13(2):179-212. doi: 10.1162/1063656054088503.

PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites.

PLoS One. 2012;7(11):e50300. doi: 10.1371/journal.pone.0050300. Epub 2012 Nov 29.

Research on machine learning issues in biomedical informatics modeling.

J Biomed Inform. 2004 Aug;37(4):221-3. doi: 10.1016/j.jbi.2004.07.004.

Protein function prediction with high-throughput data.

Amino Acids. 2008 Oct;35(3):517-30. doi: 10.1007/s00726-008-0077-y. Epub 2008 Apr 22.

Cited by

Computational Analysis of Antibody Paratopes for Antibody Sequences in Antibody Libraries.

Methods Mol Biol. 2023;2552:437-445. doi: 10.1007/978-1-0716-2609-2_24.

An automated protocol for modelling peptide substrates to proteases.

BMC Bioinformatics. 2020 Dec 29;21(1):586. doi: 10.1186/s12859-020-03931-6.

PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites.

PLoS One. 2012;7(11):e50300. doi: 10.1371/journal.pone.0050300. Epub 2012 Nov 29.

Prediction of carbohydrate binding sites on protein surfaces with 3-dimensional probability density distributions of interacting atoms.

PLoS One. 2012;7(7):e40846. doi: 10.1371/journal.pone.0040846. Epub 2012 Jul 25.

Protein-protein interaction site predictions with three-dimensional probability distributions of interacting atoms on protein surfaces.

PLoS One. 2012;7(6):e37706. doi: 10.1371/journal.pone.0037706. Epub 2012 Jun 6.

Update of PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence.

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W385-90. doi: 10.1093/nar/gkr284. Epub 2011 May 23.

Engineering anti-vascular endothelial growth factor single chain disulfide-stabilized antibody variable fragments (sc-dsFv) with phage-displayed sc-dsFv libraries.

J Biol Chem. 2010 Mar 12;285(11):7880-91. doi: 10.1074/jbc.M109.061457. Epub 2010 Jan 12.

Prediction of protein binding sites in protein structures using hidden Markov support vector machine.

BMC Bioinformatics. 2009 Nov 20;10:381. doi: 10.1186/1471-2105-10-381.
