蛋白酶及其类型的鉴定。

Identification of proteases and their types.

作者信息

Shen Hong-Bin, Chou Kuo-Chen

机构信息

Institute of Image Processing and Pattern Recognition, Shanghai Jiaotong University, Shanghai 200240, China.

出版信息

Anal Biochem. 2009 Feb 1;385(1):153-60. doi: 10.1016/j.ab.2008.10.020. Epub 2008 Nov 1.

DOI:10.1016/j.ab.2008.10.020

PMID:19007742

Abstract

Called by many as biology's version of Swiss army knives, proteases cut long sequences of amino acids into fragments and regulate most physiological processes. They are vitally important in the life cycle. Different types of proteases have different action mechanisms and biological processes. With the avalanche of protein sequences generated during the postgenomic age, it is highly desirable for both basic research and drug design to develop a fast and reliable method for identifying the types of proteases according to their sequences or even just for whether they are proteases or not. In this article, three recently developed identification methods in this regard are discussed: (i) FunD-PseAAC, (ii) GO-PseAAC, and (iii) FunD-PsePSSM. The first two were established by hybridizing the FunD (functional domain) approach and the GO (gene ontology) approach, respectively, with the PseAAC (pseudo amino acid composition) approach. The third method was established by fusing the FunD approach with the PsePSSM (pseudo position-specific scoring matrix) approach. Of these three methods, only FunD-PsePSSM has provided a server called ProtIdent (protease identifier), which is freely accessible to the public via the website at http://www.csbio.sjtu.edu.cn/bioinf/Protease. For the convenience of users, a step-by-step guide on how to use ProtIdent is illustrated. Meanwhile, the caveat in using ProtIdent and how to understand the success expectancy rate of a statistical predictor are discussed. Finally, the essence of why ProtIdent can yield a high success rate in identifying proteases and their types is elucidated.

摘要

蛋白酶被许多人称为生物学领域的瑞士军刀，它能将长链氨基酸切割成片段，并调节大多数生理过程。它们在生命周期中至关重要。不同类型的蛋白酶具有不同的作用机制和生物学过程。在后基因组时代，随着蛋白质序列的大量涌现，无论是基础研究还是药物设计，都迫切需要开发一种快速可靠的方法，用于根据蛋白酶的序列来识别其类型，甚至仅仅是判断它们是否为蛋白酶。在本文中，将讨论最近在这方面开发的三种识别方法：（i）FunD-PseAAC，（ii）GO-PseAAC，以及（iii）FunD-PsePSSM。前两种方法分别是通过将FunD（功能域）方法和GO（基因本体）方法与PseAAC（伪氨基酸组成）方法杂交建立的。第三种方法是通过将FunD方法与PsePSSM（伪位置特异性评分矩阵）方法融合建立的。在这三种方法中，只有FunD-PsePSSM提供了一个名为ProtIdent（蛋白酶标识符）的服务器，公众可以通过网站http://www.csbio.sjtu.edu.cn/bioinf/Protease免费访问。为方便用户，文中还给出了使用ProtIdent的逐步指南。同时，讨论了使用ProtIdent时的注意事项以及如何理解统计预测器的成功率预期。最后，阐明了ProtIdent在识别蛋白酶及其类型方面能够获得高成功率的本质原因。

相似文献

Identification of proteases and their types.蛋白酶及其类型的鉴定。

Anal Biochem. 2009 Feb 1;385(1):153-60. doi: 10.1016/j.ab.2008.10.020. Epub 2008 Nov 1.

ProtIdent: a web server for identifying proteases and their types by fusing functional domain and sequential evolution information.ProtIdent：一个通过融合功能域和序列进化信息来识别蛋白酶及其类型的网络服务器。

Biochem Biophys Res Commun. 2008 Nov 14;376(2):321-5. doi: 10.1016/j.bbrc.2008.08.125. Epub 2008 Sep 5.

Predicting protease types by hybridizing gene ontology and pseudo amino acid composition.通过基因本体论与伪氨基酸组成的杂交预测蛋白酶类型。

Proteins. 2006 May 15;63(3):681-4. doi: 10.1002/prot.20898.

Prediction of protease types in a hybridization space.杂交空间中蛋白酶类型的预测。

Biochem Biophys Res Commun. 2006 Jan 20;339(3):1015-20. doi: 10.1016/j.bbrc.2005.10.196. Epub 2005 Nov 9.

QuatIdent: a web server for identifying protein quaternary structural attribute by fusing functional domain and sequential evolution information.QuatIdent：一个通过融合功能域和序列进化信息来识别蛋白质四级结构属性的网络服务器。

J Proteome Res. 2009 Mar;8(3):1577-84. doi: 10.1021/pr800957q.

GPCR-2L: predicting G protein-coupled receptors and their types by hybridizing two different modes of pseudo amino acid compositions.GPCR-2L：通过两种不同模式的伪氨基酸组成杂交预测G蛋白偶联受体及其类型。

Mol Biosyst. 2011 Mar;7(3):911-9. doi: 10.1039/c0mb00170h. Epub 2010 Dec 23.

Signal-3L: A 3-layer approach for predicting signal peptides.信号-3L：一种预测信号肽的三层方法。

Biochem Biophys Res Commun. 2007 Nov 16;363(2):297-303. doi: 10.1016/j.bbrc.2007.08.140. Epub 2007 Aug 31.

Using stacked generalization to predict membrane protein types based on pseudo-amino acid composition.基于伪氨基酸组成，使用堆叠泛化预测膜蛋白类型。

J Theor Biol. 2006 Oct 21;242(4):941-6. doi: 10.1016/j.jtbi.2006.05.006. Epub 2006 May 16.

MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM.MemType-2L：一个通过伪位置特异性得分矩阵整合进化信息来预测膜蛋白及其类型的网络服务器。

Biochem Biophys Res Commun. 2007 Aug 24;360(2):339-45. doi: 10.1016/j.bbrc.2007.06.027. Epub 2007 Jun 15.

GPCR-GIA: a web-server for identifying G-protein coupled receptors and their families with grey incidence analysis.GPCR-GIA：一个利用灰色关联分析识别 G 蛋白偶联受体及其家族的网络服务器。

Protein Eng Des Sel. 2009 Nov;22(11):699-705. doi: 10.1093/protein/gzp057. Epub 2009 Sep 22.

引用本文的文献

Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases.使用多种伪氨基酸组成类型和不同的机器学习算法对古菌磷脂酶进行分类和预测。

Mol Biol Res Commun. 2023;12(3):117-126. doi: 10.22099/mbrc.2023.47756.1845.

Phenylethanoid glycosides as a possible COVID-19 protease inhibitor: a virtual screening approach.苯乙醇苷类化合物作为一种可能的 COVID-19 蛋白酶抑制剂：虚拟筛选方法。

J Mol Model. 2021 Nov 3;27(11):341. doi: 10.1007/s00894-021-04963-2.

HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine.HRGPred：基于 k--mer 核苷酸组成特征和支持向量机预测除草剂抗性基因。

Sci Rep. 2019 Jan 28;9(1):778. doi: 10.1038/s41598-018-37309-9.

iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites.iProt-Sub：一个全面的软件包，用于准确地映射和预测蛋白酶特异性底物和切割位点。

Brief Bioinform. 2019 Mar 25;20(2):638-658. doi: 10.1093/bib/bby028.

PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy.PROSPERous：提高准确性的 90 种蛋白酶底物切割位点的高通量预测。

Bioinformatics. 2018 Feb 15;34(4):684-687. doi: 10.1093/bioinformatics/btx670.

2L-piRNA: A Two-Layer Ensemble Classifier for Identifying Piwi-Interacting RNAs and Their Function.2L-piRNA：一种用于识别Piwi相互作用RNA及其功能的双层集成分类器。

Mol Ther Nucleic Acids. 2017 Jun 16;7:267-277. doi: 10.1016/j.omtn.2017.04.008. Epub 2017 Apr 13.

Assessing the impact of long term frozen storage of faecal samples on protein concentration and protease activity.评估粪便样本长期冷冻保存对蛋白质浓度和蛋白酶活性的影响。

J Microbiol Methods. 2016 Apr;123:31-8. doi: 10.1016/j.mimet.2016.02.001. Epub 2016 Feb 4.

A computational module assembled from different protease family motifs identifies PI PLC from Bacillus cereus as a putative prolyl peptidase with a serine protease scaffold.一个由不同蛋白酶家族基序组装而成的计算模块将来自蜡状芽孢杆菌的 PI PLC 鉴定为具有丝氨酸蛋白酶支架的假定脯氨酰肽酶。

PLoS One. 2013 Aug 5;8(8):e70923. doi: 10.1371/journal.pone.0070923. Print 2013.

Some remarks on protein attribute prediction and pseudo amino acid composition.关于蛋白质属性预测和伪氨基酸组成的一些说明。

J Theor Biol. 2011 Mar 21;273(1):236-47. doi: 10.1016/j.jtbi.2010.12.024. Epub 2010 Dec 17.

Trends in global warming and evolution of matrix protein 2 family from influenza A virus.全球变暖趋势及甲型流感病毒基质蛋白 2 家族的进化。

Interdiscip Sci. 2009 Dec;1(4):272-9. doi: 10.1007/s12539-009-0053-6. Epub 2009 Nov 14.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

蛋白酶及其类型的鉴定。

Identification of proteases and their types.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献