Suppr超能文献

蛋白酶及其类型的鉴定。

Identification of proteases and their types.

作者信息

Shen Hong-Bin, Chou Kuo-Chen

机构信息

Institute of Image Processing and Pattern Recognition, Shanghai Jiaotong University, Shanghai 200240, China.

出版信息

Anal Biochem. 2009 Feb 1;385(1):153-60. doi: 10.1016/j.ab.2008.10.020. Epub 2008 Nov 1.

Abstract

Called by many as biology's version of Swiss army knives, proteases cut long sequences of amino acids into fragments and regulate most physiological processes. They are vitally important in the life cycle. Different types of proteases have different action mechanisms and biological processes. With the avalanche of protein sequences generated during the postgenomic age, it is highly desirable for both basic research and drug design to develop a fast and reliable method for identifying the types of proteases according to their sequences or even just for whether they are proteases or not. In this article, three recently developed identification methods in this regard are discussed: (i) FunD-PseAAC, (ii) GO-PseAAC, and (iii) FunD-PsePSSM. The first two were established by hybridizing the FunD (functional domain) approach and the GO (gene ontology) approach, respectively, with the PseAAC (pseudo amino acid composition) approach. The third method was established by fusing the FunD approach with the PsePSSM (pseudo position-specific scoring matrix) approach. Of these three methods, only FunD-PsePSSM has provided a server called ProtIdent (protease identifier), which is freely accessible to the public via the website at http://www.csbio.sjtu.edu.cn/bioinf/Protease. For the convenience of users, a step-by-step guide on how to use ProtIdent is illustrated. Meanwhile, the caveat in using ProtIdent and how to understand the success expectancy rate of a statistical predictor are discussed. Finally, the essence of why ProtIdent can yield a high success rate in identifying proteases and their types is elucidated.

摘要

蛋白酶被许多人称为生物学领域的瑞士军刀,它能将长链氨基酸切割成片段,并调节大多数生理过程。它们在生命周期中至关重要。不同类型的蛋白酶具有不同的作用机制和生物学过程。在后基因组时代,随着蛋白质序列的大量涌现,无论是基础研究还是药物设计,都迫切需要开发一种快速可靠的方法,用于根据蛋白酶的序列来识别其类型,甚至仅仅是判断它们是否为蛋白酶。在本文中,将讨论最近在这方面开发的三种识别方法:(i)FunD-PseAAC,(ii)GO-PseAAC,以及(iii)FunD-PsePSSM。前两种方法分别是通过将FunD(功能域)方法和GO(基因本体)方法与PseAAC(伪氨基酸组成)方法杂交建立的。第三种方法是通过将FunD方法与PsePSSM(伪位置特异性评分矩阵)方法融合建立的。在这三种方法中,只有FunD-PsePSSM提供了一个名为ProtIdent(蛋白酶标识符)的服务器,公众可以通过网站http://www.csbio.sjtu.edu.cn/bioinf/Protease免费访问。为方便用户,文中还给出了使用ProtIdent的逐步指南。同时,讨论了使用ProtIdent时的注意事项以及如何理解统计预测器的成功率预期。最后,阐明了ProtIdent在识别蛋白酶及其类型方面能够获得高成功率的本质原因。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验