Bartoli Lisa, Calabrese Remo, Fariselli Piero, Mita Damiano G, Casadio Rita
Laboratory of Biocomputing, CIRB/Department of Biology, University of Bologna, Bologna, Italy.
BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2105-8-S1-S3.
Peptidases are proteolytic enzymes responsible for fundamental cellular activities in all organisms. Apparently about 2-5% of the genes encode for peptidases, irrespectively of the organism source. The basic peptidase function is "protein digestion" and this can be potentially dangerous in living organisms when it is not strictly controlled by specific inhibitors. In genome annotation a basic question is to predict gene function. Here we describe a computational approach that can filter peptidases and their inhibitors out of a given proteome. Furthermore and as an added value to MEROPS, a specific database for peptidases already available in the public domain, our method can predict whether a pair of peptidase/inhibitor can interact, eventually listing all possible predicted ligands (peptidases and/or inhibitors).
We show that by adopting a decision-tree approach the accuracy of PROSITE and HMMER in detecting separately the four major peptidase types (Serine, Aspartic, Cysteine and Metallo- Peptidase) and their inhibitors among a non redundant set of globular proteins can be improved by some percentage points with respect to that obtained with each method separately. More importantly, our method can then predict pairs of peptidases and interacting inhibitors, scoring a joint global accuracy of 99% with coverage for the positive cases (peptidase/inhibitor) close to 100% and a correlation coefficient of 0.91%. In this task the decision-tree approach outperforms the single methods.
The decision-tree can reliably classify protein sequences as peptidases or inhibitors, belonging to a certain class, and can provide a comprehensive list of possible interacting pairs of peptidase/inhibitor. This information can help the design of experiments to detect interacting peptidase/inhibitor complexes and can speed up the selection of possible interacting candidates, without searching for them separately and manually combining the obtained results. A web server specifically developed for annotating peptidases and their inhibitors (HIPPIE) is available at http://gpcr.biocomp.unibo.it/cgi/predictors/hippie/pred_hippie.cgi.
肽酶是一类蛋白水解酶,负责所有生物体中的基本细胞活动。显然,无论生物体来源如何,约2 - 5%的基因编码肽酶。肽酶的基本功能是“蛋白质消化”,当它不受特定抑制剂严格控制时,在活生物体中可能具有潜在危险性。在基因组注释中,一个基本问题是预测基因功能。在此,我们描述一种计算方法,该方法可以从给定的蛋白质组中筛选出肽酶及其抑制剂。此外,作为对已在公共领域可用的肽酶特定数据库MEROPS的附加值,我们的方法可以预测一对肽酶/抑制剂是否能够相互作用,并最终列出所有可能的预测配体(肽酶和/或抑制剂)。
我们表明,通过采用决策树方法,相对于分别使用PROSITE和HMMER这两种方法各自获得的结果,在非冗余球状蛋白集中分别检测四种主要肽酶类型(丝氨酸、天冬氨酸、半胱氨酸和金属肽酶)及其抑制剂时,准确性可提高几个百分点。更重要的是,我们的方法随后可以预测肽酶与相互作用抑制剂的配对,联合全局准确性达到99%,阳性案例(肽酶/抑制剂)的覆盖率接近100%,相关系数为0.91%。在这项任务中,决策树方法优于单一方法。
决策树能够可靠地将蛋白质序列分类为属于某一特定类别的肽酶或抑制剂,并能提供肽酶/抑制剂可能相互作用对的综合列表。这些信息有助于设计检测相互作用的肽酶/抑制剂复合物的实验,并能加快可能相互作用候选物的选择,而无需分别搜索并手动组合所得结果。一个专门为注释肽酶及其抑制剂而开发的网络服务器(HIPPIE)可在http://gpcr.biocomp.unibo.it/cgi/predictors/hippie/pred_hippie.cgi获取。