利用配体相互作用和富含结合位点的蛋白质三角形进行配体结合位点预测。

Ligand-binding site prediction using ligand-interacting and binding site-enriched protein triangles.

机构信息

Institute of Biomedical Informatics, National Yang-Ming University, Taipei 112, Taiwan.

出版信息

Bioinformatics. 2012 Jun 15;28(12):1579-85. doi: 10.1093/bioinformatics/bts182. Epub 2012 Apr 11.

DOI:10.1093/bioinformatics/bts182

PMID:22495747

Abstract

MOTIVATION

Knowledge about the site at which a ligand binds provides an important clue for predicting the function of a protein and is also often a prerequisite for performing docking computations in virtual drug design and screening. We have previously shown that certain ligand-interacting triangles of protein atoms, called protein triangles, tend to occur more frequently at ligand-binding sites than at other parts of the protein.

RESULTS

In this work, we describe a new ligand-binding site prediction method that was developed based on binding site-enriched protein triangles. The new method was tested on 2 benchmark datasets and on 19 targets from two recent community-based studies of such predictions, and excellent results were obtained. Where comparisons were made, the success rates for the new method for the first predicted site were significantly better than methods that are not a meta-predictor. Further examination showed that, for most of the unsuccessful predictions, the pocket of the ligand-binding site was identified, but not the site itself, whereas for some others, the failure was not due to the method itself but due to the use of an incorrect biological unit in the structure examined, although using correct biological units would not necessarily improve the prediction success rates. These results suggest that the new method is a valuable new addition to a suite of existing structure-based bioinformatics tools for studies of molecular recognition and related functions of proteins in post-genomics research.

AVAILABILITY

The executable binaries and a web server for our method are available from http://sourceforge.net/projects/msdock/ and http://lise.ibms.sinica.edu.tw, respectively, free for academic users.

摘要

动机

了解配体结合的位置为预测蛋白质的功能提供了重要线索，并且通常也是进行虚拟药物设计和筛选中对接计算的前提。我们之前已经表明，某些配体相互作用的蛋白质原子三角形，称为蛋白质三角形，往往比蛋白质的其他部分更频繁地出现在配体结合部位。

结果

在这项工作中，我们描述了一种新的基于结合部位富集的蛋白质三角形的配体结合部位预测方法。该新方法在 2 个基准数据集和 2 个最近基于社区的此类预测研究中的 19 个靶标上进行了测试，获得了优异的结果。在进行比较的情况下，新方法对于第一个预测的结合部位的成功率明显优于不是元预测器的方法。进一步的检查表明，对于大多数不成功的预测，已经识别出了配体结合部位的口袋，但未识别出结合部位本身，而对于其他一些预测，失败不是由于方法本身，而是由于所检查的结构中使用了不正确的生物单元，尽管使用正确的生物单元不一定会提高预测成功率。这些结果表明，新方法是现有基于结构的生物信息学工具套件的一个有价值的补充，可用于研究后基因组研究中蛋白质的分子识别和相关功能。