Structural Bioinformatics, BIOTEC Technical University of Dresden, Tatzberg 47-51, 01307 Dresden, Germany.
Bioinformatics. 2011 Feb 1;27(3):351-8. doi: 10.1093/bioinformatics/btq672. Epub 2010 Dec 6.
Identification of ligand binding pockets on proteins is crucial for the characterization of protein functions. It provides valuable information for protein-ligand docking and rational engineering of small molecules that regulate protein functions. A major number of current prediction algorithms of ligand binding pockets are based on cubic grid representation of proteins and, thus, the results are often protein orientation dependent.
We present the MSPocket program for detecting pockets on the solvent excluded surface of proteins. The core algorithm of the MSPocket approach does not use any cubic grid system to represent proteins and is therefore independent of protein orientations. We demonstrate that MSPocket is able to achieve an accuracy of 75% in predicting ligand binding pockets on a test dataset used for evaluating several existing methods. The accuracy is 92% if the top three predictions are considered. Comparison to one of the recently published best performing methods shows that MSPocket reaches similar performance with the additional feature of being protein orientation independent. Interestingly, some of the predictions are different, meaning that the two methods can be considered complementary and combined to achieve better prediction accuracy. MSPocket also provides a graphical user interface for interactive investigation of the predicted ligand binding pockets. In addition, we show that overlap criterion is a better strategy for the evaluation of predicted ligand binding pockets than the single point distance criterion.
The MSPocket source code can be downloaded from http://appserver.biotec.tu-dresden.de/MSPocket/. MSPocket is also available as a PyMOL plugin with a graphical user interface.
鉴定蛋白质上的配体结合口袋对于蛋白质功能的描述至关重要。它为蛋白质-配体对接和调控蛋白质功能的小分子的合理设计提供了有价值的信息。目前,大多数配体结合口袋的预测算法都是基于蛋白质的立方网格表示,因此结果往往依赖于蛋白质的取向。
我们提出了 MSPocket 程序,用于检测蛋白质溶剂排除表面上的口袋。MSPocket 方法的核心算法不使用任何立方网格系统来表示蛋白质,因此不依赖于蛋白质的取向。我们证明,MSPocket 能够在用于评估几种现有方法的测试数据集上达到 75%的预测配体结合口袋的准确性。如果考虑前三个预测,则准确性为 92%。与最近发表的一种表现最好的方法进行比较表明,MSPocket 达到了类似的性能,并且具有不依赖于蛋白质取向的额外特点。有趣的是,一些预测结果是不同的,这意味着这两种方法可以互补并用,以达到更高的预测准确性。MSPocket 还提供了一个图形用户界面,用于交互式研究预测的配体结合口袋。此外,我们表明,与单点距离标准相比,重叠标准是评估预测配体结合口袋的更好策略。
MSPocket 的源代码可从 http://appserver.biotec.tu-dresden.de/MSPocket/ 下载。MSPocket 也作为一个带有图形用户界面的 PyMOL 插件提供。