Graduate School of Information Science, Nara Institute of Science and Technology, Takayama 8916-5, Ikoma, Nara 630-0192, Japan.
Proteins. 2010 Apr;78(5):1195-211. doi: 10.1002/prot.22639.
Detection of pockets on protein surfaces is an important step toward finding the binding sites of small molecules. In a previous study, we defined a pocket as a space into which a small spherical probe can enter, but a large probe cannot. The radius of the large probes corresponds to the shallowness of pockets. We showed that each type of binding molecule has a characteristic shallowness distribution. In this study, we introduced fundamental changes to our previous algorithm by using a 3D grid representation of proteins and probes, and the theory of mathematical morphology. We invented an efficient algorithm for calculating deep and shallow pockets (multiscale pockets) simultaneously, using several different sizes of spherical probes (multiscale probes). We implemented our algorithm as a new program, ghecom (grid-based HECOMi finder). The statistics of calculated pockets for the structural dataset showed that our program had a higher performance of detecting binding pockets, than four other popular pocket-finding programs proposed previously. The ghecom also calculates the shallowness of binding ligands, R(inaccess) (minimum radius of inaccessible spherical probes) that can be obtained from the multiscale molecular volume. We showed that each part of the binding molecule had a bias toward a specific range of shallowness. These findings will be useful for predicting the types of molecules that will be most likely to bind putative binding pockets, as well as the configurations of binding molecules. The program ghecom is available through the Web server (http://biunit.naist.jp/ghecom).
检测蛋白质表面的口袋是寻找小分子结合位点的重要步骤。在之前的研究中,我们将口袋定义为一个可以容纳小球探针进入但大探针无法进入的空间。大探针的半径对应于口袋的深浅。我们表明,每种结合分子都有其特征性的深浅分布。在这项研究中,我们通过使用蛋白质和探针的 3D 网格表示以及数学形态学理论,对我们之前的算法进行了根本性的改变。我们发明了一种使用几种不同大小的球形探针(多尺度探针)同时计算深口袋和浅口袋(多尺度口袋)的有效算法。我们将我们的算法实现为一个新程序,ghecom(基于网格的 HECOMi 查找器)。对于结构数据集计算出的口袋统计数据表明,我们的程序在检测结合口袋方面的性能优于之前提出的四个其他流行的口袋查找程序。ghecom 还计算了结合配体的深浅度,R(inaccess)(不可进入的球形探针的最小半径),可以从多尺度分子体积中获得。我们表明,结合分子的每个部分都偏向于特定的深浅度范围。这些发现将有助于预测最有可能与假定结合口袋结合的分子类型以及结合分子的构型。程序 ghecom 可通过 Web 服务器(http://biunit.naist.jp/ghecom)获得。