Kawabata Takeshi, Go Nobuhiro
Graduate School of Information Science, Nara Institute of Science and Technology, Ikoma, Nara, Japan.
Proteins. 2007 Aug 1;68(2):516-29. doi: 10.1002/prot.21283.
One of the simplest ways to predict ligand binding sites is to identify pocket-shaped regions on the protein surface. Many programs have already been proposed to identify these pocket regions. Examination of their algorithms revealed that a pocket intrinsically has two arbitrary properties, "size" and "depth". We proposed a new definition for pockets using two explicit adjustable parameters that correspond to these two arbitrary properties. A pocket region is defined as a space into which a small probe can enter, but a large probe cannot. The radii of small and large probe spheres are the two parameters that correspond to the "size" and "depth" of the pockets, respectively. These values can be adjusted individual putative ligand molecule. To determine the optimal value of the large probe spheres radius, we generated pockets for thousands of protein structures in the database, using several size of large probe spheres, examined the correspondence of these pockets with known binding site positions. A new measure of shallowness, a minimum inaccessible radius, R(inaccess), indicated that binding sites of coenzymes are very deep, while those for adenine/guanine mononucleotide have only medium shallowness and those for short peptides and oligosaccharides are shallow. The optimal radius of large probe spheres was 3-4 A for the coenzymes, 4 A for adenine/guanine mononucleotides, and 5 A or more for peptides/oligosaccharides. Comparison of our program with two other popular pocket-finding programs showed that our program had a higher performance of detecting binding pockets, although it required more computational time.
预测配体结合位点最简单的方法之一是识别蛋白质表面呈口袋状的区域。已经有许多程序被提出来识别这些口袋区域。对它们算法的研究表明,口袋本质上具有两个任意属性,即“大小”和“深度”。我们使用与这两个任意属性相对应的两个显式可调参数,为口袋提出了一个新定义。口袋区域被定义为一个小探针能够进入但大探针不能进入的空间。小探针球和大探针球的半径分别是与口袋的“大小”和“深度”相对应的两个参数。这些值可以针对每个假定的配体分子进行调整。为了确定大探针球半径的最佳值,我们使用几种大小的大探针球,为数据库中的数千个蛋白质结构生成口袋,检查这些口袋与已知结合位点位置的对应关系。一种新的浅度度量,即最小不可接近半径R(inaccess),表明辅酶的结合位点非常深,而腺嘌呤/鸟嘌呤单核苷酸的结合位点只有中等浅度,短肽和寡糖的结合位点则很浅。辅酶的大探针球最佳半径为3 - 埃,腺嘌呤/鸟嘌呤单核苷酸为4埃,肽/寡糖为5埃或更大。将我们的程序与其他两个流行的口袋查找程序进行比较表明,尽管我们的程序需要更多的计算时间,但它在检测结合口袋方面具有更高的性能。