Department of Computational Chemistry and Cheminformatics, Wyeth Research, 401 N. Middletown Rd., Pearl River, NY 10965, USA.
Chem Biol Drug Des. 2010 Aug;76(2):142-53. doi: 10.1111/j.1747-0285.2010.00994.x.
The Protein Data Bank is the most comprehensive source of experimental macromolecular structures. It can, however, be difficult at times to locate relevant structures with the Protein Data Bank search interface. This is particularly true when searching for complexes containing specific interactions between protein and ligand atoms. Moreover, searching within a family of proteins can be tedious. For example, one cannot search for some conserved residue as residue numbers vary across structures. We describe herein three databases, Protein Relational Database, Kinase Knowledge Base, and Matrix Metalloproteinase Knowledge Base, containing protein structures from the Protein Data Bank. In Protein Relational Database, atom-atom distances between protein and ligand have been precalculated allowing for millisecond retrieval based on atom identity and distance constraints. Ring centroids, centroid-centroid and centroid-atom distances and angles have also been included permitting queries for pi-stacking interactions and other structural motifs involving rings. Other geometric features can be searched through the inclusion of residue pair and triplet distances. In Kinase Knowledge Base and Matrix Metalloproteinase Knowledge Base, the catalytic domains have been aligned into common residue numbering schemes. Thus, by searching across Protein Relational Database and Kinase Knowledge Base, one can easily retrieve structures wherein, for example, a ligand of interest is making contact with the gatekeeper residue.
蛋白质数据库是实验性大分子结构最全面的来源。然而,有时使用蛋白质数据库搜索界面定位相关结构可能会很困难。当搜索包含蛋白质和配体原子之间特定相互作用的复合物时尤其如此。此外,在蛋白质家族中搜索可能很乏味。例如,由于结构之间的残基数不同,无法搜索某些保守残基。我们在此描述了三个数据库,即蛋白质关系数据库、激酶知识库和基质金属蛋白酶知识库,它们包含来自蛋白质数据库的蛋白质结构。在蛋白质关系数据库中,已经预先计算了蛋白质和配体之间的原子-原子距离,允许根据原子身份和距离约束进行毫秒级检索。还包括环质心、质心-质心和质心-原子距离和角度,允许查询π堆积相互作用和涉及环的其他结构基序。通过包括残基对和三联体距离,可以搜索其他几何特征。在激酶知识库和基质金属蛋白酶知识库中,催化结构域已对齐到常见的残基编号方案中。因此,通过在蛋白质关系数据库和激酶知识库中进行搜索,可以轻松检索到例如,感兴趣的配体与守门员残基接触的结构。