Ben-Shimon Avraham, Eisenstein Miriam
Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel.
J Mol Biol. 2005 Aug 12;351(2):309-26. doi: 10.1016/j.jmb.2005.06.047.
Analysis of the distances of the exposed residues in 175 enzymes from the centroids of the molecules indicates that catalytic residues are very often found among the 5% of residues closest to the enzyme centroid. This property of catalytic residues is implemented in a new prediction algorithm (named EnSite) for locating the active sites of enzymes and in a new scheme for re-ranking enzyme-ligand docking solutions. EnSite examines only 5% of the molecular surface (represented by surface dots) that is closest to the centroid, identifying continuous surface segments and ranking them by their area size. EnSite ranks the correct prediction 1-4 in 97% of the cases in a dataset of 65 monomeric enzymes (rank 1 for 89% of the cases) and in 86% of the cases in a dataset of 176 monomeric and multimeric enzymes from all six top-level enzyme classifications (rank 1 in 74% of the cases). Importantly, identification of buried or flat active sites is straightforward because EnSite "looks" at the molecular surface from the inside out. Detailed examination of the results indicates that the proximity of the catalytic residues to the centroid is a property of the functional unit, defined as the assembly of domains or chains that form the active site (in most cases the functional unit corresponds to a single whole polypeptide chain). Using the functional unit in the prediction further improves the results. The new property of active sites is also used for re-evaluating enzyme-inhibitor unbound docking results. Sorting the docking solutions by the distance of the interface to the centroid of the enzyme improves remarkably the ranks of nearly correct solutions compared to ranks based on geometric-electrostatic-hydrophobic complementarity scores.
对175种酶中暴露残基到分子质心的距离进行分析表明,催化残基经常出现在最接近酶质心的5%的残基之中。催化残基的这一特性被应用于一种用于定位酶活性位点的新预测算法(名为EnSite)以及一种用于对酶-配体对接解决方案重新排序的新方案中。EnSite仅检查最接近质心的5%的分子表面(由表面点表示),识别连续的表面片段并按其面积大小进行排序。在一个包含65种单体酶的数据集里,EnSite在97%的情况下将正确预测排在1-4位(89%的情况排在第1位);在一个来自所有六个顶级酶分类的包含176种单体和多聚体酶的数据集里,86%的情况下排在1-4位(74%的情况排在第1位)。重要的是,识别埋藏或扁平的活性位点很直接,因为EnSite是从分子内部向外“观察”分子表面。对结果的详细检查表明,催化残基与质心的接近是功能单元的一个特性,功能单元被定义为形成活性位点的结构域或链的集合(在大多数情况下,功能单元对应于单个完整的多肽链)。在预测中使用功能单元可进一步改善结果。活性位点的这一新特性还用于重新评估酶-抑制剂未结合的对接结果。与基于几何-静电-疏水互补性分数的排名相比,根据酶与配体对接界面到酶质心的距离对对接解决方案进行排序,可显著提高几乎正确解决方案的排名。