Shulman-Peleg Alexandra, Nussinov Ruth, Wolfson Haim J
School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel.
J Mol Biol. 2004 Jun 4;339(3):607-33. doi: 10.1016/j.jmb.2004.04.012.
Recognition of regions on the surface of one protein, that are similar to a binding site of another is crucial for the prediction of molecular interactions and for functional classifications. We first describe a novel method, SiteEngine, that assumes no sequence or fold similarities and is able to recognize proteins that have similar binding sites and may perform similar functions. We achieve high efficiency and speed by introducing a low-resolution surface representation via chemically important surface points, by hashing triangles of physico-chemical properties and by application of hierarchical scoring schemes for a thorough exploration of global and local similarities. We proceed to rigorously apply this method to functional site recognition in three possible ways: first, we search a given functional site on a large set of complete protein structures. Second, a potential functional site on a protein of interest is compared with known binding sites, to recognize similar features. Third, a complete protein structure is searched for the presence of an a priori unknown functional site, similar to known sites. Our method is robust and efficient enough to allow computationally demanding applications such as the first and the third. From the biological standpoint, the first application may identify secondary binding sites of drugs that may lead to side-effects. The third application finds new potential sites on the protein that may provide targets for drug design. Each of the three applications may aid in assigning a function and in classification of binding patterns. We highlight the advantages and disadvantages of each type of search, provide examples of large-scale searches of the entire Protein Data Base and make functional predictions.
识别一种蛋白质表面与另一种蛋白质结合位点相似的区域,对于预测分子相互作用和进行功能分类至关重要。我们首先描述一种新方法SiteEngine,它不假设序列或折叠相似性,能够识别具有相似结合位点且可能执行相似功能的蛋白质。我们通过经由化学重要表面点引入低分辨率表面表示、对物理化学性质的三角形进行哈希处理以及应用分层评分方案来全面探索全局和局部相似性,从而实现高效和快速。我们以三种可能的方式严格地将此方法应用于功能位点识别:第一,在一大组完整蛋白质结构上搜索给定的功能位点。第二,将感兴趣蛋白质上的潜在功能位点与已知结合位点进行比较,以识别相似特征。第三,在完整蛋白质结构中搜索与已知位点相似的先验未知功能位点的存在。我们的方法足够稳健和高效,能够允许进行诸如第一种和第三种计算要求高的应用。从生物学角度来看,第一种应用可能识别出可能导致副作用的药物的二级结合位点。第三种应用在蛋白质上找到可能为药物设计提供靶点的新潜在位点。这三种应用中的每一种都可能有助于功能分配和结合模式分类。我们突出每种搜索类型的优缺点,提供对整个蛋白质数据库进行大规模搜索的示例并进行功能预测。