Park Keunwan, Kim Dongsup
Department of BioSystems, Korea Advanced Institute of Science and Technology, Daejeon, 305-701, Korea.
Genome Inform. 2006;17(2):216-25.
As the number of protein sequences with unknown function increases, assigning accurate function to unknown protein becomes increasingly an important issue. Protein function is often encoded in a small number of residues located in binding pocket, and there have been many attempts to predict the function using the binding site. Here, we developed a binding site comparison method which can easily identify spatially matched residues between binding sites. Using clique detection algorithm, the new method finds the matched residues of maximum size, and then these matched residues are scored in a way similar to sequence alignment scoring. In addition, the significance of matched score is estimated from the empirical random score distribution. Results of benchmark test suggest that the method successfully detects functionally related binding sites. Furthermore, conserved residues and subfamily-specific residues in the functional family can be identified. In addition, we investigated systematic relationship between binding sites and functions using the binding site comparison method. Result showed that proteins with similar binding site largely perform similar function.
随着功能未知的蛋白质序列数量的增加,为未知蛋白质赋予准确的功能变得越来越重要。蛋白质功能通常编码在位于结合口袋中的少数残基中,并且已经有许多尝试使用结合位点来预测功能。在这里,我们开发了一种结合位点比较方法,该方法可以轻松识别结合位点之间空间匹配的残基。使用团检测算法,新方法找到最大尺寸的匹配残基,然后以类似于序列比对评分的方式对这些匹配残基进行评分。此外,匹配分数的显著性是根据经验随机分数分布来估计的。基准测试结果表明该方法成功检测到功能相关的结合位点。此外,可以识别功能家族中的保守残基和亚家族特异性残基。此外,我们使用结合位点比较方法研究了结合位点与功能之间的系统关系。结果表明,具有相似结合位点的蛋白质在很大程度上具有相似的功能。