Hu Jing, Yan Changhui
Department of Computer Science, Utah State University, Logan, UT, USA.
BMC Struct Biol. 2009 Aug 3;9:52. doi: 10.1186/1472-6807-9-52.
In the research on protein functional sites, researchers often need to identify binding-site residues on a protein. A commonly used strategy is to find a complex structure from the Protein Data Bank (PDB) that consists of the protein of interest and its interacting partner(s) and calculate binding-site residues based on the complex structure. However, since a protein may participate in multiple interactions, the binding-site residues calculated based on one complex structure usually do not reveal all binding sites on a protein. Thus, this requires researchers to find all PDB complexes that contain the protein of interest and combine the binding-site information gleaned from them. This process is very time-consuming. Especially, combing binding-site information obtained from different PDB structures requires tedious work to align protein sequences. The process becomes overwhelmingly difficult when researchers have a large set of proteins to analyze, which is usually the case in practice.
In this study, we have developed a tool for calculating binding-site residues on proteins, TCBRP http://yanbioinformatics.cs.usu.edu:8080/ppbindingsubmit. For an input protein, TCBRP can quickly find all binding-site residues on the protein by automatically combining the information obtained from all PDB structures that consist of the protein of interest. Additionally, TCBRP presents the binding-site residues in different categories according to the interaction type. TCBRP also allows researchers to set the definition of binding-site residues.
The developed tool is very useful for the research on protein binding site analysis and prediction.
在蛋白质功能位点的研究中,研究人员经常需要识别蛋白质上的结合位点残基。一种常用的策略是从蛋白质数据库(PDB)中找到一个由感兴趣的蛋白质及其相互作用伙伴组成的复杂结构,并根据该复杂结构计算结合位点残基。然而,由于一种蛋白质可能参与多种相互作用,基于一个复杂结构计算出的结合位点残基通常并不能揭示该蛋白质上的所有结合位点。因此,这就要求研究人员找到所有包含感兴趣蛋白质的PDB复合物,并整合从它们那里收集到的结合位点信息。这个过程非常耗时。特别是,整合从不同PDB结构中获得的结合位点信息需要进行繁琐的蛋白质序列比对工作。当研究人员有大量蛋白质需要分析时,这个过程就变得极其困难,而在实际情况中通常就是这样。
在本研究中,我们开发了一种用于计算蛋白质上结合位点残基的工具,即TCBRP http://yanbioinformatics.cs.usu.edu:8080/ppbindingsubmit。对于输入的蛋白质,TCBRP可以通过自动整合从所有包含感兴趣蛋白质的PDB结构中获得的信息,快速找到该蛋白质上的所有结合位点残基。此外,TCBRP会根据相互作用类型将结合位点残基呈现为不同类别。TCBRP还允许研究人员设置结合位点残基的定义。
所开发的工具对于蛋白质结合位点分析和预测的研究非常有用。