Aloy P, Querol E, Aviles F X, Sternberg M J
Institut de Biologia Fonamental and Departament de Bioquimica, Universitat Autonoma de Barcelona, Bellaterra, Barcelona, 08193, Spain.
J Mol Biol. 2001 Aug 10;311(2):395-408. doi: 10.1006/jmbi.2001.4870.
A major problem in genome annotation is whether it is valid to transfer the function from a characterised protein to a homologue of unknown activity. Here, we show that one can employ a strategy that uses a structure-based prediction of protein functional sites to assess the reliability of functional inheritance. We have automated and benchmarked a method based on the evolutionary trace approach. Using a multiple sequence alignment, we identified invariant polar residues, which were then mapped onto the protein structure. Spatial clusters of these invariant residues formed the predicted functional site. For 68 of 86 proteins examined, the method yielded information about the observed functional site. This algorithm for functional site prediction was then used to assess the validity of transferring the function between homologues. This procedure was tested on 18 pairs of homologous proteins with unrelated function and 70 pairs of proteins with related function, and was shown to be 94 % accurate. This automated method could be linked to schemes for genome annotation. Finally, we examined the use of functional site prediction in protein-protein and protein-DNA docking. The use of predicted functional sites was shown to filter putative docked complexes with a discrimination similar to that obtained by manually including biological information about active sites or DNA-binding residues.
基因组注释中的一个主要问题是,将一个已明确功能的蛋白质的功能转移到一个活性未知的同源物上是否合理。在此,我们表明可以采用一种策略,即使用基于结构的蛋白质功能位点预测来评估功能遗传的可靠性。我们已经对一种基于进化踪迹方法的手段进行了自动化处理并开展了基准测试。利用多序列比对,我们识别出不变的极性残基,然后将其映射到蛋白质结构上。这些不变残基的空间簇构成了预测的功能位点。在所检测的86种蛋白质中,有68种该方法产生了关于观察到的功能位点的信息。然后,这种功能位点预测算法被用于评估同源物之间功能转移的有效性。该程序在18对功能不相关的同源蛋白质和70对功能相关的蛋白质上进行了测试,结果显示准确率达94%。这种自动化方法可以与基因组注释方案相联系。最后,我们研究了功能位点预测在蛋白质-蛋白质和蛋白质-DNA对接中的应用。结果表明,使用预测的功能位点来筛选假定的对接复合物,其辨别能力与通过手动纳入有关活性位点或DNA结合残基的生物学信息所获得的辨别能力相似。