Murga Leonel F, Wei Ying, Ondrechen Mary Jo
Department of Chemistry & Chemical Biology and Institute for Complex Scientific Software, Northeastern University, Boston, MA 02115, USA.
Genome Inform. 2007;19:107-18.
Prediction of protein functional sites from 3D structure is an important problem, particularly as structural genomics projects produce hundreds of structures of unknown function, including novel folds and the structures of orphan sequences. The present paper shows how computed protonation properties provide unique and powerful capabilities for the prediction of catalytic sites from the 3D structure alone. These protonation properties of the ionizable residues in a protein may be computed from the 3D structure using the calculated electrical potential function. In particular, the shapes of the theoretical microscopic titration curves (THEMATICS) enable selection of the residues involved in catalysis or small molecule recognition with good sensitivity and precision. Results are shown for 169 annotated enzymes in the Catalytic Site Atlas (CSA). Performance, as measured by residue recall and precision, is clearly better than that of other 3D-structure-based methods. When compared with methods based on sequence alignments and structural comparisons, THEMATICS performance is competitive for well-characterized enzymes. However THEMATICS performance does not degrade in the absence of similarity, as do the alignment-based methods, even if there are few or no sequence homologues or few or no proteins of similar structure. It is further shown that the protonation properties perform well on open, unbound structures, even if there is substantial conformational change upon ligand binding.
从三维结构预测蛋白质功能位点是一个重要问题,尤其是在结构基因组学项目产生数百个功能未知的结构时,这些结构包括新的折叠方式以及孤儿序列的结构。本文展示了如何通过计算质子化性质,仅从三维结构就能为催化位点的预测提供独特且强大的功能。蛋白质中可电离残基的这些质子化性质可以使用计算出的电势函数从三维结构中计算得出。特别地,理论微观滴定曲线(THEMATICS)的形状能够以良好的灵敏度和精度选择参与催化或小分子识别的残基。文中给出了催化位点图谱(CSA)中169种注释酶的结果。通过残基召回率和精度衡量的性能明显优于其他基于三维结构的方法。与基于序列比对和结构比较的方法相比,对于特征明确的酶,THEMATICS的性能具有竞争力。然而,与基于比对的方法不同,即使没有相似性,即几乎没有或没有序列同源物,或几乎没有或没有结构相似的蛋白质,THEMATICS的性能也不会下降。进一步表明,即使在配体结合时存在大量构象变化,质子化性质在开放的未结合结构上也表现良好。