Bioinformatics Institute, A*STAR, 30 Biopolis Street, #07-01 Matrix, Singapore 138671.
BMC Bioinformatics. 2010 Mar 26;11:155. doi: 10.1186/1471-2105-11-155.
Computational comparison of two protein structures is the starting point of many methods that build on existing knowledge, such as structure modeling (including modeling of protein complexes and conformational changes), molecular replacement, or annotation by structural similarity. In a commonly used strategy, significant effort is invested in matching two sets of atoms. In a complementary approach, a global descriptor is assigned to the overall structure, thus losing track of the substructures within.
Using a small set of geometric features, we define a reduced representation of protein structure, together with an optimizing function for matching two representations, to provide a pre-filtering stage in a database search. We show that, in a straightforward implementation, the representation performs well in terms of resolution in the space of protein structures, and its ability to make new predictions.
Perhaps unexpectedly, a substantial discriminating power already exists at the level of main features of protein structure, such as directions of secondary structural elements, possibly constrained by their sequential order. This can be used toward efficient comparison of protein (sub)structures, allowing for various degrees of conformational flexibility within the compared pair, which in turn can be used for modeling by homology of protein structure and dynamics.
比较两种蛋白质结构的计算是许多基于现有知识的方法的起点,例如结构建模(包括蛋白质复合物和构象变化的建模)、分子置换或结构相似性注释。在常用的策略中,需要投入大量精力来匹配两组原子。在互补方法中,会为整体结构分配全局描述符,从而失去对内部子结构的跟踪。
使用一小部分几何特征,我们定义了蛋白质结构的简化表示,以及用于匹配两个表示的优化函数,以在数据库搜索中提供预筛选阶段。我们表明,在直接实现中,该表示在蛋白质结构空间中的分辨率及其进行新预测的能力方面表现良好。
出乎意料的是,蛋白质结构的主要特征(如二级结构元素的方向)可能受到其顺序的限制,在该水平上已经存在很大的区分能力。这可用于有效地比较蛋白质(子)结构,允许在比较对中具有各种程度的构象灵活性,这反过来又可用于同源建模蛋白质结构和动力学。