Van Walle Ivo, Lasters Ignace, Wyns Lode
Department of Ultrastructure, Vrije Universiteit Brussel, Sint-Genesius Rode, Belgium.
Proteins. 2003 Apr 1;51(1):1-9. doi: 10.1002/prot.10293.
Comparing two remotely similar structures is a difficult problem: more often than not, resulting structure alignments will show ambiguities and a unique answer usually does not even exist. In addition, alignments in general have a limited information content because every aligned residue is considered equally important. To solve these issues to a certain extent, one can take the perspective of a whole group of similar structures and then evaluate common structural features. Here, we describe a consistency approach that, although not actually performing a multiple structure alignment, does produce the information that one would conceivably want from such an experiment: the key structural features of the group, e.g., a fold, which in this case are projected onto either a pair of proteins or a single protein. Both representations are useful for a number of applications, ranging from the detection of (partially) wrong structure alignments to protein structure classification and fold recognition. To demonstrate some of these applications, the procedure was applied to 195 SCOP folds containing a total of 1802 domains sharing very low sequence similarity.
通常情况下,最终的结构比对会显示出模糊性,甚至往往不存在唯一答案。此外,一般来说比对的信息含量有限,因为每个比对的残基都被视为同等重要。为了在一定程度上解决这些问题,可以从一整个相似结构组的角度出发,然后评估共同的结构特征。在此,我们描述了一种一致性方法,虽然它实际上并未进行多结构比对,但确实能产生人们可以想象从这样一个实验中获得的信息:该结构组的关键结构特征,例如一种折叠,在这种情况下它被投影到一对蛋白质或单个蛋白质上。这两种表示形式对于许多应用都很有用,从检测(部分)错误的结构比对到蛋白质结构分类和折叠识别。为了展示其中一些应用,该程序被应用于195个SCOP折叠,这些折叠总共包含1802个结构域,它们的序列相似性非常低。