Ortiz Angel R, Strauss Charlie E M, Olmea Osvaldo
Department of Physiology and Biophysics, Mount Sinai School of Medicine, New York University, New York, New York 10029, USA.
Protein Sci. 2002 Nov;11(11):2606-21. doi: 10.1110/ps.0215902.
Advances in structural genomics and protein structure prediction require the design of automatic, fast, objective, and well benchmarked methods capable of comparing and assessing the similarity of low-resolution three-dimensional structures, via experimental or theoretical approaches. Here, a new method for sequence-independent structural alignment is presented that allows comparison of an experimental protein structure with an arbitrary low-resolution protein tertiary model. The heuristic algorithm is given and then used to show that it can describe random structural alignments of proteins with different folds with good accuracy by an extreme value distribution. From this observation, a structural similarity score between two proteins or two different conformations of the same protein is derived from the likelihood of obtaining a given structural alignment by chance. The performance of the derived score is then compared with well established, consensus manual-based scores and data sets. We found that the new approach correlates better than other tools with the gold standard provided by a human evaluator. Timings indicate that the algorithm is fast enough for routine use with large databases of protein models. Overall, our results indicate that the new program (MAMMOTH) will be a good tool for protein structure comparisons in structural genomics applications. MAMMOTH is available from our web site at http://physbio.mssm.edu/~ortizg/.
结构基因组学和蛋白质结构预测的进展需要设计自动、快速、客观且经过良好基准测试的方法,这些方法能够通过实验或理论途径比较和评估低分辨率三维结构的相似性。在此,提出了一种用于序列无关结构比对的新方法,该方法允许将实验性蛋白质结构与任意低分辨率蛋白质三级模型进行比较。给出了启发式算法,然后用它表明通过极值分布,该算法能够以良好的准确性描述具有不同折叠的蛋白质的随机结构比对。基于这一观察结果,从偶然获得给定结构比对的可能性中得出两种蛋白质或同一蛋白质的两种不同构象之间的结构相似性得分。然后将所得分数的性能与基于专家共识的成熟得分和数据集进行比较。我们发现,新方法与人类评估者提供的黄金标准的相关性优于其他工具。计时结果表明,该算法速度足够快,可用于对大型蛋白质模型数据库进行常规操作。总体而言,我们的结果表明,新程序(MAMMOTH)将成为结构基因组学应用中蛋白质结构比较的良好工具。可从我们的网站http://physbio.mssm.edu/~ortizg/获取MAMMOTH。