Li Hongzhi, Zhou Yaoqi
Department of Physiology & Biophysics, Howard Hughes Medical Institute Center for Single Molecule Biophysics, State University of New York at Buffalo, 124 Sherman Hall, Buffalo, NY 14214, USA.
J Comput Chem. 2005 Aug;26(11):1189-92. doi: 10.1002/jcc.20251.
We developed a method for fast decoy clustering by using reference root-mean-squared distance (rRMSD) rather than commonly used pairwise RMSD (pRMSD) values. For 41 proteins with 2000 decoys each, the computing efficiency increases nine times without a significant change in the accuracy of near-native selections. Tests on additional protein decoys based on different reference conformations confirmed this result. Further analysis indicates that the pRMSD and rRMSD values are highly correlated (with an average correlation coefficient of 0.82) and the clusters obtained from pRMSD and rRMSD values are highly similar (the representative structures of the top five largest clusters from the two methods are 74% identical). SCUD (Structure ClUstering of Decoys) with an automatic cutoff value is available at http://theory.med.buffalo.edu.
我们开发了一种通过使用参考均方根距离(rRMSD)而非常用的成对均方根距离(pRMSD)值来进行快速诱饵聚类的方法。对于41种每种都有2000个诱饵的蛋白质,计算效率提高了九倍,同时近天然选择的准确性没有显著变化。基于不同参考构象对其他蛋白质诱饵进行的测试证实了这一结果。进一步分析表明,pRMSD和rRMSD值高度相关(平均相关系数为0.82),并且从pRMSD和rRMSD值获得的聚类高度相似(两种方法中最大的五个聚类的代表性结构有74%相同)。具有自动截止值的SCUD(诱饵结构聚类)可在http://theory.med.buffalo.edu获取。