Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan.
IEEE/ACM Trans Comput Biol Bioinform. 2010 Apr-Jun;7(2):333-41. doi: 10.1109/TCBB.2008.62.
Analysis of conformational changes is one of the keys to the understanding of protein functions and interactions. For the analysis, we often compare two protein structures, taking flexible regions like hinge regions into consideration. The Root Mean Square Deviation (RMSD) is the most popular measure for comparing two protein structures, but it is only for rigid structures without hinge regions. In this paper, we propose a new measure called RMSD considering hinges (RMSDh) and its variant RMSDh(k) for comparing two flexible proteins with hinge regions. We also propose novel efficient algorithms for computing them, which can detect the hinge positions at the same time. The RMSDh is suitable for cases where there is one small hinge region in each of the two target structures. The new algorithm for computing the RMSDh runs in linear time, which is the same as the time complexity for computing the RMSD and is faster than any of previous algorithms for hinge detection. The RMSDh(k) is designed for comparing structures with more than one hinge region. The RMSDh(k) measure considers at most k small hinge region, i.e., the RMSDh(k) value should be small if the two structures are similar except for at most k hinge regions. To compute the value, we propose an O(kn2)-time and O(n)-space algorithm based on a new dynamic programming technique. With the same computational time and space, we can enumerate the predicted hinge positions. We also test our algorithms against actual flexible protein structures, and show that the hinge positions can be correctly detected by our algorithms.
构象变化分析是理解蛋白质功能和相互作用的关键之一。在分析中,我们通常比较两个蛋白质结构,同时考虑铰链等柔性区域。均方根偏差(RMSD)是比较两个蛋白质结构最常用的度量标准,但它仅适用于没有铰链区域的刚性结构。在本文中,我们提出了一种新的度量标准,称为考虑铰链的均方根偏差(RMSDh)及其变体 RMSDh(k),用于比较具有铰链区域的两个柔性蛋白质。我们还提出了用于计算它们的新颖高效算法,这些算法可以同时检测铰链位置。RMSDh 适用于两个目标结构中每个结构都有一个小铰链区域的情况。用于计算 RMSDh 的新算法以线性时间运行,与计算 RMSD 的时间复杂度相同,并且比任何以前的铰链检测算法都快。RMSDh(k) 用于比较具有多个铰链区域的结构。RMSDh(k) 度量标准最多考虑 k 个小铰链区域,即如果两个结构除了最多 k 个铰链区域之外相似,则 RMSDh(k) 值应该较小。为了计算该值,我们提出了一种基于新动态规划技术的 O(kn2)-时间和 O(n)-空间算法。使用相同的计算时间和空间,我们可以枚举预测的铰链位置。我们还针对实际的柔性蛋白质结构测试了我们的算法,并表明我们的算法可以正确检测铰链位置。