Yesylevskyy Semen O, Kharkyanen Valery N, Demchenko Alexander P
Department of Physics of Biological Systems, Institute of Physics, National Academy of Sciences of Ukraine, Kiev, Ukraine.
Biophys J. 2006 Jul 15;91(2):670-85. doi: 10.1529/biophysj.105.078584. Epub 2006 Apr 21.
Existing methods of domain identification in proteins usually provide no information about the degree of domain independence and stability. However, this information is vital for many areas of protein research. The recently developed hierarchical clustering of correlation patterns (HCCP) technique provides machine-based domain identification in a computationally simple and physically consistent way. Here we present the modification of this technique, which not only allows determination of the most plausible number of dynamic domains but also makes it possible to estimate the degree of their independence (the extent of correlated motion) and stability (the range of environmental conditions, where domains remain intact). With this technique we provided domain assignments and calculated intra- and interdomain correlations and interdomain energies for >2500 test proteins. It is shown that mean intradomain correlation of motions can serve as a quantitative criterion of domain independence, and the HCCP stability gap is a measure of their stability. Our data show that the motions of domains with high stability are usually independent. In contrast, the domains with moderate stability usually exhibit a substantial degree of correlated motions. It is shown that in multidomain proteins the domains are most stable if they are of similar size, and this correlates with the observed abundance of such proteins.
蛋白质中现有结构域识别方法通常无法提供有关结构域独立性和稳定性程度的信息。然而,这些信息对蛋白质研究的许多领域至关重要。最近开发的相关模式层次聚类(HCCP)技术以计算简单且物理上一致的方式提供基于机器的结构域识别。在此,我们展示了对该技术的改进,其不仅能够确定最合理的动态结构域数量,还能够估计其独立性程度(相关运动的范围)和稳定性(结构域保持完整的环境条件范围)。利用该技术,我们为超过2500种测试蛋白质提供了结构域分配,并计算了结构域内和结构域间的相关性以及结构域间能量。结果表明,结构域内运动的平均相关性可作为结构域独立性的定量标准,而HCCP稳定性差距是其稳定性的一种度量。我们的数据表明,高稳定性结构域的运动通常是独立的。相反,中等稳定性的结构域通常表现出相当程度的相关运动。结果表明,在多结构域蛋白质中,如果结构域大小相似,则它们最稳定,这与观察到的此类蛋白质的丰度相关。