Zagrovic Bojan, Pande Vijay S
Biophysics Program, Stanford University, Stanford, California 94305-5080, USA.
Biophys J. 2004 Oct;87(4):2240-6. doi: 10.1529/biophysj.104.042184.
Recent algorithmic advances and continual increase in computational power have made it possible to simulate protein folding and dynamics on the level of ensembles. Furthermore, analyzing protein structure by using ensemble representation is intrinsic to certain experimental techniques, such as nuclear magnetic resonance. This creates a problem of how to compare an ensemble of molecules with a given reference structure. Recently, we used distance-based root-mean-square deviation (dRMS) to compare the native structure of a protein with its unfolded-state ensemble. We showed that for small, mostly alpha-helical proteins, the mean unfolded-state Calpha-Calpha distance matrix is significantly more nativelike than the Calpha-Calpha matrices corresponding to the individual members of the unfolded ensemble. Here, we give a mathematical derivation that shows that, for any ensemble of structures, the dRMS deviation between the ensemble-averaged distance matrix and any given reference distance matrix is always less than or equal to the average dRMS deviation of the individual members of the ensemble from the same reference matrix. This holds regardless of the nature of the reference structure or the structural ensemble in question. In other words, averaging of distance matrices can only increase their level of similarity to a given reference matrix, relative to the individual matrices comprising the ensemble. Furthermore, we show that the above inequality holds in the case of Cartesian coordinate-based root-mean-square deviation as well. We discuss this in the context of our proposal that the average structure of the unfolded ensemble of small helical proteins is close to the native structure, and demonstrate that this finding goes beyond the above mathematical fact.
近期算法的进步以及计算能力的持续提升,使得在系综层面模拟蛋白质折叠和动力学成为可能。此外,通过使用系综表示来分析蛋白质结构对于某些实验技术(如核磁共振)而言是固有的。这就产生了一个问题,即如何将一组分子与给定的参考结构进行比较。最近,我们使用基于距离的均方根偏差(dRMS)来比较蛋白质的天然结构与其未折叠状态的系综。我们表明,对于小型的、主要为α螺旋的蛋白质,平均未折叠状态的Cα - Cα距离矩阵比未折叠系综中各个成员对应的Cα - Cα矩阵更接近天然结构。在此,我们给出一个数学推导,表明对于任何结构系综,系综平均距离矩阵与任何给定参考距离矩阵之间的dRMS偏差总是小于或等于系综中各个成员与同一参考矩阵的平均dRMS偏差。无论所讨论的参考结构或结构系综的性质如何,这一结论都成立。换句话说,相对于构成系综的各个矩阵,距离矩阵的平均只能增加它们与给定参考矩阵的相似程度。此外,我们还表明上述不等式在基于笛卡尔坐标的均方根偏差情况下同样成立。我们在我们提出的小型螺旋蛋白质未折叠系综的平均结构接近天然结构这一观点的背景下进行了讨论,并证明这一发现超越了上述数学事实。