Lange Oliver F, Grubmüller Helmut
Department of Theoretical and Computational Biophysics, Max-Planck-Institute for Biophysical Chemistry, Am Fassberg 11, Göttingen 37077, Germany.
Proteins. 2008 Mar;70(4):1294-312. doi: 10.1002/prot.21618.
Correlated motions in biomolecules are often essential for their function, for example, allosteric signal transduction or mechanical/thermodynamic energy transport. Principal component analysis (PCA) is a widely used method to extract functionally relevant collective motions from a molecular dynamics (MD) trajectory. Being based on the covariance matrix, however, PCA detects only linear correlations. Here we present a new method, full correlation analysis (FCA), which is based on mutual information and thus quantifies all correlations, including nonlinear and higher order correlations. For comparison, we applied both, PCA and FCA, to approximately 100 ns MD trajectories of T4 lysozyme and the hexapeptide neurotensin. For both systems, FCA yielded better resolved conformational substates and aligned its modes more often with actual transition pathways. This improved resolution is shown to be due to a strongly increased anharmonicity of FCA modes as compared to the respective PCA modes. The high anharmonicity further suggests that the motions extracted by FCA are functionally more relevant than those captured by PCA. In summary, FCA should provide improved collective degrees of freedom for dimension-reduced descriptions of macromolecular dynamics.
生物分子中的相关运动通常对其功能至关重要,例如变构信号转导或机械/热力学能量传输。主成分分析(PCA)是一种广泛用于从分子动力学(MD)轨迹中提取功能相关集体运动的方法。然而,基于协方差矩阵的PCA仅检测线性相关性。在此,我们提出一种新方法——全相关分析(FCA),它基于互信息,因此可以量化所有相关性,包括非线性和高阶相关性。为作比较,我们将PCA和FCA都应用于T4溶菌酶和六肽神经降压素约100 ns的MD轨迹。对于这两个系统,FCA产生了分辨率更高的构象亚态,并且其模式更常与实际过渡途径对齐。结果表明,与各自的PCA模式相比,FCA模式的非谐性大幅增加,从而提高了分辨率。高非谐性进一步表明,FCA提取的运动在功能上比PCA捕获的运动更相关。总之,FCA应该为大分子动力学的降维描述提供改进的集体自由度。