Suppr超能文献

时变主成分分析:基于绝热动力学的高维数据降维统一方法。

Time-dependent principal component analysis: A unified approach to high-dimensional data reduction using adiabatic dynamics.

机构信息

Research Center for Computational Design of Advanced Functional Materials (CD-FMat), National Institute of Advanced Industrial Science and Technology (AIST), Central 2, 1-1-1 Umezono, Tsukuba 305-8568, Japan and Mathematics for Advanced Materials Open Innovation Laboratory (MathAM-OIL), National Institute of Advanced Industrial Science and Technology (AIST), c/o AIMR, Tohoku University, 2-1-1 Katahira, Aoba-ku, Sendai 980-8577, Japan.

出版信息

J Chem Phys. 2021 Oct 7;155(13):134114. doi: 10.1063/5.0061874.

Abstract

Systematic reduction of the dimensionality is highly demanded in making a comprehensive interpretation of experimental and simulation data. Principal component analysis (PCA) is a widely used technique for reducing the dimensionality of molecular dynamics (MD) trajectories, which assists our understanding of MD simulation data. Here, we propose an approach that incorporates time dependence in the PCA algorithm. In the standard PCA, the eigenvectors obtained by diagonalizing the covariance matrix are time independent. In contrast, they are functions of time in our new approach, and their time evolution is implemented in the framework of Car-Parrinello or Born-Oppenheimer type adiabatic dynamics. Thanks to the time dependence, each of the step-by-step structural changes or intermittent collective fluctuations is clearly identified, which are often keys to provoking a drastic structural transformation but are easily masked in the standard PCA. The time dependence also allows for reoptimization of the principal components (PCs) according to the structural development, which can be exploited for enhanced sampling in MD simulations. The present approach is applied to phase transitions of a water model and conformational changes of a coarse-grained protein model. In the former, collective dynamics associated with the dihedral-motion in the tetrahedral network structure is found to play a key role in crystallization. In the latter, various conformations of the protein model were successfully sampled by enhancing structural fluctuation along the periodically optimized PC. Both applications clearly demonstrate the virtue of the new approach, which we refer to as time-dependent PCA.

摘要

在对实验和模拟数据进行全面解释时,需要进行系统的降维。主成分分析(PCA)是一种广泛用于降低分子动力学(MD)轨迹维度的技术,有助于我们理解 MD 模拟数据。在这里,我们提出了一种将时间相关性纳入 PCA 算法的方法。在标准 PCA 中,通过对角化协方差矩阵得到的特征向量与时间无关。相比之下,在我们的新方法中,它们是时间的函数,并且它们的时间演化是在 Car-Parrinello 或 Born-Oppenheimer 型绝热动力学的框架内实现的。由于时间相关性,可以清楚地识别出逐步的结构变化或间歇性的集体波动,这些通常是引发剧烈结构转变的关键,但在标准 PCA 中很容易被掩盖。时间相关性还允许根据结构发展重新优化主成分(PC),这可以用于增强 MD 模拟中的采样。本方法应用于水模型的相变和粗粒化蛋白模型的构象变化。在前一种情况下,与四面体网络结构中的二面角运动相关的集体动力学被发现对结晶起着关键作用。在后一种情况下,通过沿周期性优化的 PC 增强结构波动,成功地对蛋白模型的各种构象进行了采样。这两个应用都清楚地展示了新方法的优点,我们称之为时变 PCA。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验