Victor Paul Raj Fredrick Robin Devadoss, Exner Thomas E
Department of Chemistry, Konstanz Research School Chemical Biology, and Zukunftskolleg, University of Konstanz, 78457, Konstanz, Germany,
J Mol Model. 2014 Apr;20(4):2196. doi: 10.1007/s00894-014-2196-6. Epub 2014 Apr 12.
Given the increasing complexity of simulated molecular systems, and the fact that simulation times have now reached milliseconds to seconds, immense amounts of data (in the gigabyte to terabyte range) are produced in current molecular dynamics simulations. Manual analysis of these data is a very time-consuming task, and important events that lead from one intermediate structure to another can become occluded in the noise resulting from random thermal fluctuations. To overcome these problems and facilitate a semi-automated data analysis, we introduce in this work a measure based on C(α) torsion angles: torsion angles formed by four consecutive C(α) atoms. This measure describes changes in the backbones of large systems on a residual length scale (i.e., a small number of residues at a time). Cluster analysis of individual C(α) torsion angles and its fuzzification led to continuous time patches representing (meta)stable conformations and to the identification of events acting as transitions between these conformations. The importance of a change in torsion angle to structural integrity is assessed by comparing this change to the average fluctuations in the same torsion angle over the complete simulation. Using this novel measure in combination with other measures such as the root mean square deviation (RMSD) and time series of distance measures, we performed an in-depth analysis of a simulation of the open form of DNA polymerase I. The times at which major conformational changes occur and the most important parts of the molecule and their interrelations were pinpointed in this analysis. The simultaneous determination of the time points and localizations of major events is a significant advantage of the new bottom-up approach presented here, as compared to many other (top-down) approaches in which only the similarity of the complete structure is analyzed.
鉴于模拟分子系统日益复杂,且模拟时间现已达到毫秒至秒级,当前的分子动力学模拟会产生海量数据(范围从千兆字节到太字节)。对这些数据进行人工分析是一项非常耗时的任务,并且从一个中间结构到另一个中间结构的重要事件可能会被随机热涨落产生的噪声所掩盖。为了克服这些问题并促进半自动数据分析,我们在这项工作中引入了一种基于C(α)扭转角的度量:由四个连续的C(α)原子形成的扭转角。该度量在残基长度尺度上(即一次少量残基)描述了大型系统主链的变化。对单个C(α)扭转角进行聚类分析及其模糊化处理,得到了代表(亚)稳定构象的连续时间片段,并确定了作为这些构象之间转变的事件。通过将扭转角的变化与整个模拟过程中相同扭转角的平均涨落进行比较,评估扭转角变化对结构完整性的重要性。结合使用这种新度量与其他度量,如均方根偏差(RMSD)和距离度量的时间序列,我们对DNA聚合酶I开放形式的模拟进行了深入分析。在该分析中确定了主要构象变化发生的时间以及分子中最重要的部分及其相互关系。与许多其他仅分析完整结构相似性的(自上而下)方法相比,同时确定主要事件的时间点和定位是这里提出的新的自下而上方法的一个显著优势。