Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.
Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; College of Mathematics and Informatics, South China Agricultural University, Guangzhou, 510642, China.
Methods. 2024 Sep;229:163-174. doi: 10.1016/j.ymeth.2024.06.011. Epub 2024 Jul 6.
Molecular dynamics simulation is a crucial research domain within the life sciences, focusing on comprehending the mechanisms of biomolecular interactions at atomic scales. Protein simulation, as a critical subfield, often utilizes MD for implementation, with trajectory data play a pivotal role in drug discovery. The advancement of high-performance computing and deep learning technology becomes popular and critical to predict protein properties from vast trajectory data, posing challenges regarding data features extraction from the complicated simulation data and dimensionality reduction. Simultaneously, it is essential to provide a meaningful explanation of the biological mechanism behind dimensionality. To tackle this challenge, we propose a new unsupervised model named RevGraphVAMP to intelligently analyze the simulation trajectory. This model is based on the variational approach for Markov processes (VAMP) and integrates graph convolutional neural networks and physical constraint optimization to enhance the learning performance. Additionally, we introduce attention mechanism to assess the importance of key interaction region, facilitating the interpretation of molecular mechanism. In comparison to other VAMPNets models, our model showcases competitive performance, improved accuracy in state transition prediction, as demonstrated through its application to two public datasets and the Shank3-Rap1 complex, which is associated with autism spectrum disorder. Moreover, it enhanced dimensionality reduction discrimination across different substates and provides interpretable results for protein structural characterization.
分子动力学模拟是生命科学中的一个重要研究领域,专注于理解生物分子相互作用的原子尺度机制。蛋白质模拟作为一个关键的子领域,通常利用 MD 进行实现,轨迹数据在药物发现中起着关键作用。高性能计算和深度学习技术的发展变得越来越流行和关键,能够从大量的轨迹数据中预测蛋白质的性质,这就提出了从复杂的模拟数据中提取数据特征和降维的挑战。同时,提供对维度背后的生物学机制的有意义的解释也是至关重要的。为了应对这一挑战,我们提出了一种名为 RevGraphVAMP 的新无监督模型,用于智能分析模拟轨迹。该模型基于马尔可夫过程的变分方法(VAMP),并集成了图卷积神经网络和物理约束优化,以提高学习性能。此外,我们引入了注意力机制来评估关键相互作用区域的重要性,有助于解释分子机制。与其他 VAMPNets 模型相比,我们的模型在状态转移预测方面表现出了有竞争力的性能,提高了准确性,这在两个公共数据集和与自闭症谱系障碍相关的 Shank3-Rap1 复合物上的应用中得到了验证。此外,它增强了不同亚状态之间的降维区分,并为蛋白质结构特征提供了可解释的结果。