Manookian Babgen, Mukhaleva Elizaveta, Gogoshin Grigoriy, Bhattacharya Supriyo, Vaidehi Nagarajan, Rodin Andrei S, Branciamore Sergio
Department of Computational and Quantitative Medicine, Beckman Research Institute of the City of Hope, Duarte, CA.
Irell and Manella Graduate School of Biological Sciences, Beckman Research Institute of the City of Hope, Duarte, CA.
bioRxiv. 2025 Aug 4:2025.08.04.668534. doi: 10.1101/2025.08.04.668534.
Conformational transitions are central to protein function, yet their mechanistic analysis remains challenging due to the multi-dimensionality and timescales underlying the molecular motions. While interpretable network models such as Bayesian networks have advanced the identification of key residue interactions in molecular dynamics (MD) data, they lack temporal resolution and cannot capture the sequence of events during transitions. Here, we introduce ynamically esolved niversal odel for aysin network racking or DRUMBEAT, a machine learning approach that combines a universal graph topology with sliding-window rescoring to generate interpretable, time-resolved maps of cooperative events in MD trajectories. Applying DRUMBEAT to the benchmark Fip35 WW domain folding trajectories from DE Shaw Research Group, we recover both major folding pathways and critical residues previously highlighted by experiment. Importantly, DRUMBEAT provides new insight in two ways: (1) uncover unknown protein features important for transition, and (2) dissect the order and timing of conformational changes, revealing the precise sequence of residue contact closures during individual folding events. Robustness analysis demonstrates that both the universal graph and time-resolved results are highly consistent across multiple sampling replicates. These findings establish DRUMBEAT as a scalable and interpretable machine learning framework for dissecting the dynamics of protein folding and other conformational transitions, offering a generalizable tool for the mechanistic study of biomolecular dynamics.
构象转变是蛋白质功能的核心,但由于分子运动背后的多维性和时间尺度,对其进行机理分析仍然具有挑战性。虽然诸如贝叶斯网络等可解释的网络模型推动了分子动力学(MD)数据中关键残基相互作用的识别,但它们缺乏时间分辨率,无法捕捉转变过程中的事件序列。在此,我们引入了用于贝叶斯网络追踪的动态解析通用模型(DRUMBEAT),这是一种机器学习方法,它将通用图拓扑与滑动窗口重新评分相结合,以生成MD轨迹中协同事件的可解释、时间分辨图谱。将DRUMBEAT应用于德劭研究集团的基准Fip35 WW结构域折叠轨迹,我们恢复了先前实验所突出的主要折叠途径和关键残基。重要的是,DRUMBEAT在两个方面提供了新的见解:(1)揭示对转变重要的未知蛋白质特征,以及(2)剖析构象变化的顺序和时间,揭示单个折叠事件中残基接触闭合的精确序列。稳健性分析表明,通用图和时间分辨结果在多个采样重复中高度一致。这些发现确立了DRUMBEAT作为一个可扩展且可解释的机器学习框架,用于剖析蛋白质折叠和其他构象转变的动力学,为生物分子动力学的机理研究提供了一个可推广的工具。