Lidbrink Samuel Eriksson, Howard Rebecca J, Haloi Nandan, Lindahl Erik
Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna, Sweden.
Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, Solna, Sweden.
PLoS Comput Biol. 2025 Jun 27;21(6):e1013187. doi: 10.1371/journal.pcbi.1013187. eCollection 2025 Jun.
The function of a protein is enabled by its conformational landscape. For non-rigid proteins, a complete characterization of this landscape requires understanding the protein's structure in all functional states, the stability of these states under target conditions, and the transition pathways between them. Several strategies have recently been developed to drive the machine learning algorithm AlphaFold2 (AF) to sample multiple conformations, but it is more challenging to a priori predict what states are stabilized in particular conditions and how the transition occurs. Here, we combine AF sampling with small-angle scattering curves to obtain a weighted conformational ensemble of functional states under target environmental conditions. We apply this to the pentameric ion channel GLIC using small-angle neutron scattering (SANS) curves, and identify apparent closed and open states. By comparing experimental SANS data under resting and activating conditions, we can quantify the subpopulation of closed channels that open upon activation, matching both experiments and extensive simulation sampling using Markov state models. The predicted closed and open states closely resemble crystal structures determined under resting and activating conditions respectively, and project to predicted basins in free energy landscapes calculated from the Markov state models. Further, without using any structural information, the AF sampling also correctly captures intermediate conformations and projects onto the transition pathway resolved in the extensive sampling. This combination of machine learning algorithms and low-dimensional experimental data appears to provide an efficient way to predict not only stable conformations but also accurately sample the transition pathways several orders of magnitude faster than simulation-based sampling.
蛋白质的功能由其构象态势决定。对于非刚性蛋白质而言,要全面表征这种态势,就需要了解蛋白质在所有功能状态下的结构、这些状态在目标条件下的稳定性以及它们之间的转变途径。最近已开发出多种策略来驱动机器学习算法AlphaFold2(AF)对多种构象进行采样,但要先验预测在特定条件下哪些状态会稳定以及转变如何发生则更具挑战性。在此,我们将AF采样与小角散射曲线相结合,以获得目标环境条件下功能状态的加权构象系综。我们利用小角中子散射(SANS)曲线将此方法应用于五聚体离子通道GLIC,并识别出明显的关闭态和开放态。通过比较静息和激活条件下的实验SANS数据,我们可以量化激活时开放的关闭通道亚群,这与实验以及使用马尔可夫状态模型的广泛模拟采样均相符。预测的关闭态和开放态分别与在静息和激活条件下测定的晶体结构非常相似,并投影到由马尔可夫状态模型计算出的自由能景观中的预测盆地。此外,在不使用任何结构信息的情况下,AF采样还能正确捕获中间构象,并投影到在广泛采样中解析出的转变途径上。机器学习算法与低维实验数据的这种结合似乎提供了一种有效的方法,不仅可以预测稳定构象,还能比基于模拟的采样快几个数量级准确地对转变途径进行采样。