Department of Cell Biology, Microbiology and Molecular Biology, University of South Florida, Tampa, Florida 33620, USA.
J Chem Phys. 2018 Jun 28;148(24):241726. doi: 10.1063/1.5022469.
Many proteins are regulated by dynamic allostery wherein regulator-induced changes in structure are comparable with thermal fluctuations. Consequently, understanding their mechanisms requires assessment of relationships between and within conformational ensembles of different states. Here we show how machine learning based approaches can be used to simplify this high-dimensional data mining task and also obtain mechanistic insight. In particular, we use these approaches to investigate two fundamental questions in dynamic allostery. First, how do regulators modify inter-site correlations in conformational fluctuations (C)? Second, how are regulator-induced shifts in conformational ensembles at two different sites in a protein related to each other? We address these questions in the context of the human protein tyrosine phosphatase 1E's PDZ2 domain, which is a model protein for studying dynamic allostery. We use molecular dynamics to generate conformational ensembles of the PDZ2 domain in both the regulator-bound and regulator-free states. The employed protocol reproduces methyl deuterium order parameters from NMR. Results from unsupervised clustering of C combined with flow analyses of weighted graphs of C show that regulator binding significantly alters the global signaling network in the protein; however, not by altering the spatial arrangement of strongly interacting amino acid clusters but by modifying the connectivity between clusters. Additionally, we find that regulator-induced shifts in conformational ensembles, which we evaluate by repartitioning ensembles using supervised learning, are, in fact, correlated. This correlation Δ is less extensive compared to C, but in contrast to C, Δ depends inversely on the distance from the regulator binding site. Assuming that Δ is an indicator of the transduction of the regulatory signal leads to the conclusion that the regulatory signal weakens with distance from the regulatory site. Overall, this work provides new approaches to analyze high-dimensional molecular simulation data and also presents applications that yield new insight into dynamic allostery.
许多蛋白质受到动态变构调节,其中结构的调节剂诱导变化与热波动相当。因此,理解它们的机制需要评估不同状态构象集合之间和内部的关系。在这里,我们展示了如何使用基于机器学习的方法来简化这项高维数据挖掘任务,并获得机制上的见解。特别是,我们使用这些方法来研究动态变构中的两个基本问题。首先,调节剂如何改变构象波动(C)中的站点间相关性?其次,蛋白质中两个不同位点的构象集合在调节剂诱导下的位移如何相互关联?我们在人蛋白酪氨酸磷酸酶 1E 的 PDZ2 结构域的背景下解决了这些问题,该结构域是研究动态变构的模型蛋白。我们使用分子动力学生成 PDZ2 结构域在有调节剂和无调节剂结合状态下的构象集合。所采用的方案再现了 NMR 中的甲基氘序参数。C 的无监督聚类与 C 的加权图的流量分析相结合的结果表明,调节剂结合显著改变了蛋白质中的全局信号网络;然而,不是通过改变强相互作用氨基酸簇的空间排列,而是通过修饰簇之间的连接性来实现。此外,我们发现,通过使用监督学习重新分配集合来评估构象集合的调节剂诱导位移实际上是相关的。与 C 相比,这种相关性 Δ 不太广泛,但与 C 相反,Δ 与距调节剂结合位点的距离成反比。假设 Δ 是调节信号转导的指标,会导致这样的结论:随着与调节位点的距离增加,调节信号会减弱。总的来说,这项工作为分析高维分子模拟数据提供了新方法,并提供了对动态变构的新见解。