Department of Biochemistry and Molecular Biophysics, Columbia University, 650 West 168th Street, New York, New York 10032, United States.
Department of Physics, University of Wisconsin Milwaukee, 3135 North Maryland Avenue, Milwaukee, Wisconsin 53211, United States.
J Chem Inf Model. 2020 May 26;60(5):2484-2491. doi: 10.1021/acs.jcim.9b01115. Epub 2020 Apr 2.
Recent approaches to the study of biological molecules employ manifold learning to single-particle cryo-EM data sets to map the continuum of states of a molecule into a low-dimensional space spanned by eigenvectors or "conformational coordinates". This is done separately for each projection direction (PD) on an angular grid. One important step in deriving a consolidated map of occupancies, from which the free energy landscape of the molecule can be derived, is to propagate the conformational coordinates from a given choice of "anchor PD" across the entire angular space. Even when one eigenvector dominates, its sign might invert from one PD to the next. The propagation of the second eigenvector is particularly challenging when eigenvalues of the second and third eigenvector are closely matched, leading to occasional inversions in their ranking as we move across the angular grid. In the absence of a computational approach, this propagation across the angular space has been done thus far "by hand" using visual clues, thus greatly limiting the general use of the technique. In this work we have developed a method that is able to solve the propagation problem computationally, by using optical flow and a probabilistic graphical model. We demonstrate its utility by selected examples.
最近研究生物分子的方法采用多重学习来对单颗粒冷冻电镜数据集进行分析,以便将分子的连续状态映射到由特征向量或“构象坐标”所张成的低维空间中。这是在角网格的每个角度方向 (PD) 上分别进行的。从占据物的综合图谱中得出分子的自由能景观的一个重要步骤是从给定的“锚定 PD”传播构象坐标到整个角度空间。即使主导一个特征向量,它的符号也可能从一个 PD 到下一个 PD 反转。当第二个和第三个特征向量的特征值非常匹配时,第二个特征向量的传播特别具有挑战性,这导致在我们穿过角度网格时它们的排名偶尔会反转。在没有计算方法的情况下,到目前为止,这种在角度空间中的传播一直是通过使用视觉线索“手动”完成的,因此极大地限制了该技术的广泛使用。在这项工作中,我们开发了一种能够通过光流和概率图形模型进行计算求解传播问题的方法。我们通过选择的例子展示了它的实用性。