Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, 2629 HZ Delft, The Netherlands.
Science and Technology Facilities Council, Research Complex at Harwell, Oxon OX11 0FA, United Kingdom.
IUCrJ. 2024 Nov 1;11(Pt 6):951-965. doi: 10.1107/S2052252524009321.
Conformational heterogeneity of biological macromolecules is a challenge in single-particle averaging (SPA). Current standard practice is to employ classification and filtering methods that may allow a discrete number of conformational states to be reconstructed. However, the conformation space accessible to these molecules is continuous and, therefore, explored incompletely by a small number of discrete classes. Recently developed heterogeneous reconstruction algorithms (HRAs) to analyse continuous heterogeneity rely on machine-learning methods that employ low-dimensional latent space representations. The non-linear nature of many of these methods poses a challenge to their validation and interpretation and to identifying functionally relevant conformational trajectories. These methods would benefit from in-depth benchmarking using high-quality synthetic data and concomitant ground truth information. We present a framework for the simulation and subsequent analysis with respect to the ground truth of cryo-EM micrographs containing particles whose conformational heterogeneity is sourced from molecular dynamics simulations. These synthetic data can be processed as if they were experimental data, allowing aspects of standard SPA workflows as well as heterogeneous reconstruction methods to be compared with known ground truth using available utilities. The simulation and analysis of several such datasets are demonstrated and an initial investigation into HRAs is presented.
生物大分子的构象异质性是单颗粒平均(SPA)的一个挑战。目前的标准做法是采用分类和过滤方法,这些方法可能允许重建离散数量的构象状态。然而,这些分子可达到的构象空间是连续的,因此,通过少量离散类无法完全探索。最近开发的用于分析连续异质性的异构重建算法(HRA)依赖于机器学习方法,这些方法采用低维潜在空间表示。这些方法中的许多方法的非线性性质对其验证和解释以及识别功能相关构象轨迹构成了挑战。这些方法将受益于使用高质量合成数据和伴随的地面实况信息进行深入基准测试。我们提出了一个框架,用于模拟和随后分析包含源自分子动力学模拟的构象异质性的粒子的冷冻电子显微镜显微照片的地面实况。这些合成数据可以像实验数据一样进行处理,从而可以使用可用的实用程序将标准 SPA 工作流程的各个方面以及异构重建方法与已知的地面实况进行比较。演示了几个这样的数据集的模拟和分析,并提出了对 HRA 的初步研究。