Department of Physics and the Center for Molecular Study of Condensed Soft Matter, Illinois Institute of Technology, Chicago, Illinois.
National Institute of Standards and Technology Center for Neutron Research, National Institute of Standards and Technology, Gaithersburg, Maryland.
Biophys J. 2019 Aug 6;117(3):399-407. doi: 10.1016/j.bpj.2019.06.024. Epub 2019 Jul 18.
Many biomolecular complexes exist in a flexible ensemble of states in solution that is necessary to perform their biological function. Small-angle scattering (SAS) measurements are a popular method for characterizing these flexible molecules because of their relative ease of use and their ability to simultaneously probe the full ensemble of states. However, SAS data is typically low dimensional and difficult to interpret without the assistance of additional structural models. In theory, experimental SAS curves can be reconstituted from a linear combination of theoretical models, although this procedure carries a significant risk of overfitting the inherently low-dimensional SAS data. Previously, we developed a Bayesian-based method for fitting ensembles of model structures to experimental SAS data that rigorously avoids overfitting. However, we have found that these methods can be difficult to incorporate into typical SAS modeling workflows, especially for users that are not experts in computational modeling. To this end, we present the Bayesian Ensemble Estimation from SAS (BEES) program. Two forks of BEES are available, the primary one existing as a module for the SASSIE web server and a developmental version that is a stand-alone Python program. BEES allows users to exhaustively sample ensemble models constructed from a library of theoretical states and to interactively analyze and compare each model's performance. The fitting routine also allows for secondary data sets to be supplied, thereby simultaneously fitting models to both SAS data as well as orthogonal information. The flexible ensemble of K63-linked ubiquitin trimers is presented as an example of BEES' capabilities.
许多生物分子复合物以溶液中灵活的状态集合存在,这是它们执行生物功能所必需的。小角散射(SAS)测量是一种用于表征这些灵活分子的流行方法,因为它们相对易于使用,并且能够同时探测整个状态集合。然而,SAS 数据通常是低维的,如果没有额外的结构模型的帮助,就很难解释。从理论上讲,可以通过理论模型的线性组合来重建实验 SAS 曲线,尽管这种方法存在严重的过度拟合固有低维 SAS 数据的风险。以前,我们开发了一种基于贝叶斯的方法,用于将模型结构的集合拟合到实验 SAS 数据,该方法严格避免过度拟合。然而,我们发现这些方法可能难以纳入典型的 SAS 建模工作流程,尤其是对于非计算建模专家的用户。为此,我们提出了贝叶斯 SAS 模型中的集合估计(BEES)程序。BEES 有两个分支,主要分支作为 SASSIE 网络服务器的一个模块存在,开发版本是一个独立的 Python 程序。BEES 允许用户从理论状态库中彻底采样集合模型,并交互分析和比较每个模型的性能。拟合例程还允许提供辅助数据集,从而同时将模型拟合到 SAS 数据和正交信息。K63 连接的泛素三聚体的灵活集合就是 BEES 能力的一个示例。