Zhang Yichi, Jindal Muskaan, Viswanath Shruthi, Sitharam Meera
CISE Department, University of Florida, Gainesville, Florida 32611-6120, United States.
National Center for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru 560065, India.
bioRxiv. 2025 Jan 7:2024.10.24.619977. doi: 10.1101/2024.10.24.619977.
The structures of protein complexes allow us to understand and modulate the biological functions of the proteins. Integrative docking is a computational method to obtain the structures of a protein complex, given the atomic structures of the constituent proteins along with other experimental data on the complex, such as chemical crosslinks or SAXS profiles. Here, we develop a new discrete geometry-based method, wall-EASAL, for integrative rigid docking of protein pairs given the structures of the constituent proteins and chemical crosslinks. The method is an adaptation of EASAL (Efficient Atlasing and Search of Assembly Landscapes), a state-of-the-art discrete geometry method for efficient and exhaustive sampling of macromolecular configurations under pairwise inter-molecular distance constraints. We provide a mathematical proof that the method finds a structure satisfying the crosslink constraints under a natural condition satisfied by energy landscapes. We compare wall-EASAL with IMP (Integrative Modeling Platform), a commonly used integrative modeling method, on a benchmark, varying the numbers, types, and sources of input crosslinks, and sources of monomer structures. The wall-EASAL method performs better than IMP in terms of the average satisfaction of the configurations to the input crosslinks and the average similarity of the configurations to their corresponding native structures. The ensembles from IMP exhibit greater variability in these two measures. Further, wall-EASAL is more efficient than IMP. Although the current study uses crosslinks, the method is general and any source of distance constraints can be used for integrative docking with wall-EASAL. However, the current implementation only supports binary rigid protein docking, ., assumes that the monomer structures are known and remain rigid. Additionally, the current implementation is deterministic, ., it does not account for uncertainties in the crosslinking data beyond using distance bounds. Neither of these appears to be a theoretical or algorithmic limitation of the EASAL methodology. Structures from wall-EASAL can be incorporated in methods for modeling large macromolecular assemblies, for example by suggesting rigid bodies or restraints for use in these methods. This will facilitate the characterization of assemblies and cellular neighborhoods at increased efficiency, accuracy, and precision. The wall-EASAL method is available at https://bitbucket.org/geoplexity/easal-dev/src/Crosslink and the benchmark is available at https://github.com/isblab/Integrative_docking_benchmark.
蛋白质复合物的结构使我们能够理解并调节蛋白质的生物学功能。整合对接是一种计算方法,在已知组成蛋白质的原子结构以及关于该复合物的其他实验数据(如化学交联或小角X射线散射谱)的情况下,用于获得蛋白质复合物的结构。在此,我们开发了一种新的基于离散几何的方法wall-EASAL,用于在已知组成蛋白质结构和化学交联的情况下对蛋白质对进行整合刚性对接。该方法是对EASAL(高效图谱绘制与装配景观搜索)的一种改编,EASAL是一种先进的离散几何方法,用于在分子间距离约束下对大分子构型进行高效且详尽的采样。我们提供了一个数学证明,即在能量景观满足的自然条件下,该方法能找到一个满足交联约束的结构。我们在一个基准测试中,改变输入交联的数量、类型和来源以及单体结构的来源,将wall-EASAL与常用的整合建模方法IMP(整合建模平台)进行比较。在构型对输入交联的平均满足度以及构型与其相应天然结构的平均相似度方面,wall-EASAL方法比IMP表现更好。IMP的集合在这两种度量上表现出更大的变异性。此外,wall-EASAL比IMP更高效。尽管当前研究使用了交联,但该方法具有通用性,任何距离约束源都可用于与wall-EASAL进行整合对接。然而,当前实现仅支持二元刚性蛋白质对接,即假设单体结构已知且保持刚性。此外,当前实现是确定性的,即除了使用距离界限外,它没有考虑交联数据中的不确定性。这两者似乎都不是EASAL方法的理论或算法限制。来自wall-EASAL的结构可纳入用于对大型大分子组装体进行建模的方法中,例如通过为这些方法建议刚体或约束。这将有助于以更高的效率、准确性和精度对组装体和细胞邻域进行表征。wall-EASAL方法可在https://bitbucket.org/geoplexity/easal-dev/src/Crosslink获取,基准测试可在https://github.com/isblab/Integrative_docking_benchmark获取。