Zhang Yichi, Jindal Muskaan, Viswanath Shruthi, Sitharam Meera
CISE Department, University of Florida, Gainesville 32611-6120, Florida, United States.
National Center for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru 560065, India.
J Chem Inf Model. 2025 May 12;65(9):4576-4592. doi: 10.1021/acs.jcim.4c02412. Epub 2025 Apr 29.
The structures of protein complexes allow us to understand and modulate the biological functions of the proteins. Integrative docking is a computational method to obtain the structures of a protein complex, given the atomic structures of the constituent proteins along with other experimental data on the complex, such as chemical cross-links or SAXS profiles. Here, we develop a new discrete geometry-based method, wall-EASAL, for integrative rigid docking of protein pairs given the structures of the constituent proteins and chemical cross-links. The method is an adaptation of efficient atlasing and search of assembly landscapes (EASAL), a state-of-the-art discrete geometry method for efficient and exhaustive sampling of macromolecular configurations under pairwise intermolecular distance constraints. We provide a mathematical proof that the method finds a structure satisfying the cross-link constraints under a natural condition satisfied by energy landscapes. We compare wall-EASAL with integrative modeling platform (IMP), a commonly used integrative modeling method, on a benchmark, varying the numbers, types, and sources of input cross-links, and sources of monomer structures. The wall-EASAL method performs similarly to IMP in terms of the average satisfaction of the configurations to the input cross-links and the average similarity of the configurations to their corresponding native structures. But wall-EASAL is more efficient than IMP and more robust against false positive cross-links in the context of binary integrative rigid docking. Although the current study uses cross-links, the method is general and any source of distance constraints can be used for integrative docking with wall-EASAL. However, the current implementation only supports binary rigid protein docking, i.e., assumes that the monomer structures are known and remain rigid. Additionally, the current implementation is deterministic, i.e., it does not account for some uncertainties in the cross-linking data, such as noise in the cross-link distances. Neither of these appears to be a theoretical or algorithmic limitation of the EASAL methodology. Structures from wall-EASAL can be incorporated in methods for modeling large macromolecular assemblies, for example by suggesting rigid bodies or restraints for use in these methods. This will facilitate the characterization of assemblies and cellular neighborhoods at increased efficiency, accuracy, and precision. The wall-EASAL method is available at https://bitbucket.org/geoplexity/easal-dev/src/Crosslink and the benchmark is available at https://github.com/isblab/Integrative_docking_benchmark.
蛋白质复合物的结构使我们能够理解和调节蛋白质的生物学功能。整合对接是一种计算方法,在已知组成蛋白质的原子结构以及关于复合物的其他实验数据(如化学交联或小角X射线散射谱)的情况下,获得蛋白质复合物的结构。在此,我们开发了一种新的基于离散几何的方法wall-EASAL,用于在已知组成蛋白质结构和化学交联的情况下对蛋白质对进行整合刚性对接。该方法是对高效图谱绘制和装配景观搜索(EASAL)的改进,EASAL是一种先进的离散几何方法,用于在分子间成对距离约束下对大分子构型进行高效且详尽的采样。我们提供了一个数学证明,即在能量景观满足的自然条件下,该方法能找到满足交联约束的结构。我们在一个基准测试中,改变输入交联的数量、类型和来源以及单体结构的来源,将wall-EASAL与常用的整合建模方法整合建模平台(IMP)进行比较。在构型对输入交联的平均满足度以及构型与其相应天然结构的平均相似度方面,wall-EASAL方法与IMP表现相似。但在二元整合刚性对接的情况下,wall-EASAL比IMP更高效,并且对假阳性交联更具鲁棒性。尽管当前研究使用了交联,但该方法具有通用性,任何距离约束源都可用于与wall-EASAL进行整合对接。然而,当前实现仅支持二元刚性蛋白质对接,即假设单体结构已知且保持刚性。此外,当前实现是确定性的,即它没有考虑交联数据中的一些不确定性,如交联距离中的噪声。这些似乎都不是EASAL方法的理论或算法限制。来自wall-EASAL的结构可纳入用于建模大型大分子装配体的方法中,例如通过为这些方法建议刚体或约束条件。这将有助于以更高的效率、准确性和精度来表征装配体和细胞邻域。wall-EASAL方法可在https://bitbucket.org/geoplexity/easal-dev/src/Crosslink获取,基准测试可在https://github.com/isblab/Integrative_docking_benchmark获取。