Faculty of Science/Chemistry, Bijvoet Center for Biomolecular Research, Utrecht University, Utrecht, 3584CH, The Netherlands.
Proteins. 2013 Dec;81(12):2119-28. doi: 10.1002/prot.24382. Epub 2013 Oct 17.
Information-driven docking is currently one of the most successful approaches to obtain structural models of protein interactions as demonstrated in the latest round of CAPRI. While various experimental and computational techniques can be used to retrieve information about the binding mode, the availability of three-dimensional structures of the interacting partners remains a limiting factor. Fortunately, the wealth of structural information gathered by large-scale initiatives allows for homology-based modeling of a significant fraction of the protein universe. Defining the limits of information-driven docking based on such homology models is therefore highly relevant. Here we show, using previous CAPRI targets, that out of a variety of measures, the global sequence identity between template and target is a simple but reliable predictor of the achievable quality of the docking models. This indicates that a well-defined overall fold is critical for the interaction. Furthermore, the quality of the data at our disposal to characterize the interaction plays a determinant role in the success of the docking. Given reliable interface information we can obtain acceptable predictions even at low global sequence identity. These results, which define the boundaries between trustworthy and unreliable predictions, should guide both experts and nonexperts in defining the limits of what is achievable by docking. This is highly relevant considering that the fraction of the interactome amenable for docking is only bound to grow as the number of experimentally solved structures increases.
信息驱动对接目前是获得蛋白质相互作用结构模型最成功的方法之一,这在最近一轮的 CAPRI 中得到了验证。虽然可以使用各种实验和计算技术来检索关于结合模式的信息,但相互作用伙伴的三维结构的可用性仍然是一个限制因素。幸运的是,大规模计划收集的大量结构信息允许对蛋白质宇宙的很大一部分进行基于同源建模。因此,基于这些同源模型定义信息驱动对接的限制是非常相关的。在这里,我们使用以前的 CAPRI 靶标表明,在各种测量中,模板和靶标之间的全局序列同一性是可实现对接模型质量的简单但可靠的预测因子。这表明明确的整体折叠对于相互作用至关重要。此外,我们用来描述相互作用的数据质量在对接的成功中起着决定性的作用。给定可靠的接口信息,即使在全局序列同一性较低的情况下,我们也可以获得可接受的预测。这些定义可信赖和不可靠预测之间界限的结果,应该指导专家和非专家确定通过对接可实现的目标的限制。考虑到可用于对接的相互作用的分数随着实验解决结构数量的增加而只能增长,这一点非常重要。