Computational Biology Program, The University of Kansas, Lawrence, Kansas, USA.
Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, South Kensington, London, UK.
Proteins. 2020 Sep;88(9):1180-1188. doi: 10.1002/prot.25889. Epub 2020 Mar 20.
Protein docking is essential for structural characterization of protein interactions. Besides providing the structure of protein complexes, modeling of proteins and their complexes is important for understanding the fundamental principles and specific aspects of protein interactions. The accuracy of protein modeling, in general, is still less than that of the experimental approaches. Thus, it is important to investigate the applicability of docking techniques to modeled proteins. We present new comprehensive benchmark sets of protein models for the development and validation of protein docking, as well as a systematic assessment of free and template-based docking techniques on these sets. As opposed to previous studies, the benchmark sets reflect the real case modeling/docking scenario where the accuracy of the models is assessed by the modeling procedure, without reference to the native structure (which would be unknown in practical applications). We also expanded the analysis to include docking of protein pairs where proteins have different structural accuracy. The results show that, in general, the template-based docking is less sensitive to the structural inaccuracies of the models than the free docking. The near-native docking poses generated by the template-based approach, typically, also have higher ranks than those produces by the free docking (although the free docking is indispensable in modeling the multiplicity of protein interactions in a crowded cellular environment). The results show that docking techniques are applicable to protein models in a broad range of modeling accuracy. The study provides clear guidelines for practical applications of docking to protein models.
蛋白质对接对于蛋白质相互作用的结构特征至关重要。除了提供蛋白质复合物的结构外,蛋白质及其复合物的建模对于理解蛋白质相互作用的基本原理和特定方面也很重要。一般来说,蛋白质建模的准确性仍然低于实验方法。因此,研究对接技术在建模蛋白质上的适用性非常重要。我们提出了新的综合蛋白质模型基准集,用于蛋白质对接的开发和验证,以及对这些集合上的自由对接和基于模板的对接技术的系统评估。与以前的研究不同,基准集反映了实际的建模/对接情况,其中模型的准确性是通过建模过程评估的,而不参考(在实际应用中未知的)天然结构。我们还将分析扩展到包括对接具有不同结构准确性的蛋白质对。结果表明,一般来说,基于模板的对接对模型的结构不准确的敏感性低于自由对接。基于模板的方法生成的近天然对接构象通常比自由对接产生的构象具有更高的等级(尽管在模拟拥挤的细胞环境中蛋白质的多种相互作用方面,自由对接是必不可少的)。结果表明,对接技术可广泛应用于具有不同建模精度的蛋白质模型。该研究为对接在蛋白质模型中的实际应用提供了明确的指导。