Anishchenko Ivan, Kundrotas Petras J, Tuzikov Alexander V, Vakser Ilya A
Center for Bioinformatics, The University of Kansas, Lawrence, Kansas, 66047; United Institute of Informatics Problems, National Academy of Sciences, 220012, Minsk, Belarus.
Proteins. 2014 Feb;82(2):278-87. doi: 10.1002/prot.24385. Epub 2013 Oct 17.
Characterization of life processes at the molecular level requires structural details of protein-protein interactions (PPIs). The number of experimentally determined protein structures accounts only for a fraction of known proteins. This gap has to be bridged by modeling, typically using experimentally determined structures as templates to model related proteins. The fraction of experimentally determined PPI structures is even smaller than that for the individual proteins, due to a larger number of interactions than the number of individual proteins, and a greater difficulty of crystallizing protein-protein complexes. The approaches to structural modeling of PPI (docking) often have to rely on modeled structures of the interactors, especially in the case of large PPI networks. Structures of modeled proteins are typically less accurate than the ones determined by X-ray crystallography or nuclear magnetic resonance. Thus the utility of approaches to dock these structures should be assessed by thorough benchmarking, specifically designed for protein models. To be credible, such benchmarking has to be based on carefully curated sets of structures with levels of distortion typical for modeled proteins. This article presents such a suite of models built for the benchmark set of the X-ray structures from the Dockground resource (http://dockground.bioinformatics.ku.edu) by a combination of homology modeling and Nudged Elastic Band method. For each monomer, six models were generated with predefined C(α) root mean square deviation from the native structure (1, 2, …, 6 Å). The sets and the accompanying data provide a comprehensive resource for the development of docking methodology for modeled proteins.
在分子水平上表征生命过程需要蛋白质-蛋白质相互作用(PPI)的结构细节。实验确定的蛋白质结构数量仅占已知蛋白质的一小部分。这个差距必须通过建模来弥补,通常使用实验确定的结构作为模板来对相关蛋白质进行建模。由于相互作用的数量多于单个蛋白质的数量,并且蛋白质-蛋白质复合物的结晶难度更大,实验确定的PPI结构的比例甚至比单个蛋白质的比例还要小。PPI结构建模(对接)方法通常不得不依赖于相互作用蛋白的建模结构,尤其是在大型PPI网络的情况下。建模蛋白质的结构通常不如通过X射线晶体学或核磁共振确定的结构准确。因此,对接这些结构的方法的效用应该通过专门为蛋白质模型设计的全面基准测试来评估。为了可信,这样的基准测试必须基于精心策划的结构集,这些结构具有建模蛋白质典型的扭曲水平。本文介绍了通过同源建模和推挤弹性带方法为来自Dockground资源(http://dockground.bioinformatics.ku.edu)的X射线结构基准集构建的这样一套模型。对于每个单体,生成了六个与天然结构具有预定义C(α)均方根偏差(1、2、…、6 Å)的模型。这些模型集和随附数据为开发针对建模蛋白质的对接方法提供了全面的资源。