通过同源性预测瞬时蛋白质-蛋白质复合物的三维结构。

Predicting 3D structures of transient protein-protein complexes by homology.

作者信息

Kundrotas Petras J, Alexov Emil

机构信息

Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA.

出版信息

Biochim Biophys Acta. 2006 Sep;1764(9):1498-511. doi: 10.1016/j.bbapap.2006.08.002. Epub 2006 Aug 10.

DOI:10.1016/j.bbapap.2006.08.002

PMID:16963323

Abstract

The paper reports a homology based approach for predicting the 3D structures of full length hetero protein complexes. We have created a database of templates that includes structures of hetero protein-protein complexes as well as domain-domain structures (), which allowed us to expand the template pool up to 418 two-chain entries (at 40% sequence identity). Two protocols were tested-a protocol based on position specific Blast search (Protocol-I) and a protocol based on structural similarity of monomers (Protocol-II). All possible combinations of two monomers (350,284 pairs) in the ProtCom database were subjected to both protocols to predict if they form complexes. The predictions were benchmarked against the ProtCom database resulting to false-true positives ratios of approximately 5:1 and approximately 7:1 and recovery of 19% and 86%, respectively for protocols I and II. From 350,284 trials Protocol-I made only approximately 500 wrong predictions resulting to 0.5% error. In addition, though it was shown that artificially created domain-domain structures can in principle be good templates for modeling full length protein complexes, more sensitive methods are needed to detect homology relations. The quality of the models was assessed using two different criteria such as interfacial residues and overall RMSD. It was found that there is no correlation between these two measures. In many cases the interface residues were predicted correctly, but the overall RMSD was over 6 A and vice versa.

摘要

本文报道了一种基于同源性的方法来预测全长异源蛋白质复合物的三维结构。我们创建了一个模板数据库，其中包括异源蛋白质-蛋白质复合物的结构以及结构域-结构域结构（），这使我们能够将模板库扩展到418个双链条目（序列同一性为40%）。测试了两种方案——一种基于位置特异性Blast搜索的方案（方案I）和一种基于单体结构相似性的方案（方案II）。ProtCom数据库中两种单体的所有可能组合（350,284对）都经过这两种方案来预测它们是否形成复合物。预测结果以ProtCom数据库为基准，方案I和方案II的假阳性与真阳性比率分别约为5:1和7:1，回收率分别为19%和86%。在350,284次试验中，方案I只做出了大约500次错误预测，错误率为0.5%。此外，虽然已表明人工创建的结构域-结构域结构原则上可以作为模拟全长蛋白质复合物的良好模板，但需要更灵敏的方法来检测同源关系。使用两种不同的标准（如界面残基和整体均方根偏差）评估模型的质量。发现这两种测量方法之间没有相关性。在许多情况下，界面残基被正确预测，但整体均方根偏差超过6 Å，反之亦然。

相似文献

Predicting 3D structures of transient protein-protein complexes by homology.

Biochim Biophys Acta. 2006 Sep;1764(9):1498-511. doi: 10.1016/j.bbapap.2006.08.002. Epub 2006 Aug 10.

Homology-based modeling of 3D structures of protein-protein complexes using alignments of modified sequence profiles.

Int J Biol Macromol. 2008 Aug 15;43(2):198-208. doi: 10.1016/j.ijbiomac.2008.05.004. Epub 2008 May 21.

PROTCOM: searchable database of protein complexes enhanced with domain-domain structures.

Nucleic Acids Res. 2007 Jan;35(Database issue):D575-9. doi: 10.1093/nar/gkl768. Epub 2006 Oct 28.

MULTIPROSPECTOR: an algorithm for the prediction of protein-protein interactions by multimeric threading.

Proteins. 2002 Nov 15;49(3):350-64. doi: 10.1002/prot.10222.

Protein docking using case-based reasoning.

Proteins. 2013 Dec;81(12):2150-8. doi: 10.1002/prot.24433.

Tertiary structure predictions on a comprehensive benchmark of medium to large size proteins.

Biophys J. 2004 Oct;87(4):2647-55. doi: 10.1529/biophysj.104.045385.

3D-partner: a web server to infer interacting partners and binding models.

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W561-7. doi: 10.1093/nar/gkm346. Epub 2007 May 21.

Domain-based small molecule binding site annotation.

BMC Bioinformatics. 2006 Mar 17;7:152. doi: 10.1186/1471-2105-7-152.

Protein structure prediction based on sequence similarity.

Methods Mol Biol. 2009;569:129-56. doi: 10.1007/978-1-59745-524-4_7.

Benchmarking of dimeric threading and structure refinement.

Proteins. 2006 May 15;63(3):457-65. doi: 10.1002/prot.20878.

引用本文的文献

Structural and Functional Characterization of CreFH1, the Frataxin Homolog from .

Plants (Basel). 2022 Jul 26;11(15):1931. doi: 10.3390/plants11151931.

Plausible blockers of Spike RBD in SARS-CoV2-molecular design and underlying interaction dynamics from high-level structural descriptors.

J Mol Model. 2021 May 31;27(6):191. doi: 10.1007/s00894-021-04779-0.

Template-based structure modeling of protein-protein interactions.

Curr Opin Struct Biol. 2014 Feb;24:10-23. doi: 10.1016/j.sbi.2013.11.005. Epub 2013 Dec 11.

GWIDD: a comprehensive resource for genome-wide structural modeling of protein-protein interactions.

Hum Genomics. 2012 Jul 11;6(1):7. doi: 10.1186/1479-7364-6-7.

Correlation between protein sequence similarity and crystallization reagents in the biological macromolecule crystallization database.

Int J Mol Sci. 2012;13(8):9514-9526. doi: 10.3390/ijms13089514. Epub 2012 Jul 27.

Protein docking by the interface structure similarity: how much structure is needed?

PLoS One. 2012;7(2):e31349. doi: 10.1371/journal.pone.0031349. Epub 2012 Feb 13.

On the role of electrostatics in protein-protein interactions.

Phys Biol. 2011 Jun;8(3):035001. doi: 10.1088/1478-3975/8/3/035001. Epub 2011 May 13.

Study of protein complexes via homology modeling, applied to cysteine proteases and their protein inhibitors.

J Mol Model. 2011 Dec;17(12):3163-72. doi: 10.1007/s00894-011-0990-y. Epub 2011 Mar 2.

GWIDD: Genome-wide protein docking database.

Nucleic Acids Res. 2010 Jan;38(Database issue):D513-7. doi: 10.1093/nar/gkp944. Epub 2009 Nov 9.

SpaK/SpaR two-component system characterized by a structure-driven domain-fusion method and in vitro phosphorylation studies.

PLoS Comput Biol. 2009 Jun;5(6):e1000401. doi: 10.1371/journal.pcbi.1000401. Epub 2009 Jun 5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过同源性预测瞬时蛋白质-蛋白质复合物的三维结构。

Predicting 3D structures of transient protein-protein complexes by homology.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献