HorA网络服务器，用于利用序列和结构相似性推断蛋白质之间的同源性。

HorA web server to infer homology between proteins using sequence and structural similarity.

作者信息

Kim Bong-Hyun, Cheng Hua, Grishin Nick V

机构信息

Department of Biochemistry, University of Texas, Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA.

出版信息

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W532-8. doi: 10.1093/nar/gkp328. Epub 2009 May 5.

DOI:10.1093/nar/gkp328

PMID:19417074

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2703895/

Abstract

The biological properties of proteins are often gleaned through comparative analysis of evolutionary relatives. Although protein structure similarity search methods detect more distant homologs than purely sequence-based methods, structural resemblance can result from either homology (common ancestry) or analogy (similarity without common ancestry). While many existing web servers detect structural neighbors, they do not explicitly address the question of homology versus analogy. Here, we present a web server named HorA (Homology or Analogy) that identifies likely homologs for a query protein structure. Unlike other servers, HorA combines sequence information from state-of-the-art profile methods with structure information from spatial similarity measures using an advanced computational technique. HorA aims to identify biologically meaningful connections rather than purely 3D-geometric similarities. The HorA method finds approximately 90% of remote homologs defined in the manually curated database SCOP. HorA will be especially useful for finding remote homologs that might be overlooked by other sequence or structural similarity search servers. The HorA server is available at http://prodata.swmed.edu/horaserver.

摘要

蛋白质的生物学特性通常是通过对进化相关物进行比较分析来获取的。尽管蛋白质结构相似性搜索方法比单纯基于序列的方法能检测到更远缘的同源物，但结构相似性可能源于同源性（共同祖先）或类比性（无共同祖先的相似性）。虽然许多现有的网络服务器能检测结构上的邻近物，但它们并未明确解决同源性与类比性的问题。在此，我们展示了一个名为HorA（同源性或类比性）的网络服务器，它能为查询的蛋白质结构识别可能的同源物。与其他服务器不同，HorA使用先进的计算技术，将来自最新概况方法的序列信息与来自空间相似性度量的结构信息相结合。HorA旨在识别具有生物学意义的联系，而非单纯的三维几何相似性。HorA方法能找到在人工整理的数据库SCOP中定义的约90%的远缘同源物。HorA对于寻找可能被其他序列或结构相似性搜索服务器忽略的远缘同源物将特别有用。HorA服务器可在http://prodata.swmed.edu/horaserver获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cb6/2703895/c702d7a5be90/gkp328f1.jpg

相似文献

HorA web server to infer homology between proteins using sequence and structural similarity.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W532-8. doi: 10.1093/nar/gkp328. Epub 2009 May 5.

PROMALS3D web server for accurate multiple protein sequence and structure alignments.

Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W30-4. doi: 10.1093/nar/gkn322. Epub 2008 May 24.

ProSMoS server: a pattern-based search using interaction matrix representation of protein structures.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W526-31. doi: 10.1093/nar/gkp316. Epub 2009 May 6.

Using homology relations within a database markedly boosts protein sequence similarity search.

Proc Natl Acad Sci U S A. 2015 Jun 2;112(22):7003-8. doi: 10.1073/pnas.1424324112. Epub 2015 May 18.

PROCAIN server for remote protein sequence similarity search.

Bioinformatics. 2009 Aug 15;25(16):2076-7. doi: 10.1093/bioinformatics/btp346. Epub 2009 Jun 3.

fastSCOP: a fast web server for recognizing protein structural domains and SCOP superfamilies.

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W438-43. doi: 10.1093/nar/gkm288. Epub 2007 May 7.

MESSA: MEta-Server for protein Sequence Analysis.

BMC Biol. 2012 Oct 2;10:82. doi: 10.1186/1741-7007-10-82.

COMPASS server for remote homology inference.

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W653-8. doi: 10.1093/nar/gkm293. Epub 2007 May 21.

COMPASS server for homology detection: improved statistical accuracy, speed and functionality.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W90-4. doi: 10.1093/nar/gkp360. Epub 2009 May 12.

CPHmodels-3.0--remote homology modeling using structure-guided sequence profiles.

Nucleic Acids Res. 2010 Jul;38(Web Server issue):W576-81. doi: 10.1093/nar/gkq535. Epub 2010 Jun 11.

引用本文的文献

Structural Basis for Regulation of GPR56/ADGRG1 by Its Alternatively Spliced Extracellular Domains.

Neuron. 2016 Sep 21;91(6):1292-1304. doi: 10.1016/j.neuron.2016.08.022.

A vocabulary of ancient peptides at the origin of folded proteins.

Elife. 2015 Dec 14;4:e09410. doi: 10.7554/eLife.09410.

Manual classification strategies in the ECOD database.

Proteins. 2015 Jul;83(7):1238-51. doi: 10.1002/prot.24818. Epub 2015 May 8.

ECOD: an evolutionary classification of protein domains.

PLoS Comput Biol. 2014 Dec 4;10(12):e1003926. doi: 10.1371/journal.pcbi.1003926. eCollection 2014 Dec.

Conserved evolutionary units in the heme-copper oxidase superfamily revealed by novel homologous protein families.

Protein Sci. 2014 Sep;23(9):1220-34. doi: 10.1002/pro.2503. Epub 2014 Jul 7.

CASP9 target classification.

Proteins. 2011;79 Suppl 10(Suppl 10):21-36. doi: 10.1002/prot.23190. Epub 2011 Oct 14.

iPBA: a tool for protein structure comparison using sequence alignment strategies.

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W18-23. doi: 10.1093/nar/gkr333. Epub 2011 May 17.

Crystal structure of CCM3, a cerebral cavernous malformation protein critical for vascular integrity.

J Biol Chem. 2010 Jul 30;285(31):24099-107. doi: 10.1074/jbc.M110.128470. Epub 2010 May 19.

本文引用的文献

Searching protein structure databases with DaliLite v.3.

Bioinformatics. 2008 Dec 1;24(23):2780-1. doi: 10.1093/bioinformatics/btn507. Epub 2008 Sep 25.

Discrimination between distant homologs and structural analogs: lessons from manually constructed, reliable data sets.

J Mol Biol. 2008 Apr 4;377(4):1265-78. doi: 10.1016/j.jmb.2007.12.076. Epub 2008 Jan 5.

MALIDUP: a database of manually constructed structure alignments for duplicated domain pairs.

Proteins. 2008 Mar;70(4):1162-6. doi: 10.1002/prot.21783.

MALISAM: a database of structurally analogous motifs in proteins.

Nucleic Acids Res. 2008 Jan;36(Database issue):D211-7. doi: 10.1093/nar/gkm698. Epub 2007 Sep 12.

COMPASS server for remote homology inference.

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W653-8. doi: 10.1093/nar/gkm293. Epub 2007 May 21.

Evolutionary genomics of the HAD superfamily: understanding the structural adaptations and catalytic diversity in a superfamily of phosphoesterases and allied enzymes.

J Mol Biol. 2006 Sep 1;361(5):1003-34. doi: 10.1016/j.jmb.2006.06.049. Epub 2006 Jul 7.

Protein structure database search and evolutionary classification.

Nucleic Acids Res. 2006 Aug 2;34(13):3646-59. doi: 10.1093/nar/gkl395. Print 2006.

The HHpred interactive server for protein homology detection and structure prediction.

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W244-8. doi: 10.1093/nar/gki408.

TM-align: a protein structure alignment algorithm based on the TM-score.

Nucleic Acids Res. 2005 Apr 22;33(7):2302-9. doi: 10.1093/nar/gki524. Print 2005.

FAST: a novel protein structure alignment algorithm.

Proteins. 2005 Feb 15;58(3):618-27. doi: 10.1002/prot.20331.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

HorA网络服务器，用于利用序列和结构相似性推断蛋白质之间的同源性。

HorA web server to infer homology between proteins using sequence and structural similarity.

作者信息

Kim Bong-Hyun, Cheng Hua, Grishin Nick V

机构信息

Department of Biochemistry, University of Texas, Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA.

出版信息

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W532-8. doi: 10.1093/nar/gkp328. Epub 2009 May 5.

DOI:10.1093/nar/gkp328

PMID:19417074

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2703895/

Abstract

摘要

HorA网络服务器，用于利用序列和结构相似性推断蛋白质之间的同源性。

HorA web server to infer homology between proteins using sequence and structural similarity.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

HorA网络服务器，用于利用序列和结构相似性推断蛋白质之间的同源性。

HorA web server to infer homology between proteins using sequence and structural similarity.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献