Kim Bong-Hyun, Cheng Hua, Grishin Nick V
Department of Biochemistry, University of Texas, Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA.
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W532-8. doi: 10.1093/nar/gkp328. Epub 2009 May 5.
The biological properties of proteins are often gleaned through comparative analysis of evolutionary relatives. Although protein structure similarity search methods detect more distant homologs than purely sequence-based methods, structural resemblance can result from either homology (common ancestry) or analogy (similarity without common ancestry). While many existing web servers detect structural neighbors, they do not explicitly address the question of homology versus analogy. Here, we present a web server named HorA (Homology or Analogy) that identifies likely homologs for a query protein structure. Unlike other servers, HorA combines sequence information from state-of-the-art profile methods with structure information from spatial similarity measures using an advanced computational technique. HorA aims to identify biologically meaningful connections rather than purely 3D-geometric similarities. The HorA method finds approximately 90% of remote homologs defined in the manually curated database SCOP. HorA will be especially useful for finding remote homologs that might be overlooked by other sequence or structural similarity search servers. The HorA server is available at http://prodata.swmed.edu/horaserver.
蛋白质的生物学特性通常是通过对进化相关物进行比较分析来获取的。尽管蛋白质结构相似性搜索方法比单纯基于序列的方法能检测到更远缘的同源物,但结构相似性可能源于同源性(共同祖先)或类比性(无共同祖先的相似性)。虽然许多现有的网络服务器能检测结构上的邻近物,但它们并未明确解决同源性与类比性的问题。在此,我们展示了一个名为HorA(同源性或类比性)的网络服务器,它能为查询的蛋白质结构识别可能的同源物。与其他服务器不同,HorA使用先进的计算技术,将来自最新概况方法的序列信息与来自空间相似性度量的结构信息相结合。HorA旨在识别具有生物学意义的联系,而非单纯的三维几何相似性。HorA方法能找到在人工整理的数据库SCOP中定义的约90%的远缘同源物。HorA对于寻找可能被其他序列或结构相似性搜索服务器忽略的远缘同源物将特别有用。HorA服务器可在http://prodata.swmed.edu/horaserver获取。