HOMSTRAD: recent developments of the Homologous Protein Structure Alignment Database.

Author information

Stebbings Lucy A, Mizuguchi Kenji

Affiliation

Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK.

Publication information

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D203-7. doi: 10.1093/nar/gkh027.

DOI:10.1093/nar/gkh027

PMID:14681395

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC308761/

Abstract

HOMSTRAD (http://www-cryst.bioc.cam.ac.uk/homstrad/) is a collection of protein families, clustered on the basis of sequence and structural similarity. The database is unique in that the protein family sequence alignments have been specially annotated using the program, JOY, to highlight a wide range of structural features. Such data are useful for identifying key structurally conserved residues within the families. Superpositions of the structures within each family are also available and a sensitive structure-aided search engine, FUGUE, can be used to search the database for matches to a query protein sequence. Historically, HOMSTRAD families were generated using several key pieces of software, including COMPARER and MNYFIT, and held in a number of flat files and indexes. A new relational database version of HOMSTRAD, HOMSTRAD BETA (http://www-cryst.bioc.cam.ac.uk/homstradbeta/) is being developed using MySQL. This relational data structure provides more flexibility for future developments, reduces update times and makes data more easily accessible. Consequently it has been possible to add a number of new web features including a custom alignment facility. Altogether, this makes HOMSTRAD and its new BETA version, an excellent resource both for comparative modelling and for identifying distant sequence/structure similarities between proteins.
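
To illustrate why a relational layout can make family data easier to query than flat files, the following is a minimal sketch only. The table names, column names, rows, and the `custom_alignment` helper are all hypothetical and are not taken from HOMSTRAD BETA; SQLite stands in for MySQL so the example runs without a server. The abstract does not describe how the custom alignment facility works, so the helper shows just one plausible reading: selecting a subset of a family's members and dropping columns that become all-gap.

```python
import sqlite3

# Hypothetical schema: names are illustrative and NOT the HOMSTRAD BETA schema.
# SQLite (in memory) is used here in place of MySQL.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE family (
    family_id INTEGER PRIMARY KEY,
    name      TEXT NOT NULL UNIQUE
);
CREATE TABLE member (
    member_id   INTEGER PRIMARY KEY,
    family_id   INTEGER NOT NULL REFERENCES family(family_id),
    pdb_code    TEXT NOT NULL,
    aligned_seq TEXT NOT NULL  -- one row of the family alignment, '-' marks a gap
);
""")

# Placeholder rows: made-up identifiers and sequences, not real database entries.
conn.execute("INSERT INTO family (family_id, name) VALUES (1, 'example_family')")
conn.executemany(
    "INSERT INTO member (family_id, pdb_code, aligned_seq) VALUES (?, ?, ?)",
    [(1, "xxxA", "MK-LV"), (1, "yyyA", "MKALV"), (1, "zzzA", "M--LV")],
)


def custom_alignment(conn, family_name, pdb_codes):
    """Return aligned rows for a chosen subset of one family's members.

    Columns that contain only gaps within the chosen subset are removed.
    """
    placeholders = ",".join("?" for _ in pdb_codes)
    rows = conn.execute(
        f"""SELECT m.pdb_code, m.aligned_seq
            FROM member AS m JOIN family AS f USING (family_id)
            WHERE f.name = ? AND m.pdb_code IN ({placeholders})
            ORDER BY m.pdb_code""",
        [family_name, *pdb_codes],
    ).fetchall()
    if not rows:
        return []
    width = len(rows[0][1])
    # Keep a column if at least one selected sequence has a residue there.
    keep = [i for i in range(width) if any(seq[i] != "-" for _, seq in rows)]
    return [(code, "".join(seq[i] for i in keep)) for code, seq in rows]


subset = custom_alignment(conn, "example_family", ["xxxA", "zzzA"])
for code, seq in subset:
    print(code, seq)
```

With a flat-file layout, the same subset selection would typically require parsing each family file; here it is a single join, which is the kind of accessibility gain the abstract attributes to the relational version.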


Similar articles

HOMSTRAD: recent developments of the Homologous Protein Structure Alignment Database.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D203-7. doi: 10.1093/nar/gkh027.

HOMSTRAD: a database of protein structure alignments for homologous families.

Protein Sci. 1998 Nov;7(11):2469-71. doi: 10.1002/pro.5560071126.

FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties.

J Mol Biol. 2001 Jun 29;310(1):243-57. doi: 10.1006/jmbi.2001.4762.

HOMSTRAD: adding sequence information to structure-based alignments of homologous protein families.

Bioinformatics. 2001 Aug;17(8):748-9. doi: 10.1093/bioinformatics/17.8.748.

JOY: protein sequence-structure representation and analysis.

Bioinformatics. 1998;14(7):617-23. doi: 10.1093/bioinformatics/14.7.617.

OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy.

BMC Bioinformatics. 2003 Oct 10;4:47. doi: 10.1186/1471-2105-4-47.

PASS2: an automated database of protein alignments organised as structural superfamilies.

BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35.

EyeSite: a semi-automated database of protein families in the eye.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D148-52. doi: 10.1093/nar/gkh090.

An iterative structure-assisted approach to sequence alignment and comparative modeling.

Proteins. 1999;Suppl 3:55-60. doi: 10.1002/(sici)1097-0134(1999)37:3+<55::aid-prot8>3.3.co;2-2.

BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark.

Proteins. 2005 Oct 1;61(1):127-36. doi: 10.1002/prot.20527.

Cited by

Exploration of Comprehensive Structural and Functional Potential of Recombinant Proteins Using Cutting-Edge Bioinformatics Tools.

Appl Biochem Biotechnol. 2025 Sep 9. doi: 10.1007/s12010-025-05366-2.

GTalign: spatial index-driven protein structure alignment, superposition, and search.

Nat Commun. 2024 Aug 24;15(1):7305. doi: 10.1038/s41467-024-51669-z.

Exploring Viral-Host Protein Interactions as Antiviral Therapies: A Computational Perspective.

Microorganisms. 2024 Mar 21;12(3):630. doi: 10.3390/microorganisms12030630.

Leveraging protein language models for accurate multiple sequence alignments.

Genome Res. 2023 Jul;33(7):1145-1153. doi: 10.1101/gr.277675.123. Epub 2023 Jul 6.

learnMSA: learning and aligning large protein families.

Gigascience. 2022 Nov 18;11. doi: 10.1093/gigascience/giac104.

The variation among sites of protein structure divergence is shaped by mutation and scaled by selection.

Curr Res Struct Biol. 2020 Aug 26;2:156-163. doi: 10.1016/j.crstbi.2020.08.002. eCollection 2020.

Fast computational mutation-response scanning of proteins.

PeerJ. 2021 Apr 21;9:e11330. doi: 10.7717/peerj.11330. eCollection 2021.

Tailor-made multiple sequence alignments using the PRALINE 2 alignment toolkit.

Bioinformatics. 2019 Dec 15;35(24):5315-5317. doi: 10.1093/bioinformatics/btz572.

Newly Synthesized Oxygenated Xanthones as Potential P-Glycoprotein Activators: In Vitro, Ex Vivo, and In Silico Studies.

Molecules. 2019 Feb 15;24(4):707. doi: 10.3390/molecules24040707.

mTM-align: an algorithm for fast and accurate multiple protein structure alignment.

Bioinformatics. 2018 May 15;34(10):1719-1725. doi: 10.1093/bioinformatics/btx828.

References

The CATH database: an extended protein family resource for structural and functional genomics.

Nucleic Acids Res. 2003 Jan 1;31(1):452-5. doi: 10.1093/nar/gkg062.

The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.

Nucleic Acids Res. 2003 Jan 1;31(1):365-70. doi: 10.1093/nar/gkg095.

PROSITE: a documented database using patterns and profiles as motif descriptors.

Brief Bioinform. 2002 Sep;3(3):265-74. doi: 10.1093/bib/3.3.265.

Sequence-structure homology recognition by iterative alignment refinement and comparative modeling.

Proteins. 2001;Suppl 5:92-7. doi: 10.1002/prot.1169.

The Pfam protein families database.

Nucleic Acids Res. 2002 Jan 1;30(1):276-80. doi: 10.1093/nar/30.1.276.

SCOP database in 2002: refinements accommodate structural genomics.

Nucleic Acids Res. 2002 Jan 1;30(1):264-7. doi: 10.1093/nar/30.1.264.

HOMSTRAD: adding sequence information to structure-based alignments of homologous protein families.

Bioinformatics. 2001 Aug;17(8):748-9. doi: 10.1093/bioinformatics/17.8.748.

FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties.

J Mol Biol. 2001 Jun 29;310(1):243-57. doi: 10.1006/jmbi.2001.4762.

A new algorithm for the alignment of multiple protein structures using Monte Carlo optimization.

Pac Symp Biocomput. 2001:275-86. doi: 10.1142/9789814447362_0028.

Evolutionary trace analysis of TGF-beta and related growth factors: implications for site-directed mutagenesis.

Protein Eng. 2000 Dec;13(12):839-47. doi: 10.1093/protein/13.12.839.
