Espadaler Jordi, Fernandez-Fuentes Narcis, Hermoso Antonio, Querol Enrique, Aviles Francesc X, Sternberg Michael J E, Oliva Baldomero
Institut de Biotecnologia i de Biomedicina and Departament de Bioquímica, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D185-8. doi: 10.1093/nar/gkh002.
The annotation of protein function has become a crucial problem with the advent of sequence and structural genomics initiatives. A large body of evidence suggests that protein structural information is frequently encoded in local sequences, and that folds are mainly made up of a number of simple local units of super-secondary structural motifs, consisting of a few secondary structures and their connecting loops. Moreover, protein loops play an important role in protein function. Here we present ArchDB, a classification database of structural motifs, consisting of one loop plus its bracing secondary structures. ArchDB currently contains 12,665 super-secondary elements classified into 1496 motif subclasses. The database provides an easy way to retrieve functional information from protein structures sharing a common motif, to search motifs found in a given SCOP family, superfamily or fold, or to search by keywords on proteins with classified loops. The ArchDB database of loops is located at http://sbi.imim.es/archdb.
随着序列和结构基因组计划的出现,蛋白质功能注释已成为一个关键问题。大量证据表明,蛋白质结构信息常常编码在局部序列中,并且折叠主要由一些简单的超二级结构基序的局部单元组成,这些单元由少数二级结构及其连接环组成。此外,蛋白质环在蛋白质功能中起重要作用。在此,我们展示了ArchDB,一个结构基序分类数据库,由一个环及其支撑二级结构组成。ArchDB目前包含12,665个超二级元件,分为1496个基序子类。该数据库提供了一种简便的方法,可从共享共同基序的蛋白质结构中检索功能信息,搜索在给定的SCOP家族、超家族或折叠中发现的基序,或通过关键词搜索具有分类环的蛋白质。环的ArchDB数据库位于http://sbi.imim.es/archdb 。