Cuff Alison L, Sillitoe Ian, Lewis Tony, Clegg Andrew B, Rentzsch Robert, Furnham Nicholas, Pellegrini-Calace Marialuisa, Jones David, Thornton Janet, Orengo Christine A
Institute of Structural and Molecular Biology, University College London, Darwin Building, Gower Street, London WC1E 6BT, UK.
Nucleic Acids Res. 2011 Jan;39(Database issue):D420-6. doi: 10.1093/nar/gkq1001. Epub 2010 Nov 19.
CATH version 3.3 (class, architecture, topology, homology) contains 128,688 domains, 2386 homologous superfamilies and 1233 fold groups, and reflects a major focus on classifying structural genomics (SG) structures and transmembrane proteins, both of which are likely to add structural novelty to the database and therefore increase the coverage of protein fold space within CATH. For CATH version 3.4 we have significantly improved the presentation of sequence information and associated functional information for CATH superfamilies. The CATH superfamily pages now reflect both the functional and structural diversity within the superfamily and include structural alignments of close and distant relatives within the superfamily, annotated with functional information and details of conserved residues. A significantly more efficient search function for CATH has been established by implementing the search server Solr (http://lucene.apache.org/solr/). The CATH v3.4 webpages have been built using the Catalyst web framework.
CATH版本3.3(类别、架构、拓扑、同源性)包含128,688个结构域、2386个同源超家族和1233个折叠组,主要侧重于对结构基因组学(SG)结构和跨膜蛋白进行分类,这两者都可能为数据库增添结构新颖性,从而增加CATH中蛋白质折叠空间的覆盖范围。对于CATH版本3.4,我们显著改进了CATH超家族序列信息及相关功能信息的呈现方式。CATH超家族页面现在既反映了超家族内的功能和结构多样性,还包括超家族内近缘和远缘亲属的结构比对,并标注了功能信息和保守残基的细节。通过实施搜索服务器Solr(http://lucene.apache.org/solr/),已建立了一个效率显著更高的CATH搜索功能。CATH v3.4网页是使用Catalyst网络框架构建的。