Cuff Alison L, Sillitoe Ian, Lewis Tony, Redfern Oliver C, Garratt Richard, Thornton Janet, Orengo Christine A
Institute of Structural and Molecular Biology, University College London, London, WC1E 6BT, UK.
Nucleic Acids Res. 2009 Jan;37(Database issue):D310-4. doi: 10.1093/nar/gkn877. Epub 2008 Nov 7.
The latest version of CATH (class, architecture, topology, homology) (version 3.2), released in July 2008 (http://www.cathdb.info), contains 114,215 domains, 2178 Homologous superfamilies and 1110 fold groups. We have assigned 20,330 new domains, 87 new homologous superfamilies and 26 new folds since CATH release version 3.1. A total of 28,064 new domains have been assigned since our NAR 2007 database publication (CATH version 3.0). The CATH website has been completely redesigned and includes more comprehensive documentation. We have revisited the CATH architecture level as part of the development of a 'Protein Chart' and present information on the population of each architecture. The CATHEDRAL structure comparison algorithm has been improved and used to characterize structural diversity in CATH superfamilies and structural overlaps between superfamilies. Although the majority of superfamilies in CATH are not structurally diverse and do not overlap significantly with other superfamilies, approximately 4% of superfamilies are very diverse and these are the superfamilies that are most highly populated in both the PDB and in the genomes. Information on the degree of structural diversity in each superfamily and structural overlaps between superfamilies can now be downloaded from the CATH website.
最新版本的CATH(类别、结构、拓扑、同源性)(3.2版)于2008年7月发布(http://www.cathdb.info),包含114,215个结构域、2178个同源超家族和1110个折叠组。自CATH 3.1版发布以来,我们已指定了20,330个新结构域、87个新同源超家族和26个新折叠。自我们2007年在《核酸研究》上发表数据库(CATH 3.0版)以来,总共指定了28,064个新结构域。CATH网站已全面重新设计,并包含更全面的文档。作为“蛋白质图表”开发的一部分,我们重新审视了CATH结构水平,并展示了每个结构的数量信息。CATHEDRAL结构比较算法已得到改进,并用于表征CATH超家族中的结构多样性以及超家族之间的结构重叠。尽管CATH中的大多数超家族在结构上并不多样,且与其他超家族没有明显重叠,但约4%的超家族非常多样,这些超家族在蛋白质数据库(PDB)和基因组中都是数量最多的。现在可以从CATH网站下载每个超家族的结构多样性程度以及超家族之间结构重叠的信息。