European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK and Department of Biochemistry and Molecular Genetics, University of Virginia, 1300 Jefferson Park Ave., Charlottesville, VA 22908, USA.
Nucleic Acids Res. 2014 Jan;42(Database issue):D485-9. doi: 10.1093/nar/gkt1243. Epub 2013 Dec 6.
Understanding which are the catalytic residues in an enzyme and what function they perform is crucial to many biology studies, particularly those leading to new therapeutics and enzyme design. The original version of the Catalytic Site Atlas (CSA) (http://www.ebi.ac.uk/thornton-srv/databases/CSA) published in 2004, which catalogs the residues involved in enzyme catalysis in experimentally determined protein structures, had only 177 curated entries and employed a simplistic approach to expanding these annotations to homologous enzyme structures. Here we present a new version of the CSA (CSA 2.0), which greatly expands the number of both curated (968) and automatically annotated catalytic sites in enzyme structures, utilizing a new method for annotation transfer. The curated entries are used, along with the variation in residue type from the sequence comparison, to generate 3D templates of the catalytic sites, which in turn can be used to find catalytic sites in new structures. To ease the transfer of CSA annotations to other resources a new ontology has been developed: the Enzyme Mechanism Ontology, which has permitted the transfer of annotations to Mechanism, Annotation and Classification in Enzymes (MACiE) and UniProt Knowledge Base (UniProtKB) resources. The CSA database schema has been re-designed and both the CSA data and search capabilities are presented in a new modern web interface.
了解酶中的催化残基及其功能对于许多生物学研究至关重要,特别是那些导致新疗法和酶设计的研究。2004 年发布的原始 Catalytic Site Atlas (CSA) (http://www.ebi.ac.uk/thornton-srv/databases/CSA) 版本仅包含 177 个经过精心整理的条目,并采用了一种简单的方法将这些注释扩展到同源酶结构。在这里,我们提出了 CSA 的新版本 (CSA 2.0),它大大扩展了经过精心整理的(968 个)和自动注释的酶结构中催化位点的数量,利用了一种新的注释转移方法。经过精心整理的条目与序列比较中残基类型的变化一起,生成催化位点的 3D 模板,这些模板反过来可以用于在新结构中找到催化位点。为了方便将 CSA 注释转移到其他资源,开发了一个新的本体论:酶机制本体论,它允许将注释转移到机制、酶中的注释和分类 (MACiE) 和 UniProt 知识库 (UniProtKB) 资源。CSA 数据库架构已重新设计,CSA 数据和搜索功能都在新的现代 Web 界面中呈现。