Marchler-Bauer Aron, Anderson John B, Cherukuri Praveen F, DeWeese-Scott Carol, Geer Lewis Y, Gwadz Marc, He Siqian, Hurwitz David I, Jackson John D, Ke Zhaoxi, Lanczycki Christopher J, Liebert Cynthia A, Liu Chunlei, Lu Fu, Marchler Gabriele H, Mullokandov Mikhail, Shoemaker Benjamin A, Simonyan Vahan, Song James S, Thiessen Paul A, Yamashita Roxanne A, Yin Jodie J, Zhang Dachuan, Bryant Stephen H
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38 A, Room 8N805, 8600 Rockville Pike, Bethesda, MD 20894, USA.
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D192-6. doi: 10.1093/nar/gki069.
The Conserved Domain Database (CDD) is the protein classification component of NCBI's Entrez query and retrieval system. CDD is linked to other Entrez databases such as Proteins, Taxonomy and PubMed, and can be accessed at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=cdd. CD-Search, which is available at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi, is a fast, interactive tool to identify conserved domains in new protein sequences. CD-Search results for protein sequences in Entrez are pre-computed to provide links between proteins and domain models, as well as computational annotation that is visible upon request. Protein-protein queries submitted to NCBI's BLAST search service at http://www.ncbi.nlm.nih.gov/BLAST are scanned for the presence of conserved domains by default. While CDD started out as essentially a mirror of publicly available domain alignment collections, such as SMART, Pfam and COG, we have continued an effort to update and, in some cases, replace these models with domain hierarchies curated at the NCBI. Here, we report on the progress of the curation effort and associated improvements in the functionality of the CDD information retrieval system.