Marchler-Bauer Aron, Anderson John B, Chitsaz Farideh, Derbyshire Myra K, DeWeese-Scott Carol, Fong Jessica H, Geer Lewis Y, Geer Renata C, Gonzales Noreen R, Gwadz Marc, He Siqian, Hurwitz David I, Jackson John D, Ke Zhaoxi, Lanczycki Christopher J, Liebert Cynthia A, Liu Chunlei, Lu Fu, Lu Shennan, Marchler Gabriele H, Mullokandov Mikhail, Song James S, Tasneem Asba, Thanki Narmada, Yamashita Roxanne A, Zhang Dachuan, Zhang Naigong, Bryant Stephen H
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bldg. 38 A, Room 8N805, 8600 Rockville Pike, Bethesda, MD 20894, USA.
Nucleic Acids Res. 2009 Jan;37(Database issue):D205-10. doi: 10.1093/nar/gkn845. Epub 2008 Nov 4.
NCBI's Conserved Domain Database (CDD) is a collection of multiple sequence alignments and derived database search models, which represent protein domains conserved in molecular evolution. The collection can be accessed at http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml, and is also part of NCBI's Entrez query and retrieval system, cross-linked to numerous other resources. CDD provides annotation of domain footprints and conserved functional sites on protein sequences. Precalculated domain annotation can be retrieved for protein sequences tracked in NCBI's Entrez system, and CDD's collection of models can be queried with novel protein sequences via the CD-Search service at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi. Starting with the latest version of CDD, v2.14, information from redundant and homologous domain models is summarized at a superfamily level, and domain annotation on proteins is flagged as either 'specific' (identifying molecular function with high confidence) or as 'non-specific' (identifying superfamily membership only).
美国国立医学图书馆国家生物技术信息中心的保守结构域数据库(CDD)是一个多序列比对和衍生数据库搜索模型的集合,代表了分子进化中保守的蛋白质结构域。该集合可在http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml上访问,也是美国国立医学图书馆国家生物技术信息中心Entrez查询和检索系统的一部分,与许多其他资源相互链接。CDD提供蛋白质序列上结构域足迹和保守功能位点的注释。可以为美国国立医学图书馆国家生物技术信息中心Entrez系统中跟踪的蛋白质序列检索预先计算的结构域注释,并且可以通过http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi的CD-Search服务使用新的蛋白质序列查询CDD的模型集合。从CDD的最新版本v2.14开始,冗余和同源结构域模型的信息在超家族水平上进行汇总,蛋白质上的结构域注释被标记为“特异性”(以高置信度识别分子功能)或“非特异性”(仅识别超家族成员)。