Geer Lewis Y, Domrachev Michael, Lipman David J, Bryant Stephen H
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA.
Genome Res. 2002 Oct;12(10):1619-23. doi: 10.1101/gr.278202.
The Conserved Domain Architecture Retrieval Tool (CDART) performs similarity searches of the NCBI Entrez Protein Database based on domain architecture, defined as the sequential order of conserved domains in proteins. The algorithm finds protein similarities across significant evolutionary distances using sensitive protein domain profiles rather than by direct sequence similarity. Proteins similar to a query protein are grouped and scored by architecture. Relying on domain profiles allows CDART to be fast, and, because it relies on annotated functional domains, informative. Domain profiles are derived from several collections of domain definitions that include functional annotation. Searches can be further refined by taxonomy and by selecting domains of interest. CDART is available at http://www.ncbi.nlm.nih.gov/Structure/lexington/lexington.cgi.
保守结构域构架检索工具(CDART)基于结构域构架对NCBI Entrez蛋白质数据库进行相似性搜索,结构域构架定义为蛋白质中保守结构域的顺序。该算法使用敏感的蛋白质结构域谱而非直接的序列相似性来发现跨越显著进化距离的蛋白质相似性。与查询蛋白质相似的蛋白质按构架进行分组和评分。依靠结构域谱使得CDART速度快,并且由于它依赖于注释的功能结构域,所以信息丰富。结构域谱源自包括功能注释的多个结构域定义集合。搜索可以通过分类法以及选择感兴趣的结构域进一步细化。可通过http://www.ncbi.nlm.nih.gov/Structure/lexington/lexington.cgi获取CDART。