COBALT：用于多条蛋白质序列的基于约束的比对工具。

COBALT: constraint-based alignment tool for multiple protein sequences.

作者信息

Papadopoulos Jason S, Agarwala Richa

机构信息

National Center for Biotechnology Information, National Institutes of Health, Department of Health and Human Services, Bethesda, MD 20894, USA.

出版信息

Bioinformatics. 2007 May 1;23(9):1073-9. doi: 10.1093/bioinformatics/btm076. Epub 2007 Mar 1.

DOI:10.1093/bioinformatics/btm076

PMID:17332019

Abstract

MOTIVATION

A tool that simultaneously aligns multiple protein sequences, automatically utilizes information about protein domains, and has a good compromise between speed and accuracy will have practical advantages over current tools.

RESULTS

We describe COBALT, a constraint based alignment tool that implements a general framework for multiple alignment of protein sequences. COBALT finds a collection of pairwise constraints derived from database searches, sequence similarity and user input, combines these pairwise constraints, and then incorporates them into a progressive multiple alignment. We show that using constraints derived from the conserved domain database (CDD) and PROSITE protein-motif database improves COBALT's alignment quality. We also show that COBALT has reasonable runtime performance and alignment accuracy comparable to or exceeding that of other tools for a broad range of problems.

AVAILABILITY

COBALT is included in the NCBI C++ toolkit. A Linux executable for COBALT, and CDD and PROSITE data used is available at: ftp://ftp.ncbi.nlm.nih.gov/pub/agarwala/cobalt

摘要

动机

一种能够同时比对多个蛋白质序列、自动利用蛋白质结构域信息并且在速度和准确性之间取得良好平衡的工具，相较于当前工具具有实际优势。

结果

我们描述了COBALT，一种基于约束的比对工具，它实现了蛋白质序列多重比对的通用框架。COBALT找到从数据库搜索、序列相似性和用户输入中得出的一组成对约束，将这些成对约束合并，然后将它们纳入渐进式多重比对。我们表明，使用源自保守结构域数据库（CDD）和PROSITE蛋白质基序数据库的约束可提高COBALT的比对质量。我们还表明，对于广泛的问题，COBALT具有合理的运行时性能和比对准确性，可与其他工具相媲美或超过其他工具。

可用性

COBALT包含在NCBI C++工具包中。可从以下网址获取COBALT的Linux可执行文件以及所使用的CDD和PROSITE数据：ftp://ftp.ncbi.nlm.nih.gov/pub/agarwala/cobalt

相似文献

COBALT: constraint-based alignment tool for multiple protein sequences.

Bioinformatics. 2007 May 1;23(9):1073-9. doi: 10.1093/bioinformatics/btm076. Epub 2007 Mar 1.

A structure-based method for protein sequence alignment.

Bioinformatics. 2005 Apr 15;21(8):1451-6. doi: 10.1093/bioinformatics/bti233. Epub 2004 Dec 21.

WindowMasker: window-based masker for sequenced genomes.

Bioinformatics. 2006 Jan 15;22(2):134-41. doi: 10.1093/bioinformatics/bti774. Epub 2005 Nov 15.

Improved BLAST searches using longer words for protein seeding.

Bioinformatics. 2007 Nov 1;23(21):2949-51. doi: 10.1093/bioinformatics/btm479. Epub 2007 Oct 6.

QOMA: quasi-optimal multiple alignment of protein sequences.

Bioinformatics. 2007 Jan 15;23(2):162-8. doi: 10.1093/bioinformatics/btl590. Epub 2006 Nov 22.

Accuracy of structure-based sequence alignment of automatic methods.

BMC Bioinformatics. 2007 Sep 20;8:355. doi: 10.1186/1471-2105-8-355.

ARCS: an aggregated related column scoring scheme for aligned sequences.

Bioinformatics. 2006 Oct 1;22(19):2326-32. doi: 10.1093/bioinformatics/btl398. Epub 2006 Jul 26.

PROMALS: towards accurate multiple sequence alignments of distantly related proteins.

Bioinformatics. 2007 Apr 1;23(7):802-8. doi: 10.1093/bioinformatics/btm017. Epub 2007 Jan 31.

APDB: a web server to evaluate the accuracy of sequence alignments using structural information.

Bioinformatics. 2006 Oct 1;22(19):2439-40. doi: 10.1093/bioinformatics/btl404.

Refining multiple sequence alignments with conserved core regions.

Nucleic Acids Res. 2006 May 17;34(9):2598-606. doi: 10.1093/nar/gkl274. Print 2006.

引用本文的文献

Experience-dependent reconfiguration of receptors at a sensory compartment regulates neuronal plasticity.

bioRxiv. 2025 Aug 13:2025.08.13.670147. doi: 10.1101/2025.08.13.670147.

Genome-scale flux balance analysis reveals redox trade-offs in the metabolism of the thermoacidophile under auto-, hetero-and methanotrophic conditions.

Front Syst Biol. 2024 Jan 29;4:1291612. doi: 10.3389/fsysb.2024.1291612. eCollection 2024.

Moderately severe osteogenesis imperfecta-like osteochondrodysplasia associated with heterozygous variants in both and .

JBMR Plus. 2025 Jul 22;9(9):ziaf111. doi: 10.1093/jbmrpl/ziaf111. eCollection 2025 Sep.

S-phase checkpoint protects from aberrant replication fork processing and degradation.

Nucleic Acids Res. 2025 Jul 19;53(14). doi: 10.1093/nar/gkaf707.

Ligand binding to a Ni-Fe cluster orchestrates conformational changes of the CO-dehydrogenase-acetyl-CoA synthase complex.

Nat Catal. 2025;8(7):657-667. doi: 10.1038/s41929-025-01365-y. Epub 2025 Jul 11.

Changes in the Adenylate Kinase Activity are Proportional to the ADP/ATP Ratio Upon Resorption and Regeneration of Chlamydomonas reinhardtii Flagella.

Cell Biochem Biophys. 2025 Jul 18. doi: 10.1007/s12013-025-01825-z.

In silico characterization, structural modeling, and molecular docking of GabP in citrus and its potential role in GABA uptake.

Sci Rep. 2025 Jul 4;15(1):23919. doi: 10.1038/s41598-025-07447-y.

Mechanistic Design of Cell-Penetrating Disruptors for a Phospho-Dependent Interaction.

Res Sq. 2025 Jun 17:rs.3.rs-6862805. doi: 10.21203/rs.3.rs-6862805/v1.

Mapping the Interactions Among Class IIa Histone Deacetylases and Myocyte Enhancer Factor 2s.

J Chem Inf Model. 2025 Jun 23;65(12):6249-6260. doi: 10.1021/acs.jcim.5c00858. Epub 2025 Jun 6.

is a critical gene in methionine biosynthesis in .

Front Fungal Biol. 2025 May 22;6:1563395. doi: 10.3389/ffunb.2025.1563395. eCollection 2025.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

COBALT：用于多条蛋白质序列的基于约束的比对工具。

COBALT: constraint-based alignment tool for multiple protein sequences.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献