PROCAIN: protein profile comparison with assisting information.

Authors

Wang Yong, Sadreyev Ruslan I, Grishin Nick V

Affiliation

Biomedical Engineering Program, University of Texas Southwestern Medical Center, Dallas, TX 75390-9050, USA.

Publication information

Nucleic Acids Res. 2009 Jun;37(11):3522-30. doi: 10.1093/nar/gkp212. Epub 2009 Apr 7.

Abstract

Detection of remote sequence homology is essential for the accurate inference of protein structure, function and evolution. The most sensitive detection methods compare evolutionary patterns reflected in multiple sequence alignments (MSAs) of protein families. We present PROCAIN, a new method for MSA comparison that combines 'vertical' MSA context (substitution constraints at individual sequence positions) with 'horizontal' context (patterns of residue content at multiple positions). Although it relies on a simple, tractable profile methodology and primitive measures of similarity between horizontal MSA patterns, the method achieves homology detection quality comparable to that of a more complex, advanced method that employs hidden Markov models (HMMs) and secondary structure (SS) prediction. Adding SS information further improves PROCAIN performance beyond the capabilities of current state-of-the-art tools. The potential value of the method for structure/function prediction is illustrated by the detection of subtle homology between evolutionarily distant yet structurally similar protein domains. PROCAIN, relevant databases and tools can be downloaded from http://prodata.swmed.edu/procain/download. The web server can be accessed at http://prodata.swmed.edu/procain/procain.php.
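
The idea of combining vertical and horizontal MSA context can be illustrated with a minimal Python sketch. This is not PROCAIN's scoring function, which the abstract does not specify. It assumes a dot product of column frequency vectors as the vertical term and a windowed comparison of column entropies as a stand-in horizontal term. All function names, the pseudocount, the window size and the weight are hypothetical choices made for illustration only.

```python
from collections import Counter
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def column_frequencies(msa, pseudocount=1.0):
    """Per-column residue frequencies with a flat pseudocount; gaps are ignored."""
    profile = []
    for i in range(len(msa[0])):
        counts = Counter(seq[i] for seq in msa if seq[i] in AMINO_ACIDS)
        total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
        profile.append({a: (counts[a] + pseudocount) / total for a in AMINO_ACIDS})
    return profile

def column_entropy(freqs):
    """Shannon entropy of one column; low entropy means a conserved position."""
    return -sum(p * math.log(p) for p in freqs.values())

def vertical_score(col_a, col_b):
    """Vertical context (assumed form): dot product of two frequency vectors."""
    return sum(col_a[a] * col_b[a] for a in AMINO_ACIDS)

def horizontal_score(profile_a, i, profile_b, j, half_window=2):
    """Horizontal context (assumed form): negative mean absolute difference
    of column entropies over a window of neighbouring positions."""
    diffs = []
    for k in range(-half_window, half_window + 1):
        if 0 <= i + k < len(profile_a) and 0 <= j + k < len(profile_b):
            diffs.append(abs(column_entropy(profile_a[i + k])
                             - column_entropy(profile_b[j + k])))
    return -sum(diffs) / len(diffs)

def combined_score(profile_a, i, profile_b, j, weight=0.5):
    """Position-pair score: vertical term plus a weighted horizontal term."""
    return (vertical_score(profile_a[i], profile_b[j])
            + weight * horizontal_score(profile_a, i, profile_b, j))
```

In a full comparison method, a position-pair score of this kind would be fed into a dynamic-programming alignment of the two profiles. The sketch stops at the scoring step because that is the part the abstract describes.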



