氨基酸取代的度量模型。

A metric model of amino acid substitution.

作者信息

Xu Weijia, Miranker Daniel P

机构信息

Department of Computer Sciences, The Center for Computational Biology and Bioinformatics, University of Texas, Austin, TX 78712, USA.

出版信息

Bioinformatics. 2004 May 22;20(8):1214-21. doi: 10.1093/bioinformatics/bth065. Epub 2004 Feb 10.

DOI:10.1093/bioinformatics/bth065

PMID:14871874

Abstract

MOTIVATION

We address the question of whether there exists an effective evolutionary model of amino-acid substitution that forms a metric-distance function. There is always a trade-off between speed and sensitivity among competing computational methods of determining sequence homology. A metric model of evolution is a prerequisite for the development of an entire class of fast sequence analysis algorithms that are both scalable, O(log n) and sensitive.

RESULTS

We have reworked the mathematics of the point accepted mutation model (PAM) by calculating the expected time between accepted mutations in lieu of calculating log-odds probabilities. The resulting substitution matrix (mPAM) forms a metric. We validate the application of the mPAM evolutionary model for sequence homology by executing sequence queries from a controlled yeast protein homology search benchmark. We compare the accuracy of the results of mPAM and PAM similarity matrices as well as three prior metric models. The experiment shows that mPAM significantly outperforms the other three metrics and sufficiently approaches the sensitivity of PAM250 to make it applicable to the management of protein sequence databases.

摘要

动机

我们探讨是否存在一种有效的氨基酸替换进化模型，该模型能形成一个度量距离函数。在确定序列同源性的各种竞争计算方法中，速度和灵敏度之间始终存在权衡。进化的度量模型是开发一类快速序列分析算法的先决条件，这类算法既要具有可扩展性（O(log n)）又要灵敏。

结果

我们通过计算接受突变之间的预期时间，而不是计算对数优势概率，对接受点突变模型（PAM）的数学进行了重新推导。由此产生的替换矩阵（mPAM）形成了一个度量。我们通过执行来自受控酵母蛋白质同源性搜索基准的序列查询，验证了mPAM进化模型在序列同源性方面的应用。我们比较了mPAM和PAM相似性矩阵以及三个先前的度量模型结果的准确性。实验表明，mPAM显著优于其他三个度量，并且足够接近PAM250的灵敏度，使其适用于蛋白质序列数据库的管理。

相似文献

A metric model of amino acid substitution.

Bioinformatics. 2004 May 22;20(8):1214-21. doi: 10.1093/bioinformatics/bth065. Epub 2004 Feb 10.

Improved pairwise alignments of proteins in the Twilight Zone using local structure predictions.

Bioinformatics. 2006 Feb 15;22(4):413-22. doi: 10.1093/bioinformatics/bti828. Epub 2005 Dec 13.

The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions.

Bioinformatics. 2005 Apr 1;21(7):902-11. doi: 10.1093/bioinformatics/bti070. Epub 2004 Oct 27.

Relation between weight matrix and substitution matrix: motif search by similarity.

Bioinformatics. 2005 Apr 1;21(7):938-43. doi: 10.1093/bioinformatics/bti090. Epub 2004 Oct 28.

Eigenvalue analysis of amino acid substitution matrices reveals a sharp transition of the mode of sequence conservation in proteins.

Bioinformatics. 2004 Nov 1;20(16):2504-8. doi: 10.1093/bioinformatics/bth297. Epub 2004 May 6.

An alternative model of amino acid replacement.

Bioinformatics. 2005 Apr 1;21(7):975-80. doi: 10.1093/bioinformatics/bti109. Epub 2004 Nov 5.

Fold-specific substitution matrices for protein classification.

Bioinformatics. 2004 Apr 12;20(6):847-53. doi: 10.1093/bioinformatics/btg492. Epub 2004 Feb 5.

Andante: reducing side-chain rotamer search space during comparative modeling using environment-specific substitution probabilities.

Bioinformatics. 2007 May 1;23(9):1099-105. doi: 10.1093/bioinformatics/btm073. Epub 2007 Mar 6.

On distance and similarity in fold space.

Bioinformatics. 2008 Mar 15;24(6):872-3. doi: 10.1093/bioinformatics/btn040. Epub 2008 Jan 28.

Application of a simple likelihood ratio approximant to protein sequence classification.

Bioinformatics. 2006 Dec 1;22(23):2865-9. doi: 10.1093/bioinformatics/btl512. Epub 2006 Nov 7.

引用本文的文献

Ab initio detection of fuzzy amino acid tandem repeats in protein sequences.

BMC Bioinformatics. 2012 Mar 21;13 Suppl 3(Suppl 3):S8. doi: 10.1186/1471-2105-13-S3-S8.

Amino acid "little Big Bang": representing amino acid substitution matrices as dot products of Euclidian vectors.

BMC Bioinformatics. 2010 Jan 4;11:4. doi: 10.1186/1471-2105-11-4.

Inconsistent distances in substitution matrices can be avoided by properly handling hydrophobic residues.

Evol Bioinform Online. 2008 Oct 9;4:255-61. doi: 10.4137/ebo.s885.

The influenza virus resource at the National Center for Biotechnology Information.

J Virol. 2008 Jan;82(2):596-601. doi: 10.1128/JVI.02005-07. Epub 2007 Oct 17.

A reduced amino acid alphabet for understanding and designing protein adaptation to mutation.

Eur Biophys J. 2007 Nov;36(8):1059-69. doi: 10.1007/s00249-007-0188-5. Epub 2007 Jun 13.

A collection of amino acid replacement matrices derived from clusters of orthologs.

J Mol Evol. 2005 Nov;61(5):659-65. doi: 10.1007/s00239-005-0060-0. Epub 2005 Oct 20.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

氨基酸取代的度量模型。

A metric model of amino acid substitution.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

动机

结果

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献