估计氨基酸替换模型：Dayhoff估计器、预解式方法与最大似然法的比较。

Estimating amino acid substitution models: a comparison of Dayhoff's estimator, the resolvent approach and a maximum likelihood method.

作者信息

Müller Tobias, Spang Rainer, Vingron Martin

机构信息

Deutsches Krebsforschungszentrum, Theoretische Bioinformatik, Im Neuenheimer Feld 280, 69120 Heidelberg, Germany.

出版信息

Mol Biol Evol. 2002 Jan;19(1):8-13. doi: 10.1093/oxfordjournals.molbev.a003985.

DOI:10.1093/oxfordjournals.molbev.a003985

PMID:11752185

Abstract

Evolution of proteins is generally modeled as a Markov process acting on each site of the sequence. Replacement frequencies need to be estimated based on sequence alignments. Here we compare three approaches: First, the original method by Dayhoff, Schwartz, and Orcutt (1978) Atlas Protein Seq. Struc. 5:345-352, secondly, the resolvent method (RV) by Müller and Vingron (2000) J. Comput. Biol. 7(6):761-776, and finally a maximum likelihood approach (ML) developed in this paper. We evaluate the methods using a highly divergent and inhomogeneous set of sequence alignments as an input to the estimation procedure. ML is the method of choice for small sets of input data. Although the RV method is computationally much less demanding it performs only slightly worse than ML. Therefore, it is perfectly appropriate for large-scale applications.

摘要

蛋白质的进化通常被建模为作用于序列每个位点的马尔可夫过程。需要根据序列比对来估计替换频率。在此我们比较三种方法：第一，Dayhoff、Schwartz和Orcutt（1978年，《蛋白质序列结构图谱》第5卷：345 - 352页）提出的原始方法；第二，Müller和Vingron（2000年，《计算生物学杂志》第7卷第6期：761 - 776页）提出的预解式方法（RV）；最后是本文开发的最大似然方法（ML）。我们使用一组高度发散且不均匀的序列比对作为估计程序的输入来评估这些方法。对于少量输入数据，ML是首选方法。尽管RV方法在计算上要求低得多，但其性能仅比ML略差。因此，它非常适合大规模应用。

相似文献

Estimating amino acid substitution models: a comparison of Dayhoff's estimator, the resolvent approach and a maximum likelihood method.估计氨基酸替换模型：Dayhoff估计器、预解式方法与最大似然法的比较。

Mol Biol Evol. 2002 Jan;19(1):8-13. doi: 10.1093/oxfordjournals.molbev.a003985.

Modeling amino acid replacement.模拟氨基酸替换。

J Comput Biol. 2000;7(6):761-76. doi: 10.1089/10665270050514918.

Efficient methods for estimating amino acid replacement rates.估计氨基酸替换率的有效方法。

J Mol Evol. 2006 Jun;62(6):663-73. doi: 10.1007/s00239-004-0113-9. Epub 2006 Apr 28.

Scoredist: a simple and robust protein sequence distance estimator.Scoredist：一种简单且强大的蛋白质序列距离估计器。

BMC Bioinformatics. 2005 Apr 27;6:108. doi: 10.1186/1471-2105-6-108.

Pseudo-likelihood for non-reversible nucleotide substitution models with neighbour dependent rates.具有邻域依赖速率的不可逆核苷酸替换模型的伪似然度

Stat Appl Genet Mol Biol. 2006;5:Article18. doi: 10.2202/1544-6115.1217. Epub 2006 Jul 31.

Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计

BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.

Empirical models for substitution in ribosomal RNA.核糖体RNA中替代的经验模型。

Mol Biol Evol. 2004 Mar;21(3):419-27. doi: 10.1093/molbev/msh029. Epub 2003 Dec 5.

An amino acid substitution-selection model adjusts residue fitness to improve phylogenetic estimation.氨基酸替换选择模型调整残基适合度以改进系统发育估计。

Mol Biol Evol. 2014 Apr;31(4):779-92. doi: 10.1093/molbev/msu044. Epub 2014 Jan 16.

CodonTest: modeling amino acid substitution preferences in coding sequences.CodonTest：建模编码序列中氨基酸替换偏好。

PLoS Comput Biol. 2010 Aug 19;6(8):e1000885. doi: 10.1371/journal.pcbi.1000885.

Relating physicochemical properties of amino acids to variable nucleotide substitution patterns among sites.将氨基酸的物理化学性质与位点间可变的核苷酸替代模式相关联。

Pac Symp Biocomput. 2000:81-92.

引用本文的文献

The Historical Evolution and Significance of Multiple Sequence Alignment in Molecular Structure and Function Prediction.多重序列比对在分子结构与功能预测中的历史演变及意义

Biomolecules. 2024 Nov 29;14(12):1531. doi: 10.3390/biom14121531.

An FPGA-based hardware accelerator supporting sensitive sequence homology filtering with profile hidden Markov models.基于 FPGA 的硬件加速器，支持使用隐马尔可夫模型进行敏感序列同源性过滤。

BMC Bioinformatics. 2024 Jul 29;25(1):247. doi: 10.1186/s12859-024-05879-3.

Improved selection of canonical proteins for reference proteomes.改进用于参考蛋白质组的标准蛋白质选择。

NAR Genom Bioinform. 2024 Jun 8;6(2):lqae066. doi: 10.1093/nargab/lqae066. eCollection 2024 Jun.

New alignment method for remote protein sequences by the direct use of pairwise sequence correlations and substitutions.通过直接利用成对序列相关性和替换来对远程蛋白质序列进行新的比对方法。

Front Bioinform. 2023 Oct 12;3:1227193. doi: 10.3389/fbinf.2023.1227193. eCollection 2023.

PLoS One. 2023 Oct 26;18(10):e0293606. doi: 10.1371/journal.pone.0293606. eCollection 2023.

Identifying host-specific amino acid signatures for influenza A viruses using an adjusted entropy measure.利用调整后的熵度量方法鉴定流感 A 病毒的宿主特异性氨基酸特征。

BMC Bioinformatics. 2022 Aug 12;23(1):333. doi: 10.1186/s12859-022-04885-7.

Bridging the gaps in statistical models of protein alignment.填补蛋白质比对统计模型中的空白。

Bioinformatics. 2022 Jun 24;38(Suppl 1):i229-i237. doi: 10.1093/bioinformatics/btac246.

On the reliability and the limits of inference of amino acid sequence alignments.关于氨基酸序列比对的可靠性和推断限制。

Bioinformatics. 2022 Jun 24;38(Suppl 1):i255-i263. doi: 10.1093/bioinformatics/btac247.

Evolutionary and functional lessons from human-specific amino acid substitution matrices.来自人类特异性氨基酸替换矩阵的进化与功能启示。

NAR Genom Bioinform. 2021 Sep 16;3(3):lqab079. doi: 10.1093/nargab/lqab079. eCollection 2021 Sep.

New amino acid substitution matrix brings sequence alignments into agreement with structure matches.新的氨基酸替代矩阵使序列比对与结构匹配一致。

Proteins. 2021 Jun;89(6):671-682. doi: 10.1002/prot.26050. Epub 2021 Feb 2.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

估计氨基酸替换模型：Dayhoff估计器、预解式方法与最大似然法的比较。

Estimating amino acid substitution models: a comparison of Dayhoff's estimator, the resolvent approach and a maximum likelihood method.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献