• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于加权密码子进化距离的系统发育推断。

Phylogenetic inference with weighted codon evolutionary distances.

作者信息

Criscuolo Alexis, Michel Christian J

机构信息

Equipe de Bioinformatique Théorique, LSIIT, FDBT (UMR CNRS-ULP 7005), Université de Strasbourg, Pôle API, Boulevard Sébastien Brant, 67400, Illkirch, France.

出版信息

J Mol Evol. 2009 Apr;68(4):377-92. doi: 10.1007/s00239-009-9212-y. Epub 2009 Mar 24.

DOI:10.1007/s00239-009-9212-y
PMID:19308635
Abstract

We develop a new approach to estimate a matrix of pairwise evolutionary distances from a codon-based alignment based on a codon evolutionary model. The method first computes a standard distance matrix for each of the three codon positions. Then these three distance matrices are weighted according to an estimate of the global evolutionary rate of each codon position and averaged into a unique distance matrix. Using a large set of both real and simulated codon-based alignments of nucleotide sequences, we show that this approach leads to distance matrices that have a significantly better treelikeness compared to those obtained by standard nucleotide evolutionary distances. We also propose an alternative weighting to eliminate the part of the noise often associated with some codon positions, particularly the third position, which is known to induce a fast evolutionary rate. Simulation results show that fast distance-based tree reconstruction algorithms on distance matrices based on this codon position weighting can lead to phylogenetic trees that are at least as accurate as, if not better, than those inferred by maximum likelihood. Finally, a well-known multigene dataset composed of eight yeast species and 106 codon-based alignments is reanalyzed and shows that our codon evolutionary distances allow building a phylogenetic tree which is similar to those obtained by non-distance-based methods (e.g., maximum parsimony and maximum likelihood) and also significantly improved compared to standard nucleotide evolutionary distance estimates.

摘要

我们开发了一种新方法,用于从基于密码子进化模型的密码子比对中估计成对进化距离矩阵。该方法首先为三个密码子位置中的每一个计算一个标准距离矩阵。然后,根据每个密码子位置的全局进化速率估计值对这三个距离矩阵进行加权,并将其平均为一个唯一的距离矩阵。使用大量真实和模拟的基于密码子的核苷酸序列比对,我们表明,与通过标准核苷酸进化距离获得的距离矩阵相比,这种方法得到的距离矩阵具有明显更好的树状相似性。我们还提出了一种替代加权方法,以消除通常与某些密码子位置(特别是第三位置,已知其进化速率较快)相关的部分噪声。模拟结果表明,基于这种密码子位置加权的距离矩阵上的快速基于距离的树重建算法可以生成至少与最大似然法推断的系统发育树一样准确(如果不是更准确)的系统发育树。最后,对一个由八个酵母物种和106个基于密码子的比对组成的著名多基因数据集进行了重新分析,结果表明,我们的密码子进化距离能够构建一个与通过非基于距离的方法(如最大简约法和最大似然法)获得的系统发育树相似的系统发育树,并且与标准核苷酸进化距离估计相比也有显著改进。

相似文献

1
Phylogenetic inference with weighted codon evolutionary distances.基于加权密码子进化距离的系统发育推断。
J Mol Evol. 2009 Apr;68(4):377-92. doi: 10.1007/s00239-009-9212-y. Epub 2009 Mar 24.
2
Evolutionary distances between nucleotide sequences based on the distribution of substitution rates among sites as estimated by parsimony.基于简约法估计的位点间替换率分布的核苷酸序列间的进化距离。
Mol Biol Evol. 1997 Mar;14(3):287-98. doi: 10.1093/oxfordjournals.molbev.a025764.
3
Toward extracting all phylogenetic information from matrices of evolutionary distances.从进化距离矩阵中提取所有系统发育信息。
Science. 2010 Mar 12;327(5971):1376-9. doi: 10.1126/science.1182300.
4
Phylogenetic Tree Estimation With and Without Alignment: New Distance Methods and Benchmarking.有比对和无比对情况下的系统发育树估计:新的距离方法与基准测试
Syst Biol. 2017 Mar 1;66(2):218-231. doi: 10.1093/sysbio/syw074.
5
Scoredist: a simple and robust protein sequence distance estimator.Scoredist:一种简单且强大的蛋白质序列距离估计器。
BMC Bioinformatics. 2005 Apr 27;6:108. doi: 10.1186/1471-2105-6-108.
6
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.
7
Fast NJ-like algorithms to deal with incomplete distance matrices.用于处理不完整距离矩阵的类似快速NJ的算法。
BMC Bioinformatics. 2008 Mar 26;9:166. doi: 10.1186/1471-2105-9-166.
8
Comparing evolutionary distances via adaptive distance functions.通过自适应距离函数比较进化距离。
J Theor Biol. 2018 Mar 7;440:88-99. doi: 10.1016/j.jtbi.2017.12.022. Epub 2017 Dec 23.
9
Evaluating the robustness of phylogenetic methods to among-site variability in substitution processes.评估系统发育方法对替换过程中位点间变异性的稳健性。
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):4013-21. doi: 10.1098/rstb.2008.0162.
10
On the quality of tree-based protein classification.论基于树的蛋白质分类的质量。
Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.

引用本文的文献

1
Toward a method for tracking virus evolutionary trajectory applied to the pandemic H1N1 2009 influenza virus.迈向一种应用于2009年大流行H1N1流感病毒的病毒进化轨迹追踪方法。
Infect Genet Evol. 2014 Dec;28:351-7. doi: 10.1016/j.meegid.2014.07.015. Epub 2014 Jul 24.
2
Impact of gene molecular evolution on phylogenetic reconstruction: a case study in the rosids (Superorder Rosanae, Angiosperms).基因分子进化对系统发育重建的影响:以蔷薇类植物(超目蔷薇超目,被子植物)为例的研究。
PLoS One. 2014 Jun 16;9(6):e99725. doi: 10.1371/journal.pone.0099725. eCollection 2014.
3
Combining distance matrices on identical taxon sets for multi-gene analysis with singular value decomposition.

本文引用的文献

1
The statistical sign test.统计符号检验。
J Am Stat Assoc. 1946 Dec;41(236):557-66. doi: 10.1080/01621459.1946.10501898.
2
Recovering evolutionary trees under a more realistic model of sequence evolution.在更现实的序列进化模型下恢复进化树。
Mol Biol Evol. 1994 Jul;11(4):605-12. doi: 10.1093/oxfordjournals.molbev.a040136.
3
Fast NJ-like algorithms to deal with incomplete distance matrices.用于处理不完整距离矩阵的类似快速NJ的算法。
通过奇异值分解将相同分类单元集上的距离矩阵合并用于多基因分析。
PLoS One. 2014 Apr 14;9(4):e94279. doi: 10.1371/journal.pone.0094279. eCollection 2014.
4
BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments.BMGE(基于信息熵的块映射与聚集):一种从多序列比对中选择系统发育信息区域的新软件。
BMC Evol Biol. 2010 Jul 13;10:210. doi: 10.1186/1471-2148-10-210.
BMC Bioinformatics. 2008 Mar 26;9:166. doi: 10.1186/1471-2105-9-166.
4
Using ESTs for phylogenomics: can one accurately infer a phylogenetic tree from a gappy alignment?利用ESTs进行系统发育基因组学研究:能否从有缺口的比对中准确推断系统发育树?
BMC Evol Biol. 2008 Mar 26;8:95. doi: 10.1186/1471-2148-8-95.
5
Mutation-selection models of codon substitution and their use to estimate selective strengths on codon usage.密码子替换的突变选择模型及其在估计密码子使用选择强度方面的应用。
Mol Biol Evol. 2008 Mar;25(3):568-79. doi: 10.1093/molbev/msm284. Epub 2008 Jan 3.
6
PAML 4: phylogenetic analysis by maximum likelihood.PAML 4:基于最大似然法的系统发育分析。
Mol Biol Evol. 2007 Aug;24(8):1586-91. doi: 10.1093/molbev/msm088. Epub 2007 May 4.
7
Codon phylogenetic distance.密码子系统发育距离。
Comput Biol Chem. 2007 Feb;31(1):36-43. doi: 10.1016/j.compbiolchem.2006.11.001. Epub 2007 Jan 25.
8
SDM: a fast distance-based approach for (super) tree building in phylogenomics.SDM:一种用于系统发育基因组学中(超)树构建的基于距离的快速方法。
Syst Biol. 2006 Oct;55(5):740-55. doi: 10.1080/10635150600969872.
9
Model use in phylogenetics: nine key questions.系统发育学中的模型应用:九个关键问题。
Trends Ecol Evol. 2007 Feb;22(2):87-94. doi: 10.1016/j.tree.2006.10.004. Epub 2006 Oct 17.
10
Neighbor-joining revealed.邻接法显示。
Mol Biol Evol. 2006 Nov;23(11):1997-2000. doi: 10.1093/molbev/msl072. Epub 2006 Jul 28.