Comparing evolutionary distances via adaptive distance functions.

Authors

Damti Yanir, Gronau Ilan, Moran Shlomo, Yavneh Irad

Affiliations

Computer Science Department, Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel.

Efi Arazi School of Computer Science, The Herzliya Interdisciplinary Center (IDC), P.O. Box 167, Herzliya 46150, Israel.

Publication information

J Theor Biol. 2018 Mar 7;440:88-99. doi: 10.1016/j.jtbi.2017.12.022. Epub 2017 Dec 23.

Abstract

Distance-based methods for phylogenetic reconstruction are based on a two-step approach: first, pairwise distances are computed from DNA sequences associated with a given set of taxa, and then these distances are used to reconstruct the phylogenetic relationships between taxa. Because the estimated distances are based on finite sequences, they are inherently noisy, and this noise may result in reconstruction errors. Previous attempts to improve reconstruction accuracy focused either on improving the robustness of reconstruction algorithms to this stochastic noise, or on improving the accuracy of the distance estimates. Here, we aim to further improve reconstruction accuracy by utilizing the basic observation that reconstruction algorithms are based on a series of comparisons between distances (or linear combinations of distances). We start by examining the relationship between the stochastic noise in the sequence data and the accuracy of the comparisons between pairwise distance estimates. This examination results in improved methods for distance comparison, which are shown to be as accurate as likelihood-based methods, while being much simpler and more efficient to compute. We then extend these methods to improve reconstruction accuracy of quartet trees, and examine some of the challenges moving forward.
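The two-step approach described in the abstract can be illustrated with a minimal sketch for the quartet case. This is not the adaptive distance method of the paper, whose details the abstract does not give. It is the standard baseline that such methods aim to improve on, under two stated assumptions: the Jukes-Cantor (JC69) correction is used for the pairwise distances, and the quartet is resolved with the four-point method, which picks the split with the smallest sum of within-pair distances. The function names `p_distance`, `jc69_distance`, and `resolve_quartet` are hypothetical and chosen for illustration only.

```python
import math


def p_distance(a: str, b: str) -> float:
    """Fraction of aligned sites at which two equal-length sequences differ."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(1 for x, y in zip(a, b) if x != y) / len(a)


def jc69_distance(a: str, b: str) -> float:
    """Step 1: estimate a pairwise distance (JC69 correction, an assumption here)."""
    p = p_distance(a, b)
    if p >= 0.75:
        # The correction is undefined at saturation; report an infinite distance.
        return math.inf
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def resolve_quartet(seqs: dict) -> tuple:
    """Step 2: four-point method. Compare the three sums of pairwise distances
    and return the split whose within-pair sum is smallest."""
    if len(seqs) != 4:
        raise ValueError("exactly four taxa are required")
    a, b, c, d = sorted(seqs)

    def dist(x, y):
        return jc69_distance(seqs[x], seqs[y])

    sums = {
        ((a, b), (c, d)): dist(a, b) + dist(c, d),
        ((a, c), (b, d)): dist(a, c) + dist(b, d),
        ((a, d), (b, c)): dist(a, d) + dist(b, c),
    }
    # The reconstruction is decided by comparisons between sums of noisy
    # distance estimates, which is where estimation error can flip the result.
    return min(sums, key=sums.get)


# Toy alignment (made-up data): A and B are close, C and D are close.
toy = {
    "A": "AAAAAAAAAAAAAAAAAAAA",
    "B": "AAAAAAAAAAAAAAAAAAAC",
    "C": "CCCCCAAAAAAAAAAAAAAA",
    "D": "CCCCCAAAAAAAAAAAAAAG",
}
split = resolve_quartet(toy)
```

With short sequences the three sums can lie close together, so sampling noise in the distance estimates may change which sum is smallest. That comparison step is the one the abstract identifies as the target for improvement.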
