使用生物体数据集对多位点系统发育分析的超级矩阵和超级树方法进行比较。

A comparison of supermatrix and supertree methods for multilocus phylogenetics using organismal datasets.

作者信息

Janies Daniel A, Studer Jonathon, Handelman Samuel K, Linchangco Gregorio

机构信息

Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, 9201 University City Blvd, Charlotte, NC, 28223, USA.

Case Western Reserve University School of Law, 11075 East Boulevard, Cleveland, OH, 44106, USA.

出版信息

Cladistics. 2013 Oct;29(5):560-566. doi: 10.1111/cla.12014. Epub 2013 Feb 18.

DOI:10.1111/cla.12014

PMID:34798766

Abstract

It has been proposed that supertree approaches should be applied to large multilocus datasets to achieve computational tractability. Large datasets such as those derived from phylogenomics studies can be broken into many locus-specific tree searches and the resulting trees can be stitched together via a supertree method. Using simulated data, workers have reported that they can rapidly construct a supertree that is comparable to the results of heuristic tree search on the entire dataset. To test this assertion with organismal data, we compare tree length under the parsimony criterion and computational time for 20 multilocus datasets using supertree (SuperFine and SuperTriplets) and supermatrix (heuristic search in TNT) approaches. Tree length and computational times were compared among methods using the Wilcoxon matched-pairs signed rank test. Supermatrix searches produced significantly shorter trees than either supertree approach (SuperFine or SuperTriplets; P < 0.0002 in both cases). Moreover, the processing time of supermatrix search was significantly lower than SuperFine+locus-specific search (P < 0.01) but roughly equivalent to that of SuperTriplets+locus-specific search (P > 0.4, not significant). In conclusion, we show by using real rather than simulated data that there is no basis, either in time tractability or in tree length, for use of supertrees over heuristic tree search using a supermatrix for phylogenomics.

摘要

有人提出，应将超树方法应用于大型多位点数据集，以实现计算的可处理性。诸如从系统发育基因组学研究中获得的大型数据集，可以分解为许多特定基因座的树搜索，然后通过超树方法将得到的树拼接在一起。使用模拟数据，研究人员报告说，他们可以快速构建一棵超树，其结果与对整个数据集进行启发式树搜索的结果相当。为了用生物数据检验这一断言，我们使用超树（SuperFine和SuperTriplets）和超矩阵（TNT中的启发式搜索）方法，比较了20个多位点数据集在简约标准下的树长和计算时间。使用Wilcoxon配对符号秩检验比较了不同方法之间的树长和计算时间。超矩阵搜索产生的树明显比任何一种超树方法（SuperFine或SuperTriplets）都短（两种情况下P均<0.0002）。此外，超矩阵搜索的处理时间明显低于SuperFine+特定基因座搜索（P<0.01），但与SuperTriplets+特定基因座搜索大致相当（P>0.4，不显著）。总之，我们通过使用真实而非模拟数据表明，在系统发育基因组学中，无论是在时间可处理性还是树长方面，使用超树而非使用超矩阵进行启发式树搜索都没有依据。

相似文献

A comparison of supermatrix and supertree methods for multilocus phylogenetics using organismal datasets.

Cladistics. 2013 Oct;29(5):560-566. doi: 10.1111/cla.12014. Epub 2013 Feb 18.

Complete generic-level phylogenetic analyses of palms (Arecaceae) with comparisons of supertree and supermatrix approaches.

Syst Biol. 2009 Apr;58(2):240-56. doi: 10.1093/sysbio/syp021. Epub 2009 May 30.

Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm.

Mol Biol Evol. 2017 Sep 1;34(9):2408-2421. doi: 10.1093/molbev/msx191.

MRL and SuperFine+MRL: new supertree methods.

Algorithms Mol Biol. 2012 Jan 26;7(1):3. doi: 10.1186/1748-7188-7-3.

BCD Beam Search: considering suboptimal partial solutions in Bad Clade Deletion supertrees.

PeerJ. 2018 Jun 8;6:e4987. doi: 10.7717/peerj.4987. eCollection 2018.

Comparative performance of supertree algorithms in large data sets using the soapberry family (Sapindaceae) as a case study.

Syst Biol. 2011 Jan;60(1):32-44. doi: 10.1093/sysbio/syq057. Epub 2010 Nov 10.

SuperFine: fast and accurate supertree estimation.

Syst Biol. 2012 Mar;61(2):214-27. doi: 10.1093/sysbio/syr092. Epub 2011 Sep 20.

SuperTriplets: a triplet-based supertree approach to phylogenomics.

Bioinformatics. 2010 Jun 15;26(12):i115-23. doi: 10.1093/bioinformatics/btq196.

A simulation study comparing supertree and combined analysis methods using SMIDGen.

Algorithms Mol Biol. 2010 Jan 4;5:8. doi: 10.1186/1748-7188-5-8.

The impact of HGT on phylogenomic reconstruction methods.

Brief Bioinform. 2014 Jan;15(1):79-90. doi: 10.1093/bib/bbs050. Epub 2012 Aug 20.

引用本文的文献

Compensatory Base Changes in ITS2 Secondary Structure Alignment, Modelling, and Molecular Phylogeny: An Integrated Approach to Improve Species Delimitation in (Basidiomycota).

J Fungi (Basel). 2023 Aug 31;9(9):894. doi: 10.3390/jof9090894.

Phylogenies from unaligned proteomes using sequence environments of amino acid residues.

Sci Rep. 2022 May 6;12(1):7497. doi: 10.1038/s41598-022-11370-x.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用生物体数据集对多位点系统发育分析的超级矩阵和超级树方法进行比较。

A comparison of supermatrix and supertree methods for multilocus phylogenetics using organismal datasets.

作者信息

Janies Daniel A, Studer Jonathon, Handelman Samuel K, Linchangco Gregorio

机构信息

Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, 9201 University City Blvd, Charlotte, NC, 28223, USA.

Case Western Reserve University School of Law, 11075 East Boulevard, Cleveland, OH, 44106, USA.

出版信息

Cladistics. 2013 Oct;29(5):560-566. doi: 10.1111/cla.12014. Epub 2013 Feb 18.

DOI:10.1111/cla.12014

PMID:34798766

Abstract

摘要

使用生物体数据集对多位点系统发育分析的超级矩阵和超级树方法进行比较。

A comparison of supermatrix and supertree methods for multilocus phylogenetics using organismal datasets.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

使用生物体数据集对多位点系统发育分析的超级矩阵和超级树方法进行比较。

A comparison of supermatrix and supertree methods for multilocus phylogenetics using organismal datasets.

作者信息

机构信息

出版信息

相似文献

引用本文的文献