Suppr超能文献

一种用于汇总数百万物种的系统发育和分类信息的超级树管道。

A supertree pipeline for summarizing phylogenetic and taxonomic information for millions of species.

作者信息

Redelings Benjamin D, Holder Mark T

机构信息

Department of Biology, Duke University, Durham, NC, United States; Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, United States.

Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, United States; Biodiversity Institute, University of Kansas, Lawrence, KS, United States; Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.

出版信息

PeerJ. 2017 Mar 1;5:e3058. doi: 10.7717/peerj.3058. eCollection 2017.

Abstract

We present a new supertree method that enables rapid estimation of a summary tree on the scale of millions of leaves. This supertree method summarizes a collection of input phylogenies and an input taxonomy. We introduce formal goals and criteria for such a supertree to satisfy in order to transparently and justifiably represent the input trees. In addition to producing a supertree, our method computes annotations that describe which grouping in the input trees support and conflict with each group in the supertree. We compare our supertree construction method to a previously published supertree construction method by assessing their performance on input trees used to construct the Open Tree of Life version 4, and find that our method increases the number of displayed input splits from 35,518 to 39,639 and decreases the number of conflicting input splits from 2,760 to 1,357. The new supertree method also improves on the previous supertree construction method in that it produces no unsupported branches and avoids unnecessary polytomies. This pipeline is currently used by the Open Tree of Life project to produce all of the versions of project's "synthetic tree" starting at version 5. This software pipeline is called "". It relies heavily on ""-a set of C++ tools to perform most of the steps of the pipeline. All of the components are free software and are available on GitHub.

摘要

我们提出了一种新的超树方法,该方法能够在数百万个叶子的规模上快速估计一棵总结树。这种超树方法总结了一组输入系统发育树和一个输入分类法。我们引入了此类超树要满足的正式目标和标准,以便透明且合理地表示输入树。除了生成一棵超树外,我们的方法还计算注释,描述输入树中的哪些分组支持和与超树中的每个组冲突。我们通过在用于构建生命之树第4版的输入树上评估其性能,将我们的超树构建方法与先前发表的超树构建方法进行比较,发现我们的方法将显示的输入分裂数量从35,518增加到39,639,并将冲突的输入分裂数量从2,760减少到1,357。新的超树方法还在先前的超树构建方法上有所改进,即它不会产生无支持的分支并避免不必要的多歧分支。生命之树项目目前使用这个流程来生成从第5版开始的项目“综合树”的所有版本。这个软件流程被称为“”。它严重依赖于“”——一组C++工具来执行流程的大部分步骤。所有组件都是自由软件,可在GitHub上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd5/5335690/250501883449/peerj-05-3058-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验