Hieranoid: hierarchical orthology inference.

Affiliations

Stockholm Bioinformatics Center, Science for Life Laboratory, Box 1031, SE-17121 Solna, Sweden; Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden.

Stockholm Bioinformatics Center, Science for Life Laboratory, Box 1031, SE-17121 Solna, Sweden; Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden; Swedish e-Science Research Center, SE-10044 Stockholm, Sweden.

Publication information

J Mol Biol. 2013 Jun 12;425(11):2072-2081. doi: 10.1016/j.jmb.2013.02.018. Epub 2013 Feb 26.

Abstract

Accurate inference of orthologs is essential in many research fields, such as comparative genomics, molecular evolution, and genome annotation. Existing methods for genome-scale orthology inference are mostly based on all-versus-all similarity searches that scale quadratically with the number of species. This limits their application to the increasing number of available large-scale datasets. Here, we present Hieranoid, a new orthology inference method using a hierarchical approach. Hieranoid performs pairwise orthology analysis using InParanoid at each node in a guide tree as it progresses from the leaves to the root. This concept reduces the total runtime complexity from a quadratic to a linear function of the number of species. The tree hierarchy provides a natural structure for multi-species ortholog groups, and the aggregation of multiple sequences allows for multiple alignment similarity searching techniques, which can yield more accurate ortholog groups. On the recently published orthobench benchmark, Hieranoid showed the overall best performance. Our progressive approach presents a new way to infer orthologs that combines efficient graph-based methodology with aspects of compute-intensive tree-based methods. The linear scaling with the number of species is a major advantage for large-scale applications and makes Hieranoid well suited to cope with vast amounts of sequenced genomes in the future. Hieranoid is open source and can be downloaded at Hieranoid.sbc.su.se.
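
The scaling argument in the abstract can be illustrated with a small counting sketch. It is not Hieranoid's implementation: the guide tree, the species names, and the function names are hypothetical, and each internal node is treated as a single pairwise analysis (a stand-in for one InParanoid run). It counts only how many analyses are needed, not the cost of each one, which in practice depends on the sequences aggregated at that node.

```python
from itertools import combinations

# Hypothetical guide tree as nested tuples: a leaf is a species name (str),
# an internal node is a (left, right) pair of subtrees.
guide_tree = ((("human", "mouse"), "chicken"), ("fly", "worm"))


def leaves(tree):
    """Collect the species names at the leaves, left to right."""
    if isinstance(tree, str):
        return [tree]
    return leaves(tree[0]) + leaves(tree[1])


def count_hierarchical_analyses(tree):
    """Walk from the leaves to the root, counting one pairwise analysis
    per internal node. Returns (label_of_merged_node, analyses_so_far)."""
    if isinstance(tree, str):
        return tree, 0
    left_label, left_count = count_hierarchical_analyses(tree[0])
    right_label, right_count = count_hierarchical_analyses(tree[1])
    # The two children are compared once; the result acts as a
    # pseudo-species for the parent node (assumption for illustration).
    merged = f"({left_label},{right_label})"
    return merged, left_count + right_count + 1


def count_all_vs_all(species):
    """Number of species pairs in an all-versus-all design: n * (n - 1) / 2."""
    return len(list(combinations(species, 2)))


species = leaves(guide_tree)
root_label, hierarchical = count_hierarchical_analyses(guide_tree)
pairwise = count_all_vs_all(species)

print(f"species: {len(species)}")
print(f"hierarchical analyses (n - 1): {hierarchical}")
print(f"all-versus-all pairs (n(n-1)/2): {pairwise}")
```

For a binary guide tree with n leaves there are n - 1 internal nodes, so the number of pairwise analyses grows linearly with the number of species, while the number of species pairs in an all-versus-all design grows quadratically.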

