Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.
PLoS Comput Biol. 2020 Jul 22;16(7):e1007553. doi: 10.1371/journal.pcbi.1007553. eCollection 2020 Jul.
Phylogenetic profiling is a computational method to predict genes involved in the same biological process by identifying protein families which tend to be jointly lost or retained across the tree of life. Phylogenetic profiling has customarily been more widely used with prokaryotes than eukaryotes, because the method is thought to require many diverse genomes. There are now many eukaryotic genomes available, but these are considerably larger, and typical phylogenetic profiling methods require at least quadratic time as a function of the number of genes. We introduce a fast, scalable phylogenetic profiling approach entitled HogProf, which leverages hierarchical orthologous groups for the construction of large profiles and locality-sensitive hashing for efficient retrieval of similar profiles. We show that the approach outperforms Enhanced Phylogenetic Tree, a phylogeny-based method, and use the tool to reconstruct networks and query for interactors of the kinetochore complex as well as conserved proteins involved in sexual reproduction: Hap2, Spo11 and Gex1. HogProf enables large-scale phylogenetic profiling across the three domains of life, and will be useful to predict biological pathways among the hundreds of thousands of eukaryotic species that will become available in the coming few years. HogProf is available at https://github.com/DessimozLab/HogProf.
系统发生谱分析是一种通过识别倾向于在生命之树上共同丢失或保留的蛋白质家族来预测涉及相同生物过程的基因的计算方法。系统发生谱分析通常更广泛地用于原核生物而不是真核生物,因为该方法被认为需要许多不同的基因组。现在有许多真核生物基因组可用,但它们要大得多,并且典型的系统发生谱分析方法需要至少二次方时间作为基因数量的函数。我们引入了一种快速、可扩展的系统发生谱分析方法,称为 HogProf,它利用层次同源群来构建大型谱,并利用局部敏感哈希来高效检索相似谱。我们表明该方法优于基于系统发育的增强系统发生谱分析方法,并使用该工具来重建网络,并查询动粒复合物的相互作用因子以及参与有性生殖的保守蛋白:Hap2、Spo11 和 Gex1。HogProf 能够在生命的三个领域进行大规模的系统发生谱分析,并且将有助于预测未来几年内将可用的数十万种真核生物物种之间的生物途径。HogProf 可在 https://github.com/DessimozLab/HogProf 上获得。