Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil.
Genome Biol Evol. 2024 Oct 9;16(10). doi: 10.1093/gbe/evae229.
The assembly of a comprehensive and dated Tree of Life (ToL) remains one of the most formidable challenges in evolutionary biology. The complexity of life's history, involving both vertical and horizontal transmission of genetic information, defies its representation by a simple bifurcating phylogeny. With the advent of genome and metagenome sequencing, vast amounts of data have become available. However, employing this information for phylogeny and divergence time inference has introduced significant theoretical and computational hurdles. This perspective addresses some key methodological challenges in assembling the dated ToL, namely, the identification and classification of homologous genes, accounting for gene tree-species tree mismatch due to population-level processes along with duplication, loss, and horizontal gene transfer, and the accurate dating of evolutionary events. Ultimately, the success of this endeavor requires new approaches that integrate knowledge databases with optimized phylogenetic algorithms capable of managing complex evolutionary models.
组装一个全面且具有时间信息的生命之树(ToL)仍然是进化生物学中最具挑战性的任务之一。生命历史的复杂性涉及遗传信息的垂直和水平传递,这使得简单的二分叉系统发育树无法完全代表。随着基因组和宏基因组测序的出现,大量的数据已经可用。然而,将这些信息用于系统发育和分歧时间推断引入了重大的理论和计算障碍。本观点讨论了组装具有时间信息的生命之树所面临的一些关键方法学挑战,即同源基因的识别和分类,以及由于群体水平过程(包括复制、缺失和水平基因转移)导致的基因树-物种树不匹配的问题,还有进化事件的准确日期。最终,这一努力的成功需要将知识数据库与能够处理复杂进化模型的优化系统发育算法相结合的新方法。