ESEI: Escuela Superior de Ingeniería Informática, University of Vigo, Edificio Politécnico, Campus Universitario As Lagoas s/n, 32004, Ourense, Spain.
CINBIO: Centro de Investigaciones Biomédicas, University of Vigo, Campus Universitario Lagoas-Marcosende, 36310, Vigo, Spain.
J Integr Bioinform. 2024 Jul 24;21(2). doi: 10.1515/jib-2023-0051. eCollection 2024 Jun 1.
When inferring the evolution of a gene or gene family, it is advisable to use all available coding sequences (CDSs) from as many species' genomes as possible, so that all gene duplications and losses can be inferred and dated. Nowadays, this means using hundreds or even thousands of CDSs, which makes the inferred phylogenetic trees difficult to visualize and interpret. It is therefore useful to have an automated way of collapsing large phylogenetic trees according to a taxonomic term chosen by the user (family, class, or order, for instance). Collapsing highlights the minimal set of sequences needed to recapitulate the full history of the gene or gene family at that taxonomic level; this set can then be refined using additional software. Here we present the Phylogenetic Tree Collapser (PTC) program (https://github.com/pegi3s/phylogenetic-tree-collapser), a flexible tool for automated tree collapsing using taxonomic information. It can be easily used by researchers without a background in informatics, since it only requires the installation of Docker, Podman, or Singularity. The utility of PTC is demonstrated by addressing the evolution of the ascorbic acid synthesis pathway in insects. A Docker image with PTC installed and ready to run is available at Docker Hub (https://hub.docker.com/r/pegi3s/phylogenetic-tree-collapser).
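The general idea of collapsing a tree by taxonomy can be illustrated with a minimal sketch. This is not PTC's implementation or interface; it only assumes a tree given as nested tuples and a hypothetical mapping from sequence names to a taxonomic label (here, insect orders). The sequence names, the `collapse` function, and the label format are all illustrative assumptions. Any clade whose leaves all share one label is replaced by a single summary node.

```python
from typing import Dict, List, Tuple, Union

# A tree is either a leaf (sequence name) or a tuple of subtrees.
Tree = Union[str, Tuple["Tree", ...]]


def leaves(tree: Tree) -> List[str]:
    """Return all leaf names under a node, left to right."""
    if isinstance(tree, str):
        return [tree]
    names: List[str] = []
    for child in tree:
        names.extend(leaves(child))
    return names


def collapse(tree: Tree, taxon_of: Dict[str, str]) -> Tree:
    """Replace every clade whose leaves share one taxon with a summary leaf.

    Single leaves are left untouched; mixed clades are processed recursively.
    """
    if isinstance(tree, str):
        return tree
    names = leaves(tree)
    taxa = {taxon_of[name] for name in names}
    if len(taxa) == 1:
        return f"{taxa.pop()} [{len(names)} seqs]"
    return tuple(collapse(child, taxon_of) for child in tree)


# Hypothetical sequence names and order-level labels, for illustration only.
taxon_of = {
    "dmel_1": "Diptera",
    "dsim_1": "Diptera",
    "aaeg_1": "Diptera",
    "amel_1": "Hymenoptera",
    "bter_1": "Hymenoptera",
    "tcas_1": "Coleoptera",
}
tree = ((("dmel_1", "dsim_1"), "aaeg_1"), (("amel_1", "bter_1"), "tcas_1"))

collapsed = collapse(tree, taxon_of)
print(collapsed)
```

In this toy case the three Diptera sequences and the two Hymenoptera sequences each collapse to one node, while the lone Coleoptera sequence is kept as is. A real tool would additionally need to parse tree files, resolve taxonomy from sequence names, and handle branch lengths and support values, none of which is attempted here.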