School of Life Sciences and Department of Statistics, University of Warwick, United Kingdom.
Center for Computational Biology, Institute of Cancer and Genomic Sciences, University of Birmingham, United Kingdom.
Curr Protoc. 2021 Feb;1(2):e60. doi: 10.1002/cpz1.60.
Comparing the pathogen genomes from several cases of an infectious disease has the potential to help us understand and control outbreaks. Many methods exist to reconstruct a phylogeny from such genomes, which represents how the genomes are related to one another. However, such a phylogeny is not directly informative about transmission events between individuals. TransPhylo is a software tool implemented as an R package designed to bridge the gap between pathogen phylogenies and transmission trees. TransPhylo is based on a combined model of transmission between hosts and pathogen evolution within each host. It can simulate both phylogenies and transmission trees jointly under this combined model. TransPhylo can also reconstruct a transmission tree based on a dated phylogeny, by exploring the space of transmission trees compatible with the phylogeny. A transmission tree can be represented as a coloring of a phylogeny where each color represents a different host of the pathogen, and TransPhylo provides convenient ways to plot these colorings and explore the results. This article presents the basic protocols that can be used to make the most of TransPhylo. © 2021 The Authors.
Basic Protocol 1: First steps with TransPhylo
Basic Protocol 2: Simulation of outbreak data
Basic Protocol 3: Inference of transmission
Basic Protocol 4: Exploring the results of inference
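The idea of a transmission tree as a coloring of a phylogeny can be illustrated with a toy sketch. The Python below is not TransPhylo's API (TransPhylo is an R package); the node names, host labels, and both helper functions are hypothetical. It also simplifies the model by assigning colors to nodes and assuming at most one transmission per branch, whereas a transmission event may in general fall anywhere along a branch.

```python
from collections import defaultdict

# Toy phylogeny stored as child -> parent (None marks the root).
# Node names are hypothetical and chosen only for illustration.
parent = {"root": None, "n1": "root", "A": "n1", "B": "n1", "C": "root"}

# The coloring: each node is assigned the host in which that lineage resides.
host = {"root": "h1", "n1": "h1", "A": "h1", "B": "h2", "C": "h3"}


def transmission_pairs(parent, host):
    """Return (infector, infectee) pairs implied by color changes on branches."""
    pairs = set()
    for child, par in parent.items():
        if par is not None and host[par] != host[child]:
            pairs.add((host[par], host[child]))
    return sorted(pairs)


def is_valid_coloring(parent, host):
    """Check that each host's nodes form one connected part of the tree.

    In a rooted tree, a set of nodes is connected exactly when it has a single
    topmost node, i.e. one node whose parent is absent or colored differently.
    """
    entry_points = defaultdict(int)
    for node, par in parent.items():
        if par is None or host[par] != host[node]:
            entry_points[host[node]] += 1
    return all(count == 1 for count in entry_points.values())


pairs = transmission_pairs(parent, host)
valid = is_valid_coloring(parent, host)
print(pairs, valid)
```

In this toy example host h1 carries the root lineage and infects both h2 and h3, so the color changes on the branches leading to B and C correspond to the two transmission events.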