IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):230-242. doi: 10.1109/TCBB.2021.3096455. Epub 2022 Feb 3.
The inference of disease transmission networks is an important problem in epidemiology. One popular approach for building transmission networks is to reconstruct a phylogenetic tree using sequences from disease strains sampled from infected hosts and infer transmissions based on this tree. However, most existing phylogenetic approaches for transmission network inference are highly computationally intensive and cannot take within-host strain diversity into account. Here, we introduce a new phylogenetic approach for inferring transmission networks, TNet, that addresses these limitations. TNet uses multiple strain sequences from each sampled host to infer transmissions and is simpler and more accurate than existing approaches. Furthermore, TNet is highly scalable and able to distinguish between ambiguous and unambiguous transmission inferences. We evaluated TNet on a large collection of 560 simulated transmission networks of various sizes and diverse host, sequence, and transmission characteristics, as well as on 10 real transmission datasets with known transmission histories. Our results show that TNet outperforms two other recently developed methods, phyloscanner and SharpTNI, that also consider within-host strain diversity. We also applied TNet to a large collection of SARS-CoV-2 genomes sampled from infected individuals in many countries around the world, demonstrating how our inference framework can be adapted to accurately infer geographical transmission networks. TNet is freely available from https://compbio.engr.uconn.edu/software/TNet/.
疾病传播网络的推断是流行病学中的一个重要问题。构建传播网络的一种流行方法是使用从感染宿主中采样的疾病菌株的序列构建系统发育树,并根据该树推断传播。然而,大多数现有的用于传播网络推断的系统发育方法计算量非常大,并且无法考虑宿主内菌株多样性。在这里,我们引入了一种新的用于推断传播网络的系统发育方法 TNet,该方法解决了这些限制。TNet 使用来自每个采样宿主的多个菌株序列来推断传播,并且比现有的方法更简单、更准确。此外,TNet 具有高度可扩展性,能够区分模糊和明确的传播推断。我们在一个包含 560 个大小不同、宿主、序列和传播特征多样的模拟传播网络的大型集合以及 10 个具有已知传播历史的真实传播数据集上评估了 TNet。我们的结果表明,TNet 优于另外两种最近开发的方法,phyloscanner 和 SharpTNI,它们也考虑了宿主内菌株多样性。我们还将 TNet 应用于从世界各地感染个体中采样的大量 SARS-CoV-2 基因组,展示了我们的推断框架如何能够适应准确推断地理传播网络。TNet 可从 https://compbio.engr.uconn.edu/software/TNet/ 免费获得。