1Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA.
2Department of Epidemiology, Harvard University, Boston, MA 02115, USA.
Microb Genom. 2018 Oct;4(10). doi: 10.1099/mgen.0.000217. Epub 2018 Oct 11.
Whole genome sequencing in conjunction with traditional epidemiology has been used to reconstruct transmission networks of Mycobacterium tuberculosis during outbreaks. Given its low mutation rate, genetic diversity within M. tuberculosis outbreaks can be extremely limited - making it difficult to determine precisely who transmitted to whom. In addition to consensus SNPs (cSNPs), examining heterogeneous alleles (hSNPs) has been proposed to improve resolution. However, few studies have examined the potential biases in detecting these hSNPs. Here, we analysed genome sequence data from 25 specimens from British Columbia, Canada. Specimens were sequenced to a depth of 112-296×. We observed biases in read depth, base quality, strand distribution and read placement where possible hSNPs were initially identified, so we applied conservative filters to reduce false positives. Overall, there was phylogenetic concordance between the observed 2542 cSNP and 63 hSNP loci. Furthermore, we identified hSNPs shared exclusively by epidemiologically linked patients, supporting their use in transmission inferences. We conclude that hSNPs may add resolution to transmission networks, particularly where the overall genetic diversity is low.
全基因组测序与传统流行病学相结合,已被用于重建结核分枝杆菌暴发期间的传播网络。鉴于其突变率低,结核分枝杆菌暴发内部的遗传多样性可能极其有限 - 这使得难以准确确定谁将病原体传播给了谁。除了共识单核苷酸多态性(cSNP)外,还提出了检查异质等位基因(hSNP)以提高分辨率。然而,很少有研究探讨检测这些 hSNP 时的潜在偏差。在这里,我们分析了来自加拿大不列颠哥伦比亚省的 25 个标本的基因组序列数据。标本的测序深度为 112-296×。我们观察到在最初鉴定可能的 hSNP 的位置存在读取深度、碱基质量、链分布和读取位置的偏差,因此我们应用保守的筛选方法来减少假阳性。总体而言,在观察到的 2542 个 cSNP 和 63 个 hSNP 基因座之间存在系统发育一致性。此外,我们还鉴定了仅在流行病学相关患者中共享的 hSNP,支持它们用于传播推断。我们的结论是,hSNP 可能会增加传播网络的分辨率,特别是在整体遗传多样性较低的情况下。