Montoya Vincent, Olmstead Andrea, Tang Patrick, Cook Darrel, Janjua Naveed, Grebely Jason, Jacka Brendan, Poon Art F Y, Krajden Mel
BC Centre for Disease Control, Vancouver, BC, Canada; Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, BC, Canada.
Sidra Medical and Research Center, Doha, Qatar.
Infect Genet Evol. 2016 Sep;43:329-37. doi: 10.1016/j.meegid.2016.06.015. Epub 2016 Jun 6.
Effective surveillance and treatment strategies are required to control the hepatitis C virus (HCV) epidemic. Phylogenetic analyses are powerful tools for reconstructing the evolutionary history of viral outbreaks and identifying transmission clusters. These studies often rely on Sanger sequencing which typically generates a single consensus sequence for each infected individual. For rapidly mutating viruses such as HCV, consensus sequencing underestimates the complexity of the viral quasispecies population and could therefore generate different phylogenetic tree topologies. Although deep sequencing provides a more detailed quasispecies characterization, in-depth phylogenetic analyses are challenging due to dataset complexity and computational limitations. Here, we apply deep sequencing to a characterized population to assess its ability to identify phylogenetic clusters compared with consensus Sanger sequencing. For deep sequencing, a sample specific threshold determined by the 50th percentile of the patristic distance distribution for all variants within each individual was used to identify clusters. Among seven patristic distance thresholds tested for the Sanger sequence phylogeny ranging from 0.005-0.06, a threshold of 0.03 was found to provide the maximum balance between positive agreement (samples in a cluster) and negative agreement (samples not in a cluster) relative to the deep sequencing dataset. From 77 HCV seroconverters, 10 individuals were identified in phylogenetic clusters using both methods. Deep sequencing analysis identified an additional 4 individuals and excluded 8 other individuals relative to Sanger sequencing. The application of this deep sequencing approach could be a more effective tool to understand onward HCV transmission dynamics compared with Sanger sequencing, since the incorporation of minority sequence variants improves the discrimination of phylogenetically linked clusters.
需要有效的监测和治疗策略来控制丙型肝炎病毒(HCV)的流行。系统发育分析是重建病毒爆发的进化历史和识别传播集群的有力工具。这些研究通常依赖于桑格测序,该方法通常为每个受感染个体生成一个单一的共识序列。对于像HCV这样快速变异的病毒,共识测序低估了病毒准种群体的复杂性,因此可能会产生不同的系统发育树拓扑结构。尽管深度测序提供了更详细的准种特征,但由于数据集的复杂性和计算限制,深入的系统发育分析具有挑战性。在这里,我们将深度测序应用于一个特征明确的群体,以评估其与桑格测序共识相比识别系统发育集群的能力。对于深度测序,使用由每个个体内所有变体的简约距离分布的第50百分位数确定的样本特定阈值来识别集群。在针对桑格序列系统发育测试的0.005-0.06的七个简约距离阈值中,发现0.03的阈值相对于深度测序数据集在阳性一致性(集群中的样本)和阴性一致性(不在集群中的样本)之间提供了最大平衡。在77名HCV血清转化者中,两种方法均在系统发育集群中鉴定出10名个体。与桑格测序相比,深度测序分析还额外鉴定出4名个体,并排除了8名其他个体。与桑格测序相比,这种深度测序方法的应用可能是一种更有效的工具,用于了解HCV的后续传播动态,因为纳入少数序列变体提高了对系统发育相关集群的区分能力。