Rhee Soo-Yon, Magalis Brittany R, Hurley Leo, Silverberg Michael J, Marcus Julia L, Slome Sally, Kosakovsky Pond Sergei L, Shafer Robert W
Division of Infectious Diseases, Department of Medicine, Stanford University, California.
Department of Biology, Temple University, Philadelphia, Pennsylvania.
Open Forum Infect Dis. 2019 Mar 14;6(4):ofz135. doi: 10.1093/ofid/ofz135. eCollection 2019 Apr.
Recent advances in high-throughput molecular epidemiology are transforming the analysis of viral infections.
Human immunodeficiency virus (HIV)-1 sequences from a Northern Californian cohort (NCC) of 4553 antiretroviral-naive individuals sampled between 1998 and 2016 were analyzed together with 140 000 previously published global sequences. The HIV-TRAnsmission Cluster Engine (HIV-TRACE) was used to infer a transmission network comprising links between NCC and previously published sequences having a genetic distance ≤1.5%.
Twenty-five percent of NCC sequences were included in 264 clusters linked to a published sequence, and approximately one third of these (8.0% of the total) were linked to 1 or more non-US sequences. The largest cluster, containing 512 NCC sequences (11.2% of the total), comprised the subtype B lineage that traced its origin to the earliest North American sequences. Approximately 5 percent of NCC sequences belonged to a non-B subtype, and these were more likely to cluster with a non-US sequence. Twenty-two NCC sequences belonged to 1 of 4 large clusters containing sequences from rapidly growing regional epidemics: CRF07_BC (East Asia), subtype A6 (former Soviet Union), a Japanese subtype B lineage, and an East/Southeast Asian CRF01_AE lineage. Bayesian phylogenetics suggested that most non-B sequences resulted from separate introductions but that local spread within the largest CRF01_AE cluster occurred twice.
The NCC contains national and international links to previously published sequences including many to the subtype B strain that originated in North America and several to rapidly growing Asian epidemics. Despite their rapid regional growth, the Asian epidemic strains demonstrated limited NCC spread.
高通量分子流行病学的最新进展正在改变对病毒感染的分析。
对1998年至2016年间从北加利福尼亚队列(NCC)中抽取的4553名未接受抗逆转录病毒治疗个体的人类免疫缺陷病毒(HIV)-1序列,与140000条先前发表的全球序列一起进行分析。使用HIV传播聚类引擎(HIV-TRACE)推断一个传播网络,该网络包括NCC序列与遗传距离≤1.5%的先前发表序列之间的联系。
25%的NCC序列被纳入与已发表序列相关的264个聚类中,其中约三分之一(占总数的8.0%)与1个或多个非美国序列相关。最大的聚类包含512条NCC序列(占总数的11.2%),由B亚型谱系组成,其起源可追溯到最早的北美序列。约5%的NCC序列属于非B亚型,且这些序列更有可能与非美国序列聚类。22条NCC序列属于4个大型聚类中的1个,这些大型聚类包含来自快速增长的地区性流行的序列:CRF07_BC(东亚)、A6亚型(前苏联)、日本B亚型谱系以及东亚/东南亚CRF01_AE谱系。贝叶斯系统发育分析表明,大多数非B序列是由单独引入导致的,但在最大的CRF01_AE聚类中发生了两次局部传播。
NCC包含与先前发表序列的国内和国际联系,包括许多与起源于北美的B亚型毒株的联系,以及与快速增长的亚洲流行毒株的若干联系。尽管亚洲流行毒株在区域内快速增长,但其在NCC中的传播有限。