BC Centre for Excellence in HIV/AIDS, Vancouver, Canada.
AIDS. 2011 Oct 23;25(16):2019-26. doi: 10.1097/QAD.0b013e32834b643c.
To reconstruct dates of HIV infection by the coalescent analysis of longitudinal next-generation sequencing (NGS) data.
The coalescent predicts the time that has elapsed since the most recent common ancestor (MRCA) of a population. Because HIV tends to undergo severe bottlenecks upon transmission, the MRCA may be a good predictor of the time of infection. NGS provides an efficient means for performing large-scale clonal sequencing of HIV populations within patients, and the ideal raw material for coalescent analysis.
Baseline and follow-up plasma samples were obtained from 19 individuals enrolled into the Montréal Primary HIV Infection cohort. Dates of infection were initially estimated at baseline from nongenetic data (clinical and serological markers and patient questionnaires). HIV RNA was extracted and seven regions of the genome were amplified, subjected to parallel-tagged 454 pyrosequencing, and analyzed using the software package BEAST.
Mean estimates of the time to the MRCA per patient were significantly correlated with nongenetic estimates (Spearman's ρ = 0.65, P = 4.4 × 10(-3)). The median absolute difference between coalescent and nongenetic date estimates was smallest (median 29.4 days) for highly variable regions of the HIV genome such as env V3, and greater (median 114.9 days) for more conserved regions such as pol.
This application of NGS represents an important advancement, not only because accurate estimates of dates of infection can be derived retrospectively from archived specimens, but also because each analysis is patient-specific and, therefore, robust to variation in rates of HIV evolution.
通过对纵向下一代测序(NGS)数据的合并分析来重建 HIV 感染日期。
合并分析预测了自群体中最近共同祖先(MRCA)以来经过的时间。由于 HIV 在传播时往往会经历严重的瓶颈,因此 MRCA 可能是感染时间的良好预测指标。NGS 为在患者体内进行大规模 HIV 群体的克隆测序提供了有效的手段,是合并分析的理想原始材料。
从纳入蒙特利尔原发性 HIV 感染队列的 19 名个体中获得基线和随访血浆样本。最初通过非遗传数据(临床和血清学标志物以及患者问卷)在基线时估计感染日期。提取 HIV RNA 并扩增基因组的七个区域,进行平行标记的 454 焦磷酸测序,并使用 BEAST 软件包进行分析。
每位患者的 MRCA 时间的平均估计值与非遗传估计值显著相关(Spearman ρ=0.65,P=4.4×10(-3))。合并分析和非遗传日期估计之间的中位绝对差异最小(中位值为 29.4 天),对于 HIV 基因组中高度可变的区域,如 env V3,而对于更保守的区域,如 pol,则更大(中位值为 114.9 天)。
这项 NGS 的应用是一个重要的进展,不仅因为可以从存档标本中回溯地准确估计感染日期,而且因为每个分析都是针对特定患者的,因此对 HIV 进化率的变化具有鲁棒性。