Suchard Marc A, Weiss Robert E, Dorman Karin S, Sinsheimer Janet S
Department of Biomathematics, School of Medicine, University of California, Los Angeles, California 90095-1766, USA.
Syst Biol. 2002 Oct;51(5):715-28. doi: 10.1080/10635150290102384.
Current methods to identify recombination between subtypes of human immunodeficiency virus 1 (HIV-1) fall into a sequential testing trap, in which significance is assessed conditional on parental representative sequences and crossover points (COPs) that maximize the same test statistic. We overcame this shortfall by testing for recombination while inferring parental heritage and COPs using an extended Bayesian multiple change-point model. The model assumes that aligned molecular sequence data consist of an unknown number of contiguous segments that may support alternative topologies or varying evolutionary pressures. We allowed for heterogeneity in the substitution process and specifically tested for intersubtype recombination using Bayes factors. We also developed a new class of priors to assess significance across a wide range of support for recombination in the data. We applied our method to three putative gag gene recombinants. HIV-1 isolate RW024 decisively supported recombination with an inferred parental heritage of AD and a COP 95% Bayesian credible interval of (1,152, 1,178) using the HXB2 numbering scheme. HIV-1 isolate VI557 barely supported recombination. HIV-1 isolate RF decisively rejected recombination as expected, given that the sequence is commonly used as a reference sequence for subtype B. We employed scaled regeneration quantile plots to assess convergence and found this approach convenient to use even for our variable dimensional model parameter space.
目前用于识别人类免疫缺陷病毒1型(HIV-1)亚型间重组的方法陷入了一种顺序检验陷阱,即在以最大化相同检验统计量的亲本代表性序列和交叉点(COP)为条件评估显著性。我们通过在使用扩展贝叶斯多变化点模型推断亲本遗传和COP的同时检验重组来克服这一不足。该模型假设比对后的分子序列数据由数量未知的连续片段组成,这些片段可能支持不同的拓扑结构或不同的进化压力。我们考虑了替换过程中的异质性,并使用贝叶斯因子专门检验亚型间重组。我们还开发了一类新的先验来评估数据中对重组的广泛支持范围内的显著性。我们将我们的方法应用于三个假定的gag基因重组体。使用HXB2编号方案,HIV-1分离株RW024决定性地支持重组,推断的亲本遗传为AD,COP的95%贝叶斯可信区间为(1,152, 1,178)。HIV-1分离株VI557几乎不支持重组。鉴于HIV-1分离株RF的序列通常用作B亚型的参考序列,不出所料,它决定性地拒绝重组。我们使用缩放再生分位数图来评估收敛性,发现这种方法即使对于我们的可变维模型参数空间也很方便使用。