CHU de Toulouse, Hôpital Purpan, Laboratoire de Virologie.
INSERM U1043-CNRS UMR 5282-Toulouse University Paul Sabatier, CPTP, Toulouse F-31300, France.
Bioinformatics. 2020 Jan 15;36(2):416-421. doi: 10.1093/bioinformatics/btz585.
The circulating recombinant form of HIV-1 CRF02-AG is the most frequent non-B subtype in Europe. Anti-HIV therapy and pathophysiological studies on the impact of HIV-1 tropism require genotypic determination of HIV-1 tropism for non-B subtypes. But genotypic approaches based on analysis of the V3 envelope region perform poorly when used to determine the tropism of CRF02-AG. We, therefore, designed an algorithm based on information from the gp120 and gp41 ectodomain that better predicts the tropism of HIV-1 subtype CRF02-AG.
We used a bio-statistical method to identify the genotypic determinants of CRF02-AG coreceptor use. Toulouse HIV Extended Tropism Algorithm (THETA), based on a Least Absolute Shrinkage and Selection Operator method, uses HIV envelope sequence from phenotypically characterized clones. Prediction of R5X4/X4 viruses was 86% sensitive and that of R5 viruses was 89% specific with our model. The overall accuracy of THETA was 88%, making it sufficiently reliable for predicting the tropism of subtype CRF02-AG sequences.
Binaries are freely available for download at https://github.com/viro-tls/THETA. It was implemented in Matlab and supported on MS Windows platform. The sequence data used in this work are available from GenBank under the accession numbers MK618182-MK618417.
HIV-1 CRF02-AG 的循环重组形式是欧洲最常见的非 B 亚型。抗 HIV 治疗和 HIV-1 嗜性对病理生理的研究需要对非 B 亚型的 HIV-1 嗜性进行基因分型确定。但是,基于分析 V3 包膜区的基因分型方法在用于确定 CRF02-AG 的嗜性时表现不佳。因此,我们设计了一种基于 gp120 和 gp41 外显子区信息的算法,该算法可以更好地预测 HIV-1 亚型 CRF02-AG 的嗜性。
我们使用生物统计学方法来确定 CRF02-AG 辅助受体使用的基因决定因素。基于最小绝对值收缩和选择算子(LASSO)方法的图卢兹 HIV 扩展嗜性算法(THETA),使用表型特征克隆的 HIV 包膜序列。我们的模型对 R5X4/X4 病毒的预测敏感性为 86%,对 R5 病毒的预测特异性为 89%。THETA 的总体准确性为 88%,足以可靠地预测 CRF02-AG 亚型序列的嗜性。
二进制文件可在 https://github.com/viro-tls/THETA 上免费下载。它是用 Matlab 实现的,支持 MS Windows 平台。本工作中使用的序列数据可从 GenBank 获得,登录号为 MK618182-MK618417。