RDI, London, UK.
AIDS. 2011 Sep 24;25(15):1855-63. doi: 10.1097/QAD.0b013e328349a9c2.
The optimum selection and sequencing of combination antiretroviral therapy to maintain viral suppression can be challenging. The HIV Resistance Response Database Initiative has pioneered the development of computational models that predict the virological response to drug combinations. Here we describe the development and testing of random forest models to power an online treatment selection tool.
Five thousand, seven hundred and fifty-two treatment change episodes were selected to train a committee of 10 models to predict the probability of virological response to a new regimen. The input variables were antiretroviral treatment history, baseline CD4 cell count, viral load and genotype, drugs in the new regimen, time from treatment change to follow-up and follow-up viral load values. The models were assessed during cross-validation and with an independent set of 50 treatment change episodes by plotting receiver-operator characteristic curves and their performance compared with genotypic sensitivity scores from rules-based genotype interpretation systems.
The models achieved an area under the curve during cross-validation of 0.77-0.87 (mean = 0.82), accuracy of 72-81% (mean = 77%), sensitivity of 62-80% (mean = 67%) and specificity of 75-89% (mean = 81%). When tested with the 50 test cases, the area under the curve was 0.70-0.88, accuracy 64-82%, sensitivity 62-80% and specificity 68-95%. The genotypic sensitivity scores achieved an area under the curve of 0.51-0.52, overall accuracy of 54-56%, sensitivity of 43-64% and specificity of 41-73%.
The models achieved a consistent, high level of accuracy in predicting treatment responses, which was markedly superior to that of genotypic sensitivity scores. The models are being used to power an experimental system now available via the Internet.
选择和制定维持病毒抑制的最佳联合抗逆转录病毒治疗方案可能具有挑战性。HIV 耐药反应数据库倡议率先开发了预测药物组合病毒学反应的计算模型。本文介绍了开发和测试随机森林模型的情况,以为在线治疗选择工具提供支持。
选择 5752 次治疗转换事件来训练 10 个模型委员会,以预测新方案的病毒学反应概率。输入变量包括抗逆转录病毒治疗史、基线 CD4 细胞计数、病毒载量和基因型、新方案中的药物、从治疗转换到随访的时间以及随访病毒载量值。通过绘制接受者操作特征曲线并将其性能与基于规则的基因型解释系统的基因型敏感性评分进行比较,在交叉验证和 50 次治疗转换事件的独立集上评估模型。
模型在交叉验证中的曲线下面积为 0.77-0.87(平均值=0.82),准确性为 72%-81%(平均值=77%),敏感性为 62%-80%(平均值=67%),特异性为 75%-89%(平均值=81%)。在 50 个测试病例中进行测试时,曲线下面积为 0.70-0.88,准确性为 64%-82%,敏感性为 62%-80%,特异性为 68%-95%。基因型敏感性评分的曲线下面积为 0.51-0.52,总体准确性为 54%-56%,敏感性为 43%-64%,特异性为 41%-73%。
模型在预测治疗反应方面达到了一致的、高水平的准确性,明显优于基因型敏感性评分。该模型正在被用于为目前通过互联网提供的实验系统提供支持。