Revell Andrew D, Wang Dechao, Perez-Elias Maria-Jesus, Wood Robin, Cogill Dolphina, Tempelman Hugo, Hamers Raph L, Reiss Peter, van Sighem Ard, Rehm Catherine A, Agan Brian, Alvarez-Uria Gerardo, Montaner Julio S G, Lane H Clifford, Larder Brendan A
The HIV Resistance Response Database Initiative (RDI), London, UK.
Instituto Ramón y Cajal de Investigación Sanitaria, Madrid, Spain.
J Antimicrob Chemother. 2021 Jun 18;76(7):1898-1906. doi: 10.1093/jac/dkab078.
With the goal of facilitating the use of HIV-TRePS to optimize therapy in settings with limited healthcare resources, we aimed to develop computational models to predict treatment responses accurately in the absence of commonly used baseline data.
Twelve sets of random forest models were trained using very large, global datasets to predict either the probability of virological response (classifier models) or the absolute change in viral load in response to a new regimen (absolute models) following virological failure. Two 'standard' models were developed with all baseline variables present and 10 others developed without HIV genotype, time on therapy, CD4 count or any combination of the above.
The standard classifier models achieved an AUC of 0.89 in cross-validation and independent testing. Models with missing variables achieved AUC values of 0.78-0.90. The standard absolute models made predictions that correlated significantly with observed changes in viral load with a mean absolute error of 0.65 log10 copies HIV RNA/mL in cross-validation and 0.69 log10 copies HIV RNA/mL in independent testing. Models with missing variables achieved values of 0.65-0.75 log10 copies HIV RNA/mL. All models identified alternative regimens that were predicted to be effective for the vast majority of cases where the new regimen prescribed in the clinic failed. All models were significantly better predictors of treatment response than genotyping with rules-based interpretation.
These latest models that predict treatment responses accurately, even when a number of baseline variables are not available, are a major advance with greatly enhanced potential benefit, particularly in resource-limited settings. The only obstacle to realizing this potential is the willingness of healthcare professions to use the system.
为促进在医疗资源有限的环境中使用HIV-TRePS优化治疗,我们旨在开发计算模型,以便在缺乏常用基线数据的情况下准确预测治疗反应。
使用非常大的全球数据集训练了12组随机森林模型,以预测病毒学失败后病毒学反应的概率(分类器模型)或对新方案的病毒载量绝对变化(绝对模型)。开发了两个包含所有基线变量的“标准”模型,以及另外10个不包含HIV基因型、治疗时间、CD4计数或上述任何组合的模型。
标准分类器模型在交叉验证和独立测试中的AUC为0.89。缺少变量的模型的AUC值为0.78 - 0.90。标准绝对模型的预测与观察到的病毒载量变化显著相关,交叉验证中的平均绝对误差为0.65 log10拷贝HIV RNA/mL,独立测试中的平均绝对误差为0.69 log10拷贝HIV RNA/mL。缺少变量的模型的值为0.65 - 0.75 log10拷贝HIV RNA/mL。所有模型都确定了替代方案,预计这些方案对临床上规定的新方案失败的绝大多数病例有效。所有模型在预测治疗反应方面都比基于规则解释的基因分型要好得多。
即使在一些基线变量不可用的情况下,这些最新模型仍能准确预测治疗反应,这是一项重大进展,具有极大增强的潜在益处,特别是在资源有限的环境中。实现这一潜力的唯一障碍是医疗专业人员使用该系统的意愿。