Thangaraj Phyllis M, Shankar Sumukh Vasisht, Huang Sicong, Nadkarni Girish N, Mortazavi Bobak J, Oikonomou Evangelos K, Khera Rohan
Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA.
Department of Computer Science and Engineering, Texas A&M University, College Station, TX.
medRxiv. 2024 Sep 6:2024.03.25.24304868. doi: 10.1101/2024.03.25.24304868.
Randomized clinical trials (RCTs) are essential to guide medical practice; however, their generalizability to a given population is often uncertain. We developed a statistically informed Generative Adversarial Network (GAN) model, RCT-Twin-GAN, that leverages relationships between covariates and outcomes and generates a digital twin of an RCT (RCT-Twin) conditioned on covariate distributions from a second patient population. We used RCT-Twin-GAN to reproduce treatment effect outcomes of the Systolic Blood Pressure Intervention Trial (SPRINT) and the Action to Control Cardiovascular Risk in Diabetes (ACCORD) Blood Pressure Trial, which tested the same intervention but found different treatment effects. To demonstrate treatment effect estimates of each RCT conditioned on the other RCT's patient population, we evaluated the cardiovascular event-free survival of SPRINT digital twins conditioned on the ACCORD cohort and vice versa (ACCORD twins conditioned on SPRINT). The conditioned digital twins were balanced across intervention and control arms (mean absolute standardized mean difference (MASMD) of covariates between treatment arms 0.019 (SD 0.018), and the conditioned covariates of the SPRINT-Twin on ACCORD were more similar to ACCORD than SPRINT (MASMD 0.0082 SD 0.016 vs. 0.46 SD 0.20). Notably, across iterations, SPRINT conditioned ACCORD-Twin datasets reproduced the overall non-significant effect size seen in ACCORD (5-year cardiovascular outcome hazard ratio (95% confidence interval) of 0.88 (0.73-1.06) in ACCORD vs. median 0.87 (0.68-1.13) in the SPRINT conditioned ACCORD-Twin), while the ACCORD conditioned SPRINT-Twins reproduced the significant effect size seen in SPRINT (0.75 (0.64-0.89) vs. median 0.79 (0.72-0.86)) in the ACCORD conditioned SPRINT-Twin). Finally, we demonstrate the translation of this approach to real-world populations by conditioning the trials on an electronic health record population. Therefore, RCT-Twin-GAN simulates the direct translation of RCT-derived treatment effects across various patient populations.
随机临床试验(RCT)对于指导医疗实践至关重要;然而,其对特定人群的可推广性往往并不确定。我们开发了一种基于统计学的生成对抗网络(GAN)模型,即RCT-Twin-GAN,该模型利用协变量与结果之间的关系,并根据来自第二个患者群体的协变量分布生成RCT的数字孪生体(RCT-Twin)。我们使用RCT-Twin-GAN来重现收缩压干预试验(SPRINT)和糖尿病心血管风险控制行动(ACCORD)血压试验的治疗效果结果,这两项试验测试了相同的干预措施,但发现了不同的治疗效果。为了展示每项RCT在另一项RCT的患者群体条件下的治疗效果估计,我们评估了以ACCORD队列条件为基础的SPRINT数字孪生体的无心血管事件生存率,反之亦然(以SPRINT为条件的ACCORD孪生体)。条件数字孪生体在干预组和对照组之间是平衡的(治疗组之间协变量的平均绝对标准化均数差异(MASMD)为0.019(标准差0.018),并且以ACCORD为条件的SPRINT-Twin的条件协变量与ACCORD比与SPRINT更相似(MASMD为0.0082标准差0.016,而与SPRINT的0.46标准差0.20相比)。值得注意的是,在多次迭代中,以SPRINT为条件的ACCORD-Twin数据集重现了ACCORD中总体无显著差异的效应大小(ACCORD中5年心血管结局风险比(95%置信区间)为0.88(0.73 - 1.06),而在以SPRINT为条件的ACCORD-Twin中中位数为0.87(0.68 - 1.13)),而以ACCORD为条件的SPRINT-Twin重现了SPRINT中显著的效应大小(0.75(0.64 - 0.89),而在以ACCORD为条件的SPRINT-Twin中中位数为0.79(0.72 - 0.86))。最后,我们通过以电子健康记录人群为条件进行试验,展示了这种方法向现实世界人群的转化。因此,RCT-Twin-GAN模拟了RCT衍生的治疗效果在各种患者群体之间的直接转化。