Nianogo Roch A, O'Neill Stephen, Inoue Kosuke
Department of Epidemiology, Fielding School of Public Health, University of California, Los Angeles (UCLA), USA.
California Center for Population Research, University of California, Los Angeles (UCLA), USA.
Stat Methods Med Res. 2025 Apr;34(4):648-662. doi: 10.1177/09622802251316969. Epub 2025 Feb 24.
There has been a renewed interest in identifying heterogenous treatment effects (HTEs) to guide personalized medicine. The objective was to illustrate the use of a step-by-step transparent parametric data-adaptive approach (the generalized HTE approach) based on the G-computation algorithm to detect heterogenous subgroups and estimate meaningful conditional average treatment effects (CATE). The following seven steps implement the generalized HTE approach: Step 1: Select variables that satisfy the backdoor criterion and potential effect modifiers; Step 2: Specify a flexible saturated model including potential confounders and effect modifiers; Step 3: Apply a selection method to reduce overfitting; Step 4: Predict potential outcomes under treatment and no treatment; Step 5: Contrast the potential outcomes for each individual; Step 6: Fit cluster modeling to identify potential effect modifiers; Step 7: Estimate subgroup CATEs. We illustrated the use of this approach using simulated and real data. Our generalized HTE approach successfully identified HTEs and subgroups defined by all effect modifiers using simulated and real data. Our study illustrates that it is feasible to use a step-by-step parametric and transparent data-adaptive approach to detect effect modifiers and identify meaningful HTEs in an observational setting. This approach should be more appealing to epidemiologists interested in explanation.
为指导个性化医疗,人们对识别异质性治疗效果(HTE)重新产生了兴趣。目的是说明如何使用基于G计算算法的逐步透明参数数据自适应方法(广义HTE方法)来检测异质性亚组并估计有意义的条件平均治疗效果(CATE)。广义HTE方法通过以下七个步骤实现:步骤1:选择满足后门标准和潜在效应修饰因素的变量;步骤2:指定一个灵活的饱和模型,包括潜在混杂因素和效应修饰因素;步骤3:应用选择方法以减少过度拟合;步骤4:预测治疗和未治疗情况下的潜在结果;步骤5:对比每个个体的潜在结果;步骤6:进行聚类建模以识别潜在效应修饰因素;步骤7:估计亚组CATE。我们使用模拟数据和真实数据说明了该方法的应用。我们的广义HTE方法使用模拟数据和真实数据成功识别了由所有效应修饰因素定义的HTE和亚组。我们的研究表明,在观察性研究中,使用逐步参数化且透明的数据自适应方法来检测效应修饰因素并识别有意义的HTE是可行的。这种方法应该会对热衷于解释的流行病学家更具吸引力。