ECSTRRA Team, INSERM U1153, Université Paris Cité, 1 avenue Claude Vellefaux, 75010, Paris, France.
ENT and head and neck surgery department, Lariboisiere hospital, 2 rue Ambroise Paré, 75010, Paris, France.
BMC Med Res Methodol. 2023 Oct 31;23(1):255. doi: 10.1186/s12874-023-02071-8.
Looking for treatment-by-subset interaction on a right-censored outcome based on observational data using propensity-score (PS) modeling is of interest. However, there are still issues regarding its implementation, notably when the subsets are very imbalanced in terms of prognostic features and treatment prevalence.
We conducted a simulation study to compare two main PS estimation strategies, performed either once on the whole sample ("across subset") or in each subset separately ("within subsets"). Several PS models and estimands are also investigated. We then illustrated those approaches on the motivating example, namely, evaluating the benefits of facial nerve resection in patients with parotid cancer in contact with the nerve, according to pretreatment facial palsy.
Our simulation study demonstrated that both strategies provide close results in terms of bias and variance of the estimated treatment effect, with a slight advantage for the "across subsets" strategy in very small samples, provided that interaction terms between the subset variable and other covariates influencing the choice of treatment are incorporated. PS matching without replacement resulted in biased estimates and should be avoided in the case of very imbalanced subsets.
When assessing heterogeneity in the treatment effect in small samples, the "across subsets" strategy of PS estimation is preferred. Then, either a PS matching with replacement or a weighting method must be used to estimate the average treatment effect in the treated or in the overlap population. In contrast, PS matching without replacement should be avoided in this setting.
在观察性数据的基础上,基于倾向评分(PS)模型寻找右删失结局的治疗亚组间交互作用很有意义。然而,在实施过程中仍然存在一些问题,尤其是在预后特征和治疗流行率方面,亚组非常不平衡的情况下。
我们进行了一项模拟研究,比较了两种主要的 PS 估计策略,一种是在整个样本上进行一次(“跨亚组”),另一种是分别在每个亚组中进行(“在亚组内”)。还研究了几种 PS 模型和估计量。然后,我们在动机示例中说明了这些方法,即根据术前面瘫评估接触神经的腮腺癌患者面神经切除术的获益。
我们的模拟研究表明,这两种策略在估计治疗效果的偏差和方差方面提供了相近的结果,在非常小的样本中,“跨亚组”策略略有优势,前提是包含了亚组变量与其他影响治疗选择的协变量之间的交互项。不进行替换的 PS 匹配会导致有偏估计,在亚组非常不平衡的情况下应避免使用。
在小样本中评估治疗效果的异质性时,应首选 PS 估计的“跨亚组”策略。然后,必须使用 PS 匹配(有替换)或加权方法来估计处理人群或重叠人群中的平均治疗效果。相比之下,在这种情况下,应避免不进行替换的 PS 匹配。