Am J Epidemiol. 2022 Jan 1;191(1):198-207. doi: 10.1093/aje/kwab220.
Effect measure modification is often evaluated using parametric models. These models, although efficient when correctly specified, make strong parametric assumptions. While nonparametric models avoid important functional form assumptions, they often require larger samples to achieve a given accuracy. We conducted a simulation study to evaluate performance tradeoffs between correctly specified parametric and nonparametric models to detect effect modification of a binary exposure by both binary and continuous modifiers. We evaluated generalized linear models and doubly robust (DR) estimators, with and without sample splitting. Continuous modifiers were modeled with cubic splines, fractional polynomials, and nonparametric DR-learner. For binary modifiers, generalized linear models showed the greatest power to detect effect modification, ranging from 0.42 to 1.00 in the worst and best scenario, respectively. Augmented inverse probability weighting had the lowest power, with an increase of 23% when using sample splitting. For continuous modifiers, the DR-learner was comparable to flexible parametric models in capturing quadratic and nonlinear monotonic functions. However, for nonlinear, nonmonotonic functions, the DR-learner had lower integrated bias than splines and fractional polynomials, with values of 141.3, 251.7, and 209.0, respectively. Our findings suggest comparable performance between nonparametric and correctly specified parametric models in evaluating effect modification.
效应修正通常使用参数模型进行评估。这些模型在正确指定时虽然有效,但做出了很强的参数假设。虽然非参数模型避免了重要的函数形式假设,但它们通常需要更大的样本量才能达到给定的准确性。我们进行了一项模拟研究,以评估正确指定的参数和非参数模型之间的性能权衡,以检测二进制暴露对二进制和连续修饰符的效应修正。我们评估了广义线性模型和双重稳健(DR)估计量,包括有无样本分割。连续修饰符采用三次样条、分数多项式和非参数 DR 学习者进行建模。对于二进制修饰符,广义线性模型显示出最大的检测效应修正的能力,最差和最佳情况下分别为 0.42 到 1.00。增强逆概率加权的能力最低,使用样本分割时增加了 23%。对于连续修饰符,DR 学习者在捕捉二次和非线性单调函数方面与灵活的参数模型相当。然而,对于非线性、非单调函数,DR 学习者的综合偏差低于样条和分数多项式,分别为 141.3、251.7 和 209.0。我们的研究结果表明,在评估效应修正方面,非参数模型和正确指定的参数模型的性能相当。