RAND Corporation, 1776 Main St. Santa Monica, CA, 90401, USA.
Univ. Bordeaux, INSERM, INRIA, BPH, U1219, SISTM, F-33000 Bordeaux, France and Vaccine Research Institute, F-94000 Créteil, France.
Biostatistics. 2023 Oct 18;24(4):985-999. doi: 10.1093/biostatistics/kxac020.
When evaluating the effectiveness of a treatment, policy, or intervention, the desired measure of efficacy may be expensive to collect, not routinely available, or may take a long time to occur. In these cases, it is sometimes possible to identify a surrogate outcome that can more easily, quickly, or cheaply capture the effect of interest. Theory and methods for evaluating the strength of surrogate markers have been well studied in the context of a single surrogate marker measured in the course of a randomized clinical study. However, methods are lacking for quantifying the utility of surrogate markers when the dimension of the surrogate grows. We propose a robust and efficient method for evaluating a set of surrogate markers that may be high-dimensional. Our method does not require treatment to be randomized and may be used in observational studies. Our approach draws on a connection between quantifying the utility of a surrogate marker and the most fundamental tools of causal inference-namely, methods for robust estimation of the average treatment effect. This connection facilitates the use of modern methods for estimating treatment effects, using machine learning to estimate nuisance functions and relaxing the dependence on model specification. We demonstrate that our proposed approach performs well, demonstrate connections between our approach and certain mediation effects, and illustrate it by evaluating whether gene expression can be used as a surrogate for immune activation in an Ebola study.
在评估治疗、政策或干预措施的有效性时,可能需要花费高昂的成本、无法常规获得或需要很长时间才能获得理想的疗效测量指标。在这些情况下,有时可以确定一个替代终点,该终点可以更轻松、快速或廉价地捕捉到感兴趣的效果。在随机临床试验中测量单个替代终点的情况下,已经对评估替代标志物强度的理论和方法进行了深入研究。但是,当替代终点的维度增加时,量化替代标志物效用的方法却缺乏。我们提出了一种稳健且高效的方法来评估一组可能是高维的替代标志物。我们的方法不需要对治疗进行随机分组,并且可以在观察性研究中使用。我们的方法基于量化替代标志物效用与因果推断最基本工具之间的联系,即用于稳健估计平均治疗效果的方法。这种联系便于使用现代方法估计治疗效果,使用机器学习来估计干扰函数,并放宽对模型规范的依赖。我们证明了我们提出的方法表现良好,演示了我们的方法与某些中介效应之间的联系,并通过评估基因表达是否可以在埃博拉研究中用作免疫激活的替代指标来说明该方法。