Graduate Group in Biostatistics, University of California, Berkeley, California.
Center for Computational Biology, University of California, Berkeley, California.
Biometrics. 2021 Dec;77(4):1241-1253. doi: 10.1111/biom.13375. Epub 2020 Sep 28.
The advent and subsequent widespread availability of preventive vaccines has altered the course of public health over the past century. Despite this success, effective vaccines to prevent many high-burden diseases, including human immunodeficiency virus (HIV), have been slow to develop. Vaccine development can be aided by the identification of immune response markers that serve as effective surrogates for clinically significant infection or disease endpoints. However, measuring immune response marker activity is often costly, which has motivated the usage of two-phase sampling for immune response evaluation in clinical trials of preventive vaccines. In such trials, the measurement of immunological markers is performed on a subset of trial participants, where enrollment in this second phase is potentially contingent on the observed study outcome and other participant-level information. We propose nonparametric methodology for efficiently estimating a counterfactual parameter that quantifies the impact of a given immune response marker on the subsequent probability of infection. Along the way, we fill in theoretical gaps pertaining to the asymptotic behavior of nonparametric efficient estimators in the context of two-phase sampling, including a multiple robustness property enjoyed by our estimators. Techniques for constructing confidence intervals and hypothesis tests are presented, and an open source software implementation of the methodology, the txshift R package, is introduced. We illustrate the proposed techniques using data from a recent preventive HIV vaccine efficacy trial.
在过去的一个世纪中,预防疫苗的出现和随后的广泛应用改变了公共卫生的进程。尽管取得了这一成功,但仍难以开发出有效的疫苗来预防许多高负担疾病,包括人类免疫缺陷病毒 (HIV)。可以通过识别免疫反应标志物来帮助疫苗开发,这些标志物可作为临床相关感染或疾病终点的有效替代指标。然而,测量免疫反应标志物的活性通常成本高昂,这促使人们在预防性疫苗临床试验中使用两阶段抽样来评估免疫反应。在这种试验中,对一部分试验参与者进行免疫标志物测量,其中第二期的入组可能取决于观察到的研究结果和其他参与者水平的信息。我们提出了一种非参数方法,用于有效地估计一个反事实参数,该参数量化了给定免疫反应标志物对随后感染概率的影响。在此过程中,我们填补了两阶段抽样背景下非参数有效估计量的渐近行为的理论空白,包括我们的估计量所具有的多重稳健性特性。我们还介绍了构建置信区间和假设检验的技术,并介绍了该方法的开源软件实现,即 txshift R 包。我们使用最近的预防性 HIV 疫苗功效试验的数据来说明所提出的技术。