Boatman Jeffrey A, Vock David M, Koopmeiners Joseph S, Donny Eric C
Division of Biostatistics, University of Minnesota, A460 Mayo Building, MMC 303 420 Delaware St. SE, Minneapolis, MN 55455, USA
Department of Psychology, University of Pittsburgh, 4119 Sennott Square 210 S. Bouquet St., Pittsburgh, PA 15260, USA.
Biostatistics. 2018 Jan 1;19(1):103-118. doi: 10.1093/biostatistics/kxx029.
Noncompliance or non-adherence to randomized treatment is a common challenge when interpreting data from randomized clinical trials. The effect of an intervention if all participants were forced to comply with the assigned treatment (i.e., the causal effect) is often of primary scientific interest. For example, in trials of very low nicotine content (VLNC) cigarettes, policymakers are interested in their effect on smoking behavior if their use were to be compelled by regulation. A variety of statistical methods to estimate the causal effect of an intervention have been proposed, but these methods, including inverse probability of compliance weighted (IPCW) estimators, assume that participants' compliance statuses are reported without error. This is an untenable assumption when compliance is based on self-report. Biomarkers (e.g., nicotine levels in the urine) may provide more reliable indicators of compliance but cannot perfectly discriminate between compliers and non-compliers. However, by modeling the distribution of the biomarker as a mixture distribution and writing the probability of compliance as a function of the mixture components, we show how the probability of compliance can be directly estimated from the data even when compliance status is unknown. To estimate the causal effect, we develop a new approach which re-weights participants by the product of their probability of compliance given the observed data and the inverse probability of compliance given confounders. We show that our proposed estimator is consistent and asymptotically normal and show that in some situations the proposed approach is more efficient than standard IPCW estimators. We demonstrate via simulation that the proposed estimator achieves smaller bias and greater efficiency than ad hoc approaches to estimating the causal effect when compliance is measured with error. We apply our method to data from a recently completed randomized trial of VLNC cigarettes.
在解读随机临床试验数据时,不依从或不坚持随机分配的治疗是一个常见的挑战。如果所有参与者都被迫遵守分配的治疗(即因果效应),那么干预措施的效果通常是主要的科学关注点。例如,在极低尼古丁含量(VLNC)香烟的试验中,政策制定者感兴趣的是,如果通过监管强制使用这些香烟,它们对吸烟行为的影响。已经提出了多种估计干预因果效应的统计方法,但这些方法,包括依从性加权逆概率(IPCW)估计器,都假设参与者的依从状态报告无误。当依从性基于自我报告时,这是一个站不住脚的假设。生物标志物(例如尿液中的尼古丁水平)可能提供更可靠的依从性指标,但不能完美地区分依从者和不依从者。然而,通过将生物标志物的分布建模为混合分布,并将依从概率写为混合成分的函数,我们展示了即使依从状态未知,也可以直接从数据中估计依从概率。为了估计因果效应,我们开发了一种新方法,该方法根据给定观察数据的依从概率与给定混杂因素的依从逆概率的乘积对参与者进行重新加权。我们表明,我们提出的估计器是一致的且渐近正态,并表明在某些情况下,所提出的方法比标准IPCW估计器更有效。我们通过模拟证明,当依从性测量存在误差时,所提出的估计器在估计因果效应方面比临时方法具有更小的偏差和更高的效率。我们将我们的方法应用于最近完成的VLNC香烟随机试验的数据。