Institute for Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany.
Freiburg Center for Data Analysis and Modeling, University of Freiburg, Freiburg, Germany.
Stat Med. 2019 Sep 30;38(22):4390-4403. doi: 10.1002/sim.8305. Epub 2019 Jul 17.
Estimating the potential risk associated with an exposure occurring over time requires complex statistical techniques, since ignoring the time from study entry until the exposure leads to potentially seriously biased effect estimates. A prominent example is estimating the effect of hospital-acquired infections on adverse outcomes in patients admitted to the intensive care unit. Exposure density sampling has been proposed as an approach to dynamic matching with respect to a time-dependent exposure. Firstly, exposure density sampling can be useful to reduce the workload of study follow up, as it includes all exposed but only a subset of the not yet exposed individuals. Secondly, it can help to obtain a comparable control group by including propensity score matching. In the present article, we provide the theoretical justification that data obtained by exposure density sampling can be analyzed as a left-truncated cohort. It is shown that exposure density sampling allows estimation of the effect of a time-dependent exposure as well as further baseline covariates on a subsequent event, with only minor loss in precision as compared with a full cohort analysis. The sampling is applied to a real data example (hospital-acquired infections in intensive care units) and in a simulation study. We also provide an estimate of the loss in precision in terms of an increased standard error in the reduced data set after exposure density sampling as compared with the full cohort.
估算随时间发生的暴露相关潜在风险需要复杂的统计技术,因为忽略从研究开始到暴露导致潜在严重偏差的效应估计的时间。一个突出的例子是估计医院获得性感染对重症监护病房患者不良结局的影响。暴露密度抽样已被提议作为一种针对时间相关暴露的动态匹配方法。首先,暴露密度抽样对于减少研究随访的工作量可能很有用,因为它包括所有暴露者,但仅包括一部分尚未暴露者。其次,通过包含倾向评分匹配,可以帮助获得可比的对照组。在本文中,我们提供了理论依据,即通过暴露密度抽样获得的数据可以作为左截断队列进行分析。结果表明,暴露密度抽样可以估计时间相关暴露以及进一步的基线协变量对随后事件的影响,与全队列分析相比,仅略有精度损失。该抽样方法应用于实际数据示例(重症监护病房的医院获得性感染)和模拟研究。我们还提供了一个精度损失的估计,即与全队列相比,在暴露密度抽样后,在简化数据集上的标准误差增加。