Wolkewitz M, Beyersmann J, Gastmeier P, Schumacher M
Institute of Medical Biometry and Medical Informatics, University Medical Center Freiburg, 79104 Freiburg, Germany.
Methods Inf Med. 2009;48(5):438-43. doi: 10.3414/ME9241. Epub 2009 Aug 5.
The impact of time-dependent exposures on the time until a study endpoint can be analyzed correctly with data from a full cohort. Ignoring the time-dependent nature of these exposures leads to time-dependent bias. Matching for time to exposure is often applied to take the time-dependency into account, but sets of exposed and unexposed patients that are fixed in advance may still create bias. Such subcohort sampling is attractive because it also saves resources, especially when only exposure and outcome data are available for the full cohort and further covariate information is required. The first objective is to show to what extent matching for time to exposure yields biased results. The second is to introduce and explore exposure density sampling.
To evaluate how the two sampling methods perform, they are compared with the correct full-cohort method and with the approach in which the time-dependent nature of the exposure is ignored. Real data from the SIR-3 study (Germany, 2000-2001) and a simulation study are used.
Simulations show that matching may reduce the time-dependent bias, but some bias remains. The matching bias decreases if fewer patients are exposed. Exposure density sampling yields unbiased results.
Results from studies in which matching for time to exposure was applied are tolerable only for rare exposures. Whenever a subcohort is intended in order to save resources, exposure density sampling should be preferred.
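The time-dependent bias referred to above can be illustrated with a minimal simulation sketch. It is not the simulation study of the paper: the function name, the sample size, the exponential waiting times and both rates are assumptions chosen for illustration. The exposure is simulated to have no effect on the endpoint, so the true rate ratio is 1. Treating ever-exposed patients as exposed from baseline then gives a ratio well below 1 (about one third under these assumed rates), whereas splitting person-time at the time of exposure recovers a ratio close to 1.

```python
import random


def simulate_rate_ratios(n=20000, exposure_rate=1.0, endpoint_rate=1.0, seed=1):
    """Return (naive, time_dependent) rate ratios, exposed vs. unexposed.

    Hypothetical setup: time to exposure and time to endpoint are
    independent exponential variables, so exposure has no true effect.
    """
    rng = random.Random(seed)
    # Index 0 = unexposed, index 1 = exposed.
    naive_events, naive_time = [0, 0], [0.0, 0.0]
    td_events, td_time = [0, 0], [0.0, 0.0]

    for _ in range(n):
        t_exposure = rng.expovariate(exposure_rate)
        t_endpoint = rng.expovariate(endpoint_rate)
        ever_exposed = t_exposure < t_endpoint

        # Naive analysis: ever-exposed patients count as exposed from time 0,
        # so their unexposed waiting time is wrongly credited to "exposed".
        group = 1 if ever_exposed else 0
        naive_events[group] += 1
        naive_time[group] += t_endpoint

        # Time-dependent analysis: everyone contributes unexposed person-time
        # until exposure or endpoint, whichever comes first.
        td_time[0] += min(t_exposure, t_endpoint)
        if ever_exposed:
            td_time[1] += t_endpoint - t_exposure
            td_events[1] += 1
        else:
            td_events[0] += 1

    def ratio(events, time):
        # Events per unit of person-time, exposed divided by unexposed.
        return (events[1] / time[1]) / (events[0] / time[0])

    return ratio(naive_events, naive_time), ratio(td_events, td_time)


if __name__ == "__main__":
    naive, time_dependent = simulate_rate_ratios()
    print(f"naive rate ratio:          {naive:.2f}")
    print(f"time-dependent rate ratio: {time_dependent:.2f}")
```

Rates per unit of person-time are used here instead of a Cox model only because they need no external library and are adequate under the assumed constant hazards.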