Université Paris-Saclay, UVSQ, Inserm, CESP, High Dimensional Biostatistics for Drug Safety and Genomics, Villejuif, France.
Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Canada.
Biom J. 2023 Aug;65(6):e2100384. doi: 10.1002/bimj.202100384. Epub 2023 Feb 27.
Cohort and nested case-control (NCC) designs are frequently used in pharmacoepidemiology to assess the associations of drug exposure that can vary over time with the risk of an adverse event. Although it is typically expected that estimates from NCC analyses are similar to those from the full cohort analysis, with moderate loss of precision, only few studies have actually compared their respective performance for estimating the effects of time-varying exposures (TVE). We used simulations to compare the properties of the resulting estimators of these designs for both time-invariant exposure and TVE. We varied exposure prevalence, proportion of subjects experiencing the event, hazard ratio, and control-to-case ratio and considered matching on confounders. Using both designs, we also estimated the real-world associations of time-invariant ever use of menopausal hormone therapy (MHT) at baseline and updated, time-varying MHT use with breast cancer incidence. In all simulated scenarios, the cohort-based estimates had small relative bias and greater precision than the NCC design. NCC estimates displayed bias to the null that decreased with a greater number of controls per case. This bias markedly increased with higher proportion of events. Bias was seen with Breslow's and Efron's approximations for handling tied event times but was greatly reduced with the exact method or when NCC analyses were matched on confounders. When analyzing the MHT-breast cancer association, differences between the two designs were consistent with simulated data. Once ties were taken correctly into account, NCC estimates were very similar to those of the full cohort analysis.
队列研究和巢式病例对照研究(NCC)经常用于药物流行病学,以评估随时间变化的药物暴露与不良事件风险之间的关联。虽然通常预期 NCC 分析的估计值与全队列分析相似,只是精度略有降低,但实际上很少有研究比较过它们在估计随时间变化的暴露(TVE)效果方面的表现。我们使用模拟比较了这两种设计对于时不变暴露和 TVE 的结果估计量的特性。我们改变了暴露的流行率、发生事件的受试者比例、风险比和病例对照比,并考虑了对混杂因素的匹配。我们使用这两种设计,还估计了基础时不变的绝经期激素治疗(MHT)的实际使用情况以及更新的、随时间变化的 MHT 使用与乳腺癌发病率之间的关联。在所有模拟场景中,基于队列的估计值相对于 NCC 设计具有较小的相对偏差和更高的精度。NCC 估计值存在向零值的偏差,随着病例对照的比例增加而减小。这种偏差随着事件比例的增加而显著增加。Breslow 和 Efron 的近似值在处理捆绑事件时间时会出现偏差,但使用精确方法或当 NCC 分析针对混杂因素进行匹配时,偏差会大大降低。在分析 MHT-乳腺癌相关性时,两种设计之间的差异与模拟数据一致。一旦正确考虑了关联,NCC 估计值就非常接近全队列分析的估计值。