Satagopan Jaya M, Zhou Qin, Oliveria Susan A, Dusza Stephen W, Weinstock Martin A, Berwick Marianne, Halpern Allan C
Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center.
J R Stat Soc Ser C Appl Stat. 2011 Aug 1;60(4):619-632. doi: 10.1111/j.1467-9876.2011.00762.x.
Epidemiology studies increasingly examine multiple exposures in relation to disease by selecting the exposures of interest in a thematic manner. For example, sun exposure, sunburn, and sun protection behavior could be themes for an investigation of sun-related exposures. Several studies now use pre-defined linear combinations of the exposures pertaining to the themes to estimate the effects of the individual exposures. Such analyses may improve the precision of the exposure effects, but they can lead to inflated bias and type I errors when the linear combinations are inaccurate. We investigate preliminary test estimators and empirical Bayes type shrinkage estimators as alternative approaches when it is desirable to exploit the thematic choice of exposures, but the accuracy of the pre-defined linear combinations is unknown. We show that the two types of estimator are intimately related under certain assumptions. The shrinkage estimator derived under the assumption of an exchangeable prior distribution gives precise estimates and is robust to misspecifications of the user-defined linear combinations. The precision gains and robustness of the shrinkage estimation approach are illustrated using data from the SONIC study, where the exposures are the individual questionnaire items and the outcome is (log) total back nevus count.
流行病学研究越来越多地通过以主题方式选择感兴趣的暴露因素来考察与疾病相关的多种暴露。例如,阳光暴露、晒伤和防晒行为可以作为与阳光相关暴露调查的主题。现在有几项研究使用与主题相关的预定义暴露线性组合来估计个体暴露的影响。这种分析可能会提高暴露效应的精度,但当线性组合不准确时,可能会导致偏差和I型错误膨胀。当希望利用暴露的主题选择,但预定义线性组合的准确性未知时,我们研究了初步检验估计量和经验贝叶斯型收缩估计量作为替代方法。我们表明,在某些假设下,这两种类型的估计量密切相关。在可交换先验分布假设下导出的收缩估计量能给出精确估计,并且对用户定义线性组合的错误设定具有鲁棒性。使用SONIC研究的数据说明了收缩估计方法的精度提高和鲁棒性,其中暴露因素是个体问卷项目,结果是(对数)背部痣总数。