Boehm Laura, Reich Brian J, Bandyopadhyay Dipankar
Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA.
Biometrics. 2013 Jun;69(2):545-54. doi: 10.1111/biom.12027. Epub 2013 May 31.
Spatially referenced binary data are common in epidemiology and public health. Owing to its elegant log-odds interpretation of the regression coefficients, a natural model for these data is logistic regression. To account for missing confounding variables that might exhibit a spatial pattern (say, socioeconomic, biological, or environmental conditions), it is customary to include a Gaussian spatial random effect. Conditioned on the spatial random effect, the coefficients may be interpreted as log odds ratios. However, marginally over the random effects, the coefficients no longer preserve the log-odds interpretation, and the estimates are hard to interpret and generalize to other spatial regions. To resolve this issue, we propose a new spatial random effect distribution through a copula framework which ensures that the regression coefficients maintain the log-odds interpretation both conditional on and marginally over the spatial random effects. We present simulations to assess the robustness of our approach to various random effects, and apply it to an interesting dataset assessing periodontal health of Gullah-speaking African Americans. The proposed methodology is flexible enough to handle areal or geo-statistical datasets, and hierarchical models with multiple random intercepts.
空间参考二元数据在流行病学和公共卫生领域很常见。由于其对回归系数的优雅对数优势解释,这些数据的自然模型是逻辑回归。为了考虑可能呈现空间模式的缺失混杂变量(例如社会经济、生物或环境条件),通常会纳入高斯空间随机效应。在空间随机效应条件下,系数可解释为对数优势比。然而,在随机效应的边缘分布上,系数不再保留对数优势解释,估计值难以解释且难以推广到其他空间区域。为了解决这个问题,我们通过一个Copula框架提出了一种新的空间随机效应分布,该框架确保回归系数在空间随机效应条件下和边缘分布上都保持对数优势解释。我们进行了模拟,以评估我们的方法对各种随机效应的稳健性,并将其应用于一个评估说古拉语的非裔美国人牙周健康的有趣数据集。所提出的方法足够灵活,能够处理区域或地理统计数据集以及具有多个随机截距的分层模型。