Burton Paul R
Department of Epidemiology and Public Health and Institute of Genetics, University of Leicester, Leicester, UK.
Genet Epidemiol. 2003 Jan;24(1):24-35. doi: 10.1002/gepi.10206.
Gibbs sampling-based generalized linear mixed models (GLMMs) provide a convenient and flexible way to extend variance components models for multivariate normally distributed continuous traits to other classes of phenotype. This includes binary traits and right-censored failure times such as age-at-onset data. The approach has applications in many areas of genetic epidemiology. However, the required GLMMs are sensitive to nonrandom ascertainment. In the absence of an appropriate correction for ascertainment, they can exhibit marked positive bias in the estimated grand mean and serious shrinkage in the estimated magnitude of variance components. To compound practical difficulties, it is currently difficult to implement a conventional adjustment for ascertainment because of the need to undertake repeated integration across the distribution of random effects. This is prohibitively slow when it must be repeated at every iteration of the Markov chain Monte Carlo (MCMC) procedure. This paper motivates a correction for ascertainment that is based on sampling random effects rather than integrating across them and can therefore be implemented in a general-purpose Gibbs sampling environment such as WinBUGS. The approach has the characteristic that it returns ascertainment-adjusted parameter estimates that pertain to the true distribution of determinants in the ascertained sample rather than in the general population. The implications of this characteristic are investigated and discussed. This paper extends the utility of Gibbs sampling-based GLMMs to a variety of settings in which family data are ascertained nonrandomly.
基于吉布斯抽样的广义线性混合模型(GLMMs)提供了一种便捷且灵活的方法,可将用于多元正态分布连续性状的方差分量模型扩展到其他类型的表型。这包括二元性状和右删失失效时间,如发病年龄数据。该方法在遗传流行病学的许多领域都有应用。然而,所需的GLMMs对非随机确定敏感。在没有适当的确定校正的情况下,它们在估计的总体均值中可能表现出明显的正偏差,并且在估计的方差分量大小中出现严重收缩。更复杂的是,由于需要在随机效应分布上进行重复积分,目前很难实施传统的确定校正。当必须在马尔可夫链蒙特卡罗(MCMC)过程的每次迭代中重复进行时,这会极其缓慢。本文提出了一种基于对随机效应进行抽样而非对其进行积分的确定校正方法,因此可以在诸如WinBUGS这样的通用吉布斯抽样环境中实现。该方法的特点是它返回的确定校正参数估计值与确定样本中决定因素的真实分布相关,而不是与总体中的真实分布相关。本文对这一特点的影响进行了研究和讨论。本文将基于吉布斯抽样的GLMMs的效用扩展到了各种非随机确定家庭数据的情况。