Chao W H, Palta M, Young T
Department of Preventive Medicine, University of Wisconsin, Madison 53705, USA.
Biometrics. 1997 Jun;53(2):678-89.
Marginal analysis using the generalized estimating equation approach is widely applied to correlated observations, as occur in studies with clusters and in longitudinal follow-up of individuals. In this article, we investigate the effect of confounding in such models. We assume that a risk factor x and a confounder z are related by a generalized linear model to the outcome y, which can be binary or ordinal. In order to investigate confounding arising from the omission of z, a joint structure for x and z must be specified. Modeling normally distributed (x,z) as sums of between- and within-individual (or cluster) components allows us to incorporate different degrees of between- and within-individual correlation. Such a structure includes, as special cases, cohort and period effects in longitudinal settings and random intercept models. The latter situation corresponds to allowing z to vary only on the between-individual (or cluster) level and to be uncorrelated with x, and leads to attenuation of the coefficient of x in marginal models with the logit and probit links. More complex situations occur when z is allowed to also vary on the within-individual (or cluster) level and when z is correlated with x. We examine the model specification and the expected bias when fitting a marginal model in the presence of the omitted confounder z. We derive general formulas and interpret the parameters and results in an ongoing cohort study. Testing for omitted covariates is also discussed.
使用广义估计方程方法的边际分析广泛应用于相关观测值,如在聚类研究和个体的纵向随访中出现的情况。在本文中,我们研究了此类模型中混杂因素的影响。我们假设风险因素x和混杂因素z通过广义线性模型与结果y相关,y可以是二元或有序的。为了研究因遗漏z而产生的混杂,必须指定x和z的联合结构。将正态分布的(x,z)建模为个体间(或聚类)成分与个体内成分之和,使我们能够纳入不同程度的个体间和个体内相关性。作为特殊情况,这种结构包括纵向设置中的队列效应和时期效应以及随机截距模型。后一种情况对应于允许z仅在个体间(或聚类)水平上变化且与x不相关,这会导致在具有logit和probit链接的边际模型中x的系数衰减。当允许z也在个体内(或聚类)水平上变化以及z与x相关时,会出现更复杂的情况。我们研究了在存在遗漏的混杂因素z的情况下拟合边际模型时的模型设定和预期偏差。我们推导了一般公式,并在一项正在进行的队列研究中解释了参数和结果。还讨论了对遗漏协变量的检验。