Rosner B, Milton R C
Department of Preventive Medicine and Clinical Epidemiology, Harvard Medical School and Brigham & Women's Hospital, Boston, Massachusetts 02115.
Biometrics. 1988 Jun;44(2):505-12.
Multiple logistic regression is a commonly used multivariate technique for analyzing data with a binary outcome. One assumption needed for this method of analysis is the independence of outcome for all sample points in a data set. In ophthalmologic data and other types of correlated binary data, this assumption is often grossly violated and the validity of the technique becomes an issue. A technique has been developed (Rosner, 1984) that utilizes a polychotomous logistic regression model to allow one to look at multiple exposure variables in the context of a correlated binary data structure. This model is an extension of the beta-binomial model, which has been widely used to model correlated binary data when no covariates are present. In this paper, a relationship is developed between the two techniques, whereby it is shown that use of ordinary logistic regression in the presence of correlated binary data can result in true significance levels that are considerably larger than nominal levels in frequently encountered situations. This relationship is explored in detail in the case of a single dichotomous exposure variable. In this case, the appropriate test statistic can be expressed as an adjusted chi-square statistic based on the 2 X 2 contingency table relating exposure to outcome. The test statistic is easily computed as a function of the ordinary chi-square statistic and the correlation between eyes (or more generally between cluster members) for outcome and exposure, respectively. This generalizes some previous results obtained by Koval and Donner (1987, in Festschrift for V. M. Joshi, I. B. MacNeill (ed.), Vol. V, 199-224.(ABSTRACT TRUNCATED AT 250 WORDS)
多元逻辑回归是一种常用的多变量技术,用于分析具有二元结局的数据。这种分析方法所需的一个假设是数据集中所有样本点的结局相互独立。在眼科数据和其他类型的相关二元数据中,这一假设常常被严重违背,该技术的有效性就成了问题。已经开发出一种技术(罗斯纳,1984年),它利用多分类逻辑回归模型,使人们能够在相关二元数据结构的背景下考察多个暴露变量。该模型是β-二项式模型的扩展,当不存在协变量时,β-二项式模型已被广泛用于对相关二元数据进行建模。本文阐述了这两种技术之间的关系,结果表明,在存在相关二元数据的情况下使用普通逻辑回归,在常见情形下可能导致实际显著性水平远高于名义水平。对于单个二分暴露变量的情况,对这种关系进行了详细探讨。在这种情况下,适当的检验统计量可以表示为基于将暴露与结局联系起来的2×2列联表的调整卡方统计量。该检验统计量可以很容易地根据普通卡方统计量以及结局和暴露在眼睛之间(或更一般地在聚类成员之间)的相关性来计算。这推广了科瓦尔和唐纳(1987年,载于《V.M.乔希纪念文集》,I.B.麦克尼尔编,第五卷,199 - 224页)先前得到的一些结果。(摘要截取自250字)