O'Gorman T W, Woolson R F, Jones M P, Lemke J H
Department of Preventive Medicine, University of Iowa, Iowa City 52242.
Environ Health Perspect. 1990 Jul;87:103-7. doi: 10.1289/ehp.9087103.
In order to control for confounding variables, epidemiologists often obtain data in the form of a 2 x 2 table. One variable is usually the disease status, while the other variable represents a dichotomous exposure variable that is suspected of being a risk factor. If a confounding variable is present, the data are often stratified into several 2 x 2 tables. The objectives of the analysis are to test for the association between the suspected risk factor and the disease and to estimate the strength of this relationship. Before estimating a common odds ratio, it is important to check whether the odds ratios are homogeneous. This paper presents the results of a Monte Carlo study that was performed to determine the size and power of a number of tests of association and homogeneity when the data are sparse. We also evaluated the performance of three estimators of the common odds ratio. For the Monte Carlo studies, equal numbers of cases and controls were used in a wide variety of sparse data situations. On the basis of these studies, we recommend the Breslow-Day test for nonsparse data, and the T4 and T5 statistics for sparse data to test for homogeneity. The Mantel-Haenszel test of association is recommended for sparse and nonsparse data sets. With sparse data, none of the odds ratio estimators are entirely satisfactory.
为了控制混杂变量,流行病学家通常以2×2表格的形式获取数据。一个变量通常是疾病状态,而另一个变量代表一个二分暴露变量,该变量被怀疑是一个风险因素。如果存在混杂变量,数据通常会被分层为几个2×2表格。分析的目的是检验疑似风险因素与疾病之间的关联,并估计这种关系的强度。在估计共同比值比之前,检查比值比是否齐性很重要。本文介绍了一项蒙特卡洛研究的结果,该研究旨在确定数据稀疏时一些关联检验和齐性检验的规模和功效。我们还评估了共同比值比的三种估计方法的性能。对于蒙特卡洛研究,在各种稀疏数据情况下使用了相等数量的病例和对照。基于这些研究,我们推荐对非稀疏数据使用Breslow-Day检验,对稀疏数据使用T4和T5统计量来检验齐性。对于稀疏和非稀疏数据集,推荐使用Mantel-Haenszel关联检验。对于稀疏数据,没有一种比值比估计方法是完全令人满意的。