Lavange L M, Keyes L L, Koch G G, Margolis P A
Centre for Medical Statistics, Research Triangle Institute, Research Triangle Park, NC 27709.
Stat Med. 1994 Feb 28;13(4):343-55. doi: 10.1002/sim.4780130403.
We describe ratio estimation methods for multivariately analysing incidence densities from prospective epidemiologic studies. Commonly used in survey data analysis, these ratio methods require minimal distributional assumptions and take into account the random variability in the at-risk periods. We illustrate their application with data from a study of lower respiratory illness (LRI) in children during the first year of life. One question of interest is whether children with passive exposure to tobacco smoke have a higher rate of LRI, on average, than those with no exposure and in a setting where age of child and season are taken into account. A second question is whether the relationship persists after adjusting for background variables such as family's socioeconomic status, crowding in the home, race, and type of feeding. The basic strategy consists of a two-step process in which we first estimate subgroup-specific incidence densities and their covariance matrix via a first-order Taylor series approximation. These estimates are used to test for differences in marginal rates of LRI between children exposed to tobacco smoke and those not exposed. We then fit a log-linear model to the estimated ratios in order to test for significant covariate effects. The ability to produce direct estimates of adjusted incidence density ratios for risk factors of interest is an important advantage of this approach. For comparison purposes and to address the limitations of the ratio method with respect to the number of covariates that can be controlled simultaneously, we consider survey logistic regression methods for the example data as well as logistic and Poisson regression models fitted via generalized estimating equation methods. Although the analysis strategy is illustrated with illness data from an epidemiologic study, the context of application is broader and includes, for example, data on adverse events from a clinical trial.
我们描述了用于对前瞻性流行病学研究中的发病密度进行多变量分析的比率估计方法。这些比率方法常用于调查数据分析,所需的分布假设最少,并考虑了风险期的随机变异性。我们用一项关于一岁以内儿童下呼吸道疾病(LRI)的研究数据来说明它们的应用。一个感兴趣的问题是,在考虑儿童年龄和季节的情况下,平均而言,被动接触烟草烟雾的儿童患LRI的比率是否高于未接触者。第二个问题是,在调整诸如家庭社会经济地位、家庭拥挤程度、种族和喂养方式等背景变量后,这种关系是否仍然存在。基本策略包括一个两步过程,首先我们通过一阶泰勒级数近似估计亚组特异性发病密度及其协方差矩阵。这些估计值用于检验接触烟草烟雾的儿童和未接触儿童之间LRI边际发生率的差异。然后,我们对估计的比率拟合一个对数线性模型,以检验协变量的显著影响。能够直接估计感兴趣的危险因素的调整发病密度比率是这种方法的一个重要优点。为了进行比较,并解决比率方法在可同时控制的协变量数量方面的局限性,我们针对示例数据考虑了调查逻辑回归方法以及通过广义估计方程方法拟合的逻辑回归和泊松回归模型。尽管分析策略是用一项流行病学研究的疾病数据来说明的,但其应用背景更广泛,例如包括来自临床试验的不良事件数据。