Department of Health Management, Atlantic Veterinary College, University of Prince Edward Island, Charlottetown, PEI C1A 4P3, Canada.
Prev Vet Med. 2010 Feb 1;93(2-3):81-97. doi: 10.1016/j.prevetmed.2009.10.004. Epub 2009 Dec 11.
Binary repeated measures data are commonly encountered in both experimental and observational veterinary studies. Among the wide range of statistical methods and software applicable to such data one major distinction is between marginal and random effects procedures. The objective of the study was to review and assess the performance of marginal and random effects estimation procedures for the analysis of binary repeated measures data. Two simulation studies were carried out, using relatively small, balanced, two-level (time within subjects) datasets. The first study was based on data generated from a marginal model with first order autocorrelation, the second on a random effects model with autocorrelated random effects within subjects. Three versions of the models were considered in which a dichotomous treatment was modelled additively, either between or within subjects, or modelled by a time interaction. Among the studied statistical procedures were: generalized estimating equations (GEE), Marginal Quasi Likelihood, likelihood based on numerical integration, penalized quasi-likelihood, restricted pseudo likelihood and Bayesian Markov Chain Monte Carlo. Results for data generated by the marginal model showed autoregressive GEE to be highly efficient when treatment was within subjects, even with strongly correlated responses. For treatment between subjects, random effects procedures also performed well in some situations; however, a relatively small number of subjects with a short time series proved a challenge for both marginal and random effects procedures. Results for data generated by the random effects model showed bias in estimates from random effects procedures when autocorrelation was present in the data, while the marginal procedures generally gave estimates close to the marginal parameters.
在实验和观察性兽医研究中,经常会遇到二进制重复测量数据。在适用于此类数据的广泛的统计方法和软件中,一个主要区别是边缘和随机效应程序之间的区别。本研究的目的是回顾和评估用于分析二进制重复测量数据的边缘和随机效应估计程序的性能。进行了两项模拟研究,使用相对较小、平衡的两级(受试者内时间)数据集。第一项研究基于具有一阶自相关的边缘模型生成的数据,第二项研究基于具有自相关随机效应的随机效应模型生成的数据。考虑了三种模型版本,其中将二项式处理分别建模为受试者间、受试者内或通过时间交互。所研究的统计程序包括:广义估计方程(GEE)、边缘拟似然、基于数值积分的似然、惩罚拟似然、限制伪似然和贝叶斯马尔可夫链蒙特卡罗。对于由边缘模型生成的数据,当处理为受试者内时,自回归 GEE 表现出高度的效率,即使响应高度相关。对于受试者间的处理,随机效应程序在某些情况下也表现良好;然而,对于边缘和随机效应程序来说,具有短时间序列的受试者数量相对较少是一个挑战。对于由随机效应模型生成的数据,当数据中存在自相关时,随机效应程序的估计存在偏差,而边缘程序通常给出接近边缘参数的估计。