Lefkopoulou M, Ryan L
Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts.
Biometrics. 1993 Dec;49(4):975-88.
The applied statistician often encounters the need to compare two or more groups with respect to more than one outcome or response. Several options are generally available, including reducing the dimension of the problem by averaging or summarizing the outcomes, using Bonferroni or other adjustments for multiple comparisons, or applying a global test based on a suitable multivariate model. For normally distributed data, it is well established that global tests tend to be significantly more sensitive than other procedures. While global tests have also been proposed for multiple binary outcomes, their properties have not been well studied nor have they been widely discussed in the context of clustered data. In this paper, we derive a class of quasi-likelihood score tests for multiple binary outcomes, and show that special cases of this class correspond to other tests that have been proposed. We discuss extensions to allow for clustered data, and compare the results to the simple approach of collapsing the data to a single binary outcome, indicating the presence or absence of at least one response. The asymptotic relative efficiencies of the tests are shown to depend not only on the correlation between the outcomes, but also on the response probabilities. Although global tests based on a multivariate model are generally recommended, our findings suggest that a test based on the collapsed data can maintain surprisingly high efficiency, especially when the outcomes of interest are rare. Data from several developmental toxicity studies illustrate our results.
应用统计学家经常会遇到需要就不止一个结果或反应来比较两个或更多组的情况。通常有几种选择,包括通过对结果进行平均或汇总来降低问题的维度,使用邦费罗尼校正或其他多重比较校正方法,或者应用基于合适的多元模型的全局检验。对于正态分布的数据,众所周知全局检验往往比其他方法显著更灵敏。虽然也有人针对多个二元结果提出了全局检验,但它们的性质尚未得到充分研究,在聚类数据的背景下也未得到广泛讨论。在本文中,我们推导了一类针对多个二元结果的拟似然得分检验,并表明该类的特殊情况对应于已提出的其他检验。我们讨论了允许处理聚类数据的扩展,并将结果与将数据合并为单个二元结果(表明至少有一个反应的存在或不存在)的简单方法进行比较。检验的渐近相对效率不仅取决于结果之间的相关性,还取决于反应概率。尽管通常推荐基于多元模型的全局检验,但我们的研究结果表明基于合并后数据的检验可以保持出奇高的效率,特别是当感兴趣的结果很少见时。来自几项发育毒性研究的数据说明了我们的结果。