Weinberg J M, Lagakos S W
Department of Epidemiology and Biostatistics, Boston University School of Public Health, Boston, Massachusetts 02118, USA.
Stat Med. 2001 Mar 15;20(5):705-31. doi: 10.1002/sim.708.
A popular method of using repeated measures data to compare treatment groups in a clinical trial is to summarize each individual's outcomes with a scalar summary statistic, and then to perform a two-group comparison of the resulting statistics using a rank or permutation test. Many different types of summary statistics are used in practice, including discrete and continuous functions of the underlying repeated measures data. When the repeated measures processes of the comparison groups differ by a location shift at each time point, the asymptotic relative efficiency of (continuous) summary statistics that are linear functions of the repeated measures has been determined and used to compare tests in this class. However, little is known about the non-null behaviour of discrete summary statistics, about continuous summary statistics when the groups differ in more complex ways than location shifts or where the summary statistics are not linear functions of the repeated measures. Indeed, even simple distributional structures on the repeated measures variables can lead to complex differences between the distribution of common summary statistics of the comparison groups. The presence of left censoring of the repeated measures, which can arise when these are laboratory markers with lower limits of detection, further complicates the distribution of, and hence the ability to compare, summary statistics. This paper uses recent theoretical results for the non-null behaviour of rank and permutation tests to examine the asymptotic relative efficiencies of several popular summary statistics, both discrete and continuous, under a variety of common settings. We assume a flexible linear growth curve model to describe the repeated measures responses and focus on the types of settings that commonly arise in HIV/AIDS and other diseases.
在临床试验中,一种常用的利用重复测量数据比较治疗组的方法是用一个标量汇总统计量来概括每个个体的结果,然后使用秩检验或置换检验对所得统计量进行两组比较。在实际应用中会使用许多不同类型的汇总统计量,包括基础重复测量数据的离散函数和连续函数。当比较组的重复测量过程在每个时间点因位置偏移而不同时,作为重复测量线性函数的(连续)汇总统计量的渐近相对效率已被确定,并用于比较此类检验。然而,对于离散汇总统计量的非零行为、当组间差异比位置偏移更复杂时连续汇总统计量的非零行为,或者当汇总统计量不是重复测量的线性函数时的情况,我们了解得很少。实际上,即使重复测量变量上简单的分布结构也可能导致比较组常见汇总统计量的分布之间出现复杂差异。当重复测量存在左删失(这在重复测量为有检测下限的实验室指标时可能出现)时,会进一步使汇总统计量的分布变得复杂,从而影响比较汇总统计量的能力。本文利用关于秩检验和置换检验非零行为的最新理论结果,在各种常见情况下研究了几种流行的离散和连续汇总统计量的渐近相对效率。我们假设一个灵活的线性增长曲线模型来描述重复测量响应,并关注在艾滋病毒/艾滋病和其他疾病中常见的情况类型。