Division of Biostatistics, Medical College of Wisconsin,
Department of Statistics, University of Missouri-Columbia,
Stud Health Technol Inform. 2021 May 24;278:11-16. doi: 10.3233/SHTI210043.
This manuscript investigates sample sizes for interim analyses in group sequential designs. Traditional group sequential designs (GSD) rely on "information fraction" arguments to define the interim sample sizes. Then, interim maximum likelihood estimators (MLEs) are used to decide whether to stop early or continue the data collection until the next interim analysis. The possibility of early stopping changes the distribution of interim and final MLEs: possible interim decisions on trial stopping excludes some sample space elements. At each interim analysis the distribution of an interim MLE is a mixture of truncated and untruncated distributions. The distributional form of an MLE becomes more and more complicated with each additional interim analysis. Test statistics that are asymptotically normal without a possibly of early stopping, become mixtures of truncated normal distributions under local alternatives. Stage-specific information ratios are equivalent to sample size ratios for independent and identically distributed data. This equivalence is used to justify interim sample sizes in GSDs. Because stage-specific information ratios derived from normally distributed data differ from those derived from non-normally distributed data, the former equivalence is invalid when there is a possibility of early stopping. Tarima and Flournoy [3] have proposed a new GSD where interim sample sizes are determined by a pre-defined sequence of ordered alternative hypotheses, and the calculation of information fractions is not needed. This innovation allows researchers to prescribe interim analyses based on desired power properties. This work compares interim power properties of a classical one-sided three stage Pocock design with a one-sided three stage design driven by three ordered alternatives.
本文研究了群组序贯设计中中期分析的样本量。传统的群组序贯设计(GSD)依赖于“信息分数”论证来定义中期样本量。然后,使用中期最大似然估计量(MLE)来决定是否提前停止或继续收集数据直到下一次中期分析。提前停止的可能性会改变中期和最终 MLE 的分布:提前停止试验的可能性排除了一些样本空间元素。在每次中期分析中,中期 MLE 的分布是截断和非截断分布的混合。随着每次额外的中期分析,MLE 的分布形式变得越来越复杂。在没有提前停止的可能性的情况下渐近正态的检验统计量,在局部替代下成为截断正态分布的混合物。特定阶段的信息比在独立同分布数据中相当于样本量比。这种等价性用于证明 GSD 中的中期样本量。由于来自正态分布数据的特定阶段信息比与来自非正态分布数据的信息比不同,因此当存在提前停止的可能性时,前者的等价性无效。Tarima 和 Flournoy[3]提出了一种新的 GSD,其中中期样本量由预定义的有序替代假设序列确定,并且不需要计算信息分数。这项创新允许研究人员根据所需的功效属性规定中期分析。这项工作比较了基于三个有序替代假设驱动的单侧三阶段设计的经典单侧三阶段 Pocock 设计的中期功效属性。