Evaluation of experiments with adaptive interim analyses.

Authors

Bauer P, Köhne K

Affiliation

Institut für Medizinische Statistik, Wien, Austria.

Publication

Biometrics. 1994 Dec;50(4):1029-41.

PMID: 7786985

Abstract

A general method for statistical testing in experiments with an adaptive interim analysis is proposed. The method is based on the observed error probabilities from the disjoint subsamples before and after the interim analysis. Formally, an intersection of individual null hypotheses is tested by combining the two p-values into a global test statistic. Stopping rules for Fisher's product criterion in terms of critical limits for the p-value in the first subsample are introduced, including early stopping in the case of missing effects. The control of qualitative treatment-stage interactions is considered. A generalization to three stages is outlined. The loss of power when using the product criterion instead of the optimal classical test on the whole sample is calculated for the test of the mean of a normal distribution, depending on increasing proportions of the first subsample in relation to the total sample size. An upper bound on the loss of power due to early stopping is derived. A general example is presented and rules for assessing the sample size in the second stage of the trial are given. The problems of interpretation and precautions to be taken for applications are discussed. Finally, the sources of bias for estimation in such designs are described.
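
The two-stage rule described in the abstract can be illustrated with a minimal sketch. It assumes the two p-values are independent and uniformly distributed under the null hypothesis, so that P(p1 * p2 <= c) = c * (1 - ln c), and it assumes the overall level with early stopping is alpha1 + c * (ln alpha0 - ln alpha1). These formulas, the function names, and the default limits (alpha = 0.05, alpha0 = 0.5) are illustrative assumptions and are not quoted from the abstract.

```python
import math


def _bisect(f, lo, hi, tol=1e-12):
    """Root of a function that is increasing on [lo, hi], by bisection."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def fisher_critical_value(alpha):
    """Critical limit c for the product p1 * p2 at level alpha.

    Assumption: under H0 the p-values are independent and uniform,
    so P(p1 * p2 <= c) = c * (1 - ln c).
    """
    return _bisect(lambda c: c * (1.0 - math.log(c)) - alpha, 1e-300, 1.0)


def stage_one_rejection_limit(alpha, alpha0):
    """Limit alpha1 for rejecting H0 already at the interim analysis.

    Assumption: the overall level is alpha1 + c * (ln alpha0 - ln alpha1),
    where alpha0 is the limit for stopping without rejection.
    """
    c = fisher_critical_value(alpha)

    def level_gap(a1):
        return a1 + c * (math.log(alpha0) - math.log(a1)) - alpha

    # level_gap is increasing for a1 >= c, so search between c and alpha.
    return _bisect(level_gap, c, alpha)


def two_stage_decision(p1, p2=None, alpha=0.05, alpha0=0.5):
    """Hypothetical helper that applies the stopping rules to observed p-values."""
    c = fisher_critical_value(alpha)
    alpha1 = stage_one_rejection_limit(alpha, alpha0)
    if p1 <= alpha1:
        return "reject at stage 1"
    if p1 >= alpha0:
        return "stop at stage 1 without rejection"
    if p2 is None:
        return "continue to stage 2"
    return "reject at stage 2" if p1 * p2 <= c else "no rejection"


if __name__ == "__main__":
    print(fisher_critical_value(0.05))
    print(stage_one_rejection_limit(0.05, 0.5))
    print(two_stage_decision(0.10, 0.05))
```

With these assumed settings the product limit comes out near 0.0087 and the first-stage rejection limit near 0.023. A first-stage p-value of 0.10 followed by a second-stage p-value of 0.05 therefore leads to rejection, because the product 0.005 falls below the limit.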
