Zhao Yanli, Grambsch Patricia M, Neaton James D
Clinical Biostatistics, Merck Research Laboratories, North Wales, PA 19454, USA.
Clin Trials. 2007;4(2):140-53. doi: 10.1177/1740774507076936.
Many clinical trials have multiple outcomes. Formal interim monitoring guidelines that take account of multiple outcomes can be useful to Data Monitoring Committees (DMC). Previous research has focused on marginal criteria that control the overall type I error for bivariate endpoints corresponding to efficacy and safety. Usually, an 'or' decision rule is used, that is, the trial is stopped when either the safety or the efficacy endpoint crosses a boundary.
In many trials there is not a clear difference between what is considered a safety and an efficacy endpoint. Likewise, in some studies there is interest in more than one disease outcome and while there may be a primary efficacy endpoint, interim monitoring might also involve a measure of overall health. For these situations, an ;and' decision rule might be more appropriate; that is, the trial is stopped only when both endpoints cross a boundary. Formulation of this new decision rule at the design stage would encourage more discussion among trial investigators of monitoring guidelines and would result in improved guidelines for DMCs.
In this paper, we propose stopping guidelines for such trials with two major outcomes that control the overall type I error and stop early only if both endpoints indicate superiority for the same treatment. Trials with two treatments are considered and we develop sets of paired two-sided boundaries, permitting one endpoint to be primary and the other supportive, with or without pre-specifying which one is primary.
The results show that the boundaries depend on the correlation between the two outcomes. The critical values increase as the correlation increases in most cases. For low to moderate correlation and before the last stage, critical values based on the O'Brien Fleming (or Pocock) error spending function that consider the correlation are lower than those which do not.
Investigators might not want to stop a trial early with the small size of the critical values that result from this decision rule for some situations. For a trial in which the treatment effect for one outcome is large, while for the other it is small, the proposed decision rule has low power to accept this treatment for its superiority on both outcomes. Also, we do not provide a separate P-value for each outcome since type I error is not controlled for individual null hypotheses.
For trials involving two major health outcomes these stopping guidelines may be appropriate for DMCs when the trial investigators and sponsor recommend that early termination not be considered unless the findings are consistent for both outcomes.
许多临床试验有多个结局。考虑到多个结局的正式中期监测指南对数据监测委员会(DMC)可能会很有用。以往的研究主要集中在控制与疗效和安全性相对应的双变量终点的总体I型错误的边际标准上。通常使用“或”决策规则,即当安全性或疗效终点越过边界时试验即停止。
在许多试验中,安全性终点和疗效终点之间并没有明显的区别。同样,在一些研究中,人们关注的不止一种疾病结局,虽然可能有一个主要疗效终点,但中期监测可能还涉及总体健康指标。对于这些情况,“且”决策规则可能更合适;也就是说,只有当两个终点都越过边界时试验才停止。在设计阶段制定这一新的决策规则将促使试验研究者就监测指南展开更多讨论,并将为数据监测委员会带来改进的指南。
在本文中,我们针对此类有两个主要结局的试验提出停止指南,以控制总体I型错误,并且只有当两个终点都表明同一治疗具有优越性时才提前停止试验。我们考虑了有两种治疗方法的试验,并制定了成对的双侧边界集,允许一个终点为主终点,另一个为支持性终点,无论是否预先指定哪一个为主终点。
结果表明,边界取决于两个结局之间的相关性。在大多数情况下,临界值随着相关性的增加而增加。对于低至中等相关性且在最后阶段之前,基于考虑相关性的奥布赖恩·弗莱明(或波科克)错误消耗函数的临界值低于不考虑相关性的临界值。
在某些情况下,由于此决策规则产生的临界值较小,研究者可能不想提前停止试验。对于一个结局的治疗效果大而另一个结局的治疗效果小的试验,所提出的决策规则接受该治疗在两个结局上均具有优越性的检验效能较低。此外,由于未针对单个原假设控制I型错误,我们没有为每个结局提供单独的P值。
对于涉及两个主要健康结局的试验,当试验研究者和申办方建议除非两个结局的结果一致否则不考虑提前终止时,这些停止指南可能适用于数据监测委员会。