Department of Biostatistics, Institut Gustave-Roussy, Villejuif, France.
Clin Trials. 2012 Jun;9(3):283-92. doi: 10.1177/1740774512443430. Epub 2012 May 8.
Traditional clinical trial designs strive to definitively establish the superiority of an experimental treatment, which results in risk-adverse criteria and large sample sizes. Increasingly, common cancers are recognized as consisting of small subsets with specific aberrations for targeted therapy, making large trials infeasible.
To compare the performance of different trial design strategies over a long-term research horizon.
We simulated a series of two-treatment superiority trials over 15 years using different design parameters. Trial parameters examined included the number of positive trials to establish superiority (one-trial vs. two-trial rule), α level (2.5%-50%), and the number of trials in the 15-year period, K (thus, trial sample size). The design parameters were evaluated for different disease scenarios, accrual rates, and distributions of treatment effect. Metrics used included the overall survival gain at a 15-year horizon measured by the hazard ratio (HR), year 15 versus year 0. We also computed the expected total survival benefit and the risk of selecting as new standard of care at year 15 a treatment inferior to the initial control treatment, P(detrimental effect).
Expected survival benefits over the 15-year horizon were maximized when more (smaller) trials were conducted than recommended under traditional criteria, using the criterion of one positive trial (vs. two), and relaxing the α value from 2.5% to 20%. Reducing the sample size and relaxing the α value also increased the likelihood of selecting an inferior treatment at the end. The impact of α and K on the survival benefit depended on the specific disease scenario and accrual rate: greater gains for relaxing α in diseases with good outcome and/or low accrual rates and greater gains for increasing K for diseases with poor outcomes. Trials with smaller sample size did not perform well when using stringent (standard) level of evidence. For each disease scenario and accrual rate studied, the optimal design, defined as the design that the maximized expected total survival benefit while constraining P(detrimental effect) < 2.5%, specified α = 20% or 10%, and a sample size considerably smaller than that recommended by the traditional designs. The results were consistent under different assumed distributions for treatment effect.
The simulations assumed no toxicity issues and did not consider interim analyses.
It is worthwhile to consider a design paradigm that seeks to maximize the expected survival benefit across a series of trials, over a longer research horizon. In today's environment of constrained, biomarker-selected populations, our results indicate that smaller sample sizes and larger α values lead to greater long-term survival gains compared to traditional large trials designed to meet stringent criteria with a low efficacy bar.
传统的临床试验设计旨在明确证实实验治疗的优越性,这导致了风险规避标准和大样本量。越来越多的常见癌症被认为由具有特定靶向治疗异常的小亚群组成,使得大型试验变得不可行。
比较不同试验设计策略在长期研究中的表现。
我们使用不同的设计参数模拟了一系列为期 15 年的两治疗优势试验。试验参数包括:建立优越性的阳性试验数量(单试验与双试验规则)、α 水平(2.5%-50%)和 15 年期间的试验数量 K(因此,试验样本量)。评估了不同疾病情况、入组率和治疗效果分布下的设计参数。使用的指标包括 15 年时通过风险比(HR)衡量的总体生存获益,即第 15 年与第 0 年的比较。我们还计算了预期的总生存获益和在第 15 年选择作为新治疗标准的治疗效果劣于初始对照治疗的风险,即 P(有害效应)。
当使用传统标准的单阳性试验(而非双阳性试验)和放宽α值从 2.5%至 20%进行比建议更多(更小)的试验时,15 年的预期生存获益最大化,并且降低样本量和放宽α值也增加了在研究结束时选择劣效治疗的可能性。α 和 K 对生存获益的影响取决于特定的疾病情况和入组率:在预后较好和/或入组率较低的疾病中放宽α的获益更大,在预后较差的疾病中增加 K 的获益更大。当使用严格(标准)水平的证据时,较小样本量的试验表现不佳。对于研究的每种疾病情况和入组率,最优设计定义为在限制 P(有害效应)<2.5%的同时最大化预期总生存获益的设计,规定α值为 20%或 10%,以及比传统设计建议的样本量小得多。在不同假设的治疗效果分布下,结果是一致的。
模拟不考虑毒性问题,也不考虑中期分析。
考虑一种设计范式是值得的,该范式旨在在较长的研究期间内通过一系列试验最大化预期的生存获益。在当今受限制、基于生物标志物选择人群的环境下,与旨在满足严格标准且疗效低的大型试验相比,我们的结果表明,较小的样本量和较大的α值可带来更大的长期生存获益。