Broglio Kristine R, Berry Donald A
Department of Statistics, Texas A&M University, College Station, TX, USA.
J Natl Cancer Inst. 2009 Dec 2;101(23):1642-9. doi: 10.1093/jnci/djp369. Epub 2009 Nov 9.
Whether progression-free survival (PFS) or overall survival (OS) is the more appropriate endpoint in clinical trials of metastatic cancer is controversial. In some disease and treatment settings, an improvement in PFS does not result in an improved OS.
We partitioned OS into two parts and expressed it as the sum of PFS and survival postprogression (SPP). We simulated randomized clinical trials with two arms that had respective medians for PFS of 6 and 9 months. We assumed no treatment difference in median SPP. We found the probability of a statistically significant benefit in OS for various median SPP and observed P values for PFS. We compared the sample sizes required for PFS vs OS for various median SPP. We compare our results with the literature regarding surrogacy of PFS for OS by use of the correlation between hazard ratios for PFS and OS. All statistical tests were two-sided.
For a trial with observed P value for improvement in PFS of .001, there was a greater than 90% probability for statistical significance in OS if median SPP was 2 months but less than 20% if median SPP was 24 months. For a trial requiring 280 patients to detect a 3-month difference in PFS, 350 and 2440 patients, respectively, were required to have the same power for detecting a real difference in OS that is carried over from the 3-month benefit in PFS when the median SPP was 2 and 24 months.
Addressing SPP is important in understanding treatment effects. For clinical trials with a PFS benefit, lack of statistical significance in OS does not imply lack of improvement in OS, especially for diseases with long median SPP. Although there may be no treatment effect on SPP, its variability so dilutes the OS comparison that statistical significance is likely lost. OS is a reasonable primary endpoint when median SPP is short but is too high a bar when median SPP is long, such as longer than 12 months.
在转移性癌症的临床试验中,无进展生存期(PFS)还是总生存期(OS)作为更合适的终点存在争议。在某些疾病和治疗背景下,PFS的改善并未带来OS的改善。
我们将OS分为两部分,并将其表示为PFS与进展后生存期(SPP)之和。我们模拟了双臂随机临床试验,其中PFS的中位数分别为6个月和9个月。我们假设SPP中位数无治疗差异。我们发现了不同SPP中位数时OS具有统计学显著获益的概率,并观察了PFS的P值。我们比较了不同SPP中位数时PFS与OS所需的样本量。我们通过使用PFS和OS风险比之间的相关性,将我们的结果与关于PFS作为OS替代指标的文献进行比较。所有统计检验均为双侧检验。
对于一项PFS改善的观察P值为0.001的试验,如果SPP中位数为2个月,OS具有统计学显著性的概率大于90%,但如果SPP中位数为24个月,则概率小于20%。对于一项需要280名患者来检测PFS有3个月差异的试验,当SPP中位数为2个月和24个月时,分别需要350名和2440名患者才能具有相同的检验效能来检测从PFS的3个月获益中延续而来的OS实际差异。
考虑SPP对于理解治疗效果很重要。对于具有PFS获益的临床试验,OS缺乏统计学显著性并不意味着OS没有改善,特别是对于SPP中位数较长的疾病。尽管对SPP可能没有治疗效果,但其变异性会稀释OS比较,以至于可能失去统计学显著性。当SPP中位数较短时,OS是一个合理的主要终点,但当SPP中位数较长(如超过12个月)时,这一标准过高。