Huang Bo, Kuan Pei-Fen
Pfizer Inc, Groton, CT, USA.
Department of Applied Math and Statistics, Stony Brook University, Stony Brook, NY, USA.
Pharm Stat. 2018 May;17(3):202-213. doi: 10.1002/pst.1846. Epub 2017 Dec 28.
With the emergence of novel therapies exhibiting distinct mechanisms of action compared to traditional treatments, departure from the proportional hazard (PH) assumption in clinical trials with a time-to-event end point is increasingly common. In these situations, the hazard ratio may not be a valid statistical measurement of treatment effect, and the log-rank test may no longer be the most powerful statistical test. The restricted mean survival time (RMST) is an alternative robust and clinically interpretable summary measure that does not rely on the PH assumption. We conduct extensive simulations to evaluate the performance and operating characteristics of the RMST-based inference and against the hazard ratio-based inference, under various scenarios and design parameter setups. The log-rank test is generally a powerful test when there is evident separation favoring 1 treatment arm at most of the time points across the Kaplan-Meier survival curves, but the performance of the RMST test is similar. Under non-PH scenarios where late separation of survival curves is observed, the RMST-based test has better performance than the log-rank test when the truncation time is reasonably close to the tail of the observed curves. Furthermore, when flat survival tail (or low event rate) in the experimental arm is expected, selecting the minimum of the maximum observed event time as the truncation timepoint for the RMST is not recommended. In addition, we recommend the inclusion of analysis based on the RMST curve over the truncation time in clinical settings where there is suspicion of substantial departure from the PH assumption.
随着与传统治疗相比具有独特作用机制的新型疗法的出现,在以事件发生时间为终点的临床试验中偏离比例风险(PH)假设的情况越来越普遍。在这些情况下,风险比可能不是治疗效果的有效统计量度,对数秩检验可能不再是最有效的统计检验。受限平均生存时间(RMST)是一种替代的稳健且具有临床可解释性的汇总量度,它不依赖于PH假设。我们进行了广泛的模拟,以评估在各种场景和设计参数设置下基于RMST的推断与基于风险比的推断的性能和操作特征。当在Kaplan-Meier生存曲线的大多数时间点存在明显有利于一个治疗组的分离时,对数秩检验通常是一种有效的检验,但RMST检验的性能与之相似。在观察到生存曲线后期分离的非PH场景下,当截断时间合理地接近观察曲线的尾部时,基于RMST的检验比对数秩检验具有更好的性能。此外,当预计试验组存在平坦的生存尾部(或低事件发生率)时,不建议选择最大观察事件时间的最小值作为RMST的截断时间点。另外,在怀疑严重偏离PH假设的临床环境中,我们建议纳入基于截断时间的RMST曲线的分析。