Sorbonne Universités, UPMC Univ Paris 06, INSERM, Institut Pierre Louis d'épidémiologie et de Santé Publique (IPLESP UMRS 1136), F75013, Paris, France.
Sorbonne Universités, UPMC Univ Paris 06, INSERM, Institut Pierre Louis d'épidémiologie et de Santé Publique (IPLESP UMRS 1136), F75013, Paris, France.
J Clin Epidemiol. 2018 Apr;96:63-72. doi: 10.1016/j.jclinepi.2017.12.010. Epub 2017 Dec 28.
The aim of this study was to investigate the use of three treatment-effect measures in the analysis of randomized trials using a time-to-event endpoint and assess their influence on the results.
A recent equivalence trial showed discordant results with the use of different measures. Different hypotheses may explain such discordant results including a mistaken hypothesized distribution of time to failure and an overestimation of failure rates in the protocol. In a simulation study, we investigate different situations in comparing analyses based hazard ratio (HR), risk difference estimated by Kaplan-Meier curves, and difference in restricted mean survival time. We also compared these three analyses on genuine data from a recent equivalence trial.
In the equivalence trial, two analyses would have concluded equivalence, whereas the original analysis based on HR estimate did not declare equivalence. Results of our simulation study indicate little to moderate differences between the three analyses when the true distribution of time to failure is different to the hypothesized distribution. The main discordant results are found when failure rates have been overestimated or underestimated regardless of the distribution.
With the distributions investigated, differences between analyses based on different measures are much more driven by mistaken hypothesized failure rates than by the shape of the distribution of time to failure.
本研究旨在探讨使用三种治疗效果测量方法分析以时间为结局的随机试验,并评估其对结果的影响。
最近的一项等效性试验使用不同的测量方法得出了不一致的结果。这些不一致的结果可能有不同的解释,包括对失败时间分布的假设错误和方案中失败率的高估。在一项模拟研究中,我们研究了基于风险比(HR)、通过 Kaplan-Meier 曲线估计的风险差异和受限平均生存时间差异进行比较分析的不同情况。我们还将这三种分析方法应用于最近一项等效性试验的真实数据中。
在等效性试验中,两种分析方法将得出等效性的结论,而基于 HR 估计的原始分析则没有宣布等效性。模拟研究的结果表明,当真实的失败时间分布与假设的分布不同时,三种分析之间的差异较小或中等。当失败率被高估或低估时,无论分布如何,都会出现主要的不一致结果。
在所研究的分布中,基于不同测量方法的分析之间的差异更多地是由假设的失败率错误驱动的,而不是由失败时间分布的形状驱动的。