Tian Lu, Fu Haoda, Ruberg Stephen J, Uno Hajime, Wei Lee-Jen
Department of Biomedical Data Science, Stanford University, California 94305, U.S.A.
Eli Lilly and Company, Indianapolis, Indiana 46285, U.S.A.
Biometrics. 2018 Jun;74(2):694-702. doi: 10.1111/biom.12770. Epub 2017 Sep 12.
In comparing two treatments with the event time observations, the hazard ratio (HR) estimate is routinely used to quantify the treatment difference. However, this model dependent estimate may be difficult to interpret clinically especially when the proportional hazards (PH) assumption is violated. An alternative estimation procedure for treatment efficacy based on the restricted means survival time or t-year mean survival time (t-MST) has been discussed extensively in the statistical and clinical literature. On the other hand, a statistical test via the HR or its asymptotically equivalent counterpart, the logrank test, is asymptotically distribution-free. In this article, we assess the relative efficiency of the hazard ratio and t-MST tests with respect to the statistical power under various PH and non-PH models theoretically and empirically. When the PH assumption is valid, the t-MST test performs almost as well as the HR test. For non-PH models, the t-MST test can substantially outperform its HR counterpart. On the other hand, the HR test can be powerful when the true difference of two survival functions is quite large at end but not the beginning of the study. Unfortunately, for this case, the HR estimate may not have a simple clinical interpretation for the treatment effect due to the violation of the PH assumption.
在比较两种具有事件时间观测值的治疗方法时,风险比(HR)估计值通常用于量化治疗差异。然而,这种依赖模型的估计值在临床上可能难以解释,尤其是当比例风险(PH)假设被违反时。基于受限平均生存时间或t年平均生存时间(t-MST)的治疗效果替代估计程序在统计和临床文献中已被广泛讨论。另一方面,通过HR或其渐近等效的对数秩检验进行的统计检验是渐近无分布的。在本文中,我们从理论和实证两方面评估了风险比检验和t-MST检验在各种PH和非PH模型下相对于统计功效的相对效率。当PH假设成立时,t-MST检验的表现几乎与HR检验一样好。对于非PH模型,t-MST检验的表现可能会大大优于HR检验。另一方面,当两种生存函数在研究结束时而非开始时的真实差异相当大时,HR检验可能会很有效。不幸的是,对于这种情况,由于PH假设被违反,HR估计值对于治疗效果可能没有简单的临床解释。