• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

肿瘤学中以时间为事件的当代 III 期随机对照试验中统计检验的经验功效比较。

Empirical power comparison of statistical tests in contemporary phase III randomized controlled trials with time-to-event outcomes in oncology.

机构信息

Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA.

Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA.

出版信息

Clin Trials. 2020 Dec;17(6):597-606. doi: 10.1177/1740774520940256. Epub 2020 Sep 15.

DOI:10.1177/1740774520940256
PMID:32933339
Abstract

BACKGROUND

More than 95% of recent cancer randomized controlled trials used the log-rank test to detect a treatment difference making it the predominant tool for comparing two survival functions. As with other tests, the log-rank test has both advantages and disadvantages. One advantage is that it offers the highest power against proportional hazards differences, which may be a major reason why alternative methods have rarely been employed in practice. The performance of statistical tests has traditionally been investigated both theoretically and numerically for several patterns of difference between two survival functions. However, to the best of our knowledge, there has been no attempt to compare the performance of various statistical tests using empirical data from past oncology randomized controlled trials. So, it is unknown whether the log-rank test offers a meaningful power advantage over alternative testing methods in contemporary cancer randomized controlled trials. Focusing on recently reported phase III cancer randomized controlled trials, we assessed whether the log-rank test gave meaningfully greater power when compared with five alternative testing methods: generalized Wilcoxon, test based on maximum of test statistics from multiple weighted log-rank tests, difference in -year event rate, and difference in restricted mean survival time with fixed and adaptive .

METHODS

Using manuscripts from cancer randomized controlled trials recently published in high-tier clinical journals, we reconstructed patient-level data for overall survival (69 trials) and progression-free survival (54 trials). For each trial endpoint, we estimated the empirical power of each test. Empirical power was measured as the proportion of trials for which a test would have identified a significant result ( value < .05).

RESULTS

For overall survival, -year event rate offered the lowest (30.4%) empirical power and restricted mean survival time with fixed offered the highest (43.5%). The empirical power of the other types of tests was almost identical (36.2%-37.7%). For progression-free survival, the tests we investigated offered numerically equivalent empirical power (55.6%-61.1%). No single test consistently outperformed any other test.

CONCLUSION

The empirical power assessment with the past cancer randomized controlled trials provided new insights on the performance of statistical tests. Although the log-rank test has been used in almost all trials, our study suggests that the log-rank test is not the only option from an empirical power perspective. Near universal use of the log-rank test is not supported by a meaningful difference in empirical power. Clinical trial investigators could consider alternative methods, beyond the log-rank test, for their primary analysis when designing a cancer randomized controlled trial. Factors other than power (e.g. interpretability of the estimated treatment effect) should garner greater consideration when selecting statistical tests for cancer randomized controlled trials.

摘要

背景

超过 95%的近期癌症随机对照试验采用对数秩检验来检测治疗差异,使其成为比较两种生存函数的主要工具。与其他检验方法一样,对数秩检验有其优缺点。一个优点是,它在针对比例风险差异时具有最高的功效,这可能是替代方法在实践中很少被采用的主要原因。统计检验的性能在理论上和数值上都针对两种生存函数之间的几种差异模式进行了研究。然而,据我们所知,还没有人试图使用过去肿瘤学随机对照试验的经验数据来比较各种统计检验的性能。因此,尚不清楚在当代癌症随机对照试验中,对数秩检验相对于替代检验方法是否具有有意义的功效优势。本研究关注最近报道的 III 期癌症随机对照试验,评估了与五种替代检验方法相比,对数秩检验是否具有更有意义的功效:广义 Wilcoxon 检验、基于多个加权对数秩检验统计量最大值的检验、-年事件发生率差异检验以及固定和自适应受限平均生存时间差异检验。

方法

使用最近在高档次临床期刊上发表的癌症随机对照试验的手稿,我们为总生存(69 项试验)和无进展生存(54 项试验)重建了患者水平数据。对于每个试验终点,我们估计了每种检验的经验功效。经验功效以试验中能够识别出显著结果(值<0.05)的比例来衡量。

结果

对于总生存,-年事件发生率提供的功效最低(30.4%),而固定受限平均生存时间提供的功效最高(43.5%)。其他类型检验的功效几乎相同(36.2%-37.7%)。对于无进展生存,我们研究的检验提供了数值等效的功效(55.6%-61.1%)。没有一种检验始终优于任何其他检验。

结论

使用过去的癌症随机对照试验进行的经验功效评估为统计检验的性能提供了新的见解。尽管对数秩检验几乎在所有试验中都得到了应用,但我们的研究表明,从经验功效的角度来看,对数秩检验并不是唯一的选择。从经验功效的角度来看,对数秩检验的广泛应用并没有带来有意义的差异。癌症随机对照试验的设计者在进行试验时,可以考虑除对数秩检验之外的替代方法作为主要分析方法。在选择癌症随机对照试验的统计检验方法时,应更多地考虑功效以外的因素(例如,治疗效果估计的可解释性)。

相似文献

1
Empirical power comparison of statistical tests in contemporary phase III randomized controlled trials with time-to-event outcomes in oncology.肿瘤学中以时间为事件的当代 III 期随机对照试验中统计检验的经验功效比较。
Clin Trials. 2020 Dec;17(6):597-606. doi: 10.1177/1740774520940256. Epub 2020 Sep 15.
2
Designing clinical trials with (restricted) mean survival time endpoint: Practical considerations.设计(受限)平均生存时间终点的临床试验:实用考虑。
Clin Trials. 2020 Jun;17(3):285-294. doi: 10.1177/1740774520905563. Epub 2020 Feb 17.
3
Log-Rank Test vs MaxCombo and Difference in Restricted Mean Survival Time Tests for Comparing Survival Under Nonproportional Hazards in Immuno-oncology Trials: A Systematic Review and Meta-analysis.对数秩检验与最大连续检验和受限平均生存时间检验在免疫肿瘤学试验中非比例风险下生存比较的比较:系统评价和荟萃分析。
JAMA Oncol. 2022 Sep 1;8(9):1294-1300. doi: 10.1001/jamaoncol.2022.2666.
4
Power and sample size for randomized phase III survival trials under the Weibull model.威布尔模型下随机III期生存试验的检验效能与样本量
J Biopharm Stat. 2015;25(1):16-28. doi: 10.1080/10543406.2014.919940.
5
Comparison between asymptotic and re-randomisation tests under non-proportional hazards in a randomised controlled trial using the minimisation method.最小化法在非比例风险随机对照试验中渐近检验和再随机检验的比较。
BMC Med Res Methodol. 2024 Jul 30;24(1):166. doi: 10.1186/s12874-024-02295-2.
6
Proportional Hazards Violations in Phase III Cancer Clinical Trials: A Potential Source of Trial Misinterpretation.三期癌症临床试验中比例风险违反:潜在的试验误解来源。
Clin Cancer Res. 2024 Oct 15;30(20):4791-4799. doi: 10.1158/1078-0432.CCR-24-0566.
7
Robust group sequential designs for trials with survival endpoints and delayed response.用于具有生存终点和延迟反应试验的稳健组序贯设计。
Biom J. 2022 Feb;64(2):343-360. doi: 10.1002/bimj.202000169. Epub 2021 Dec 21.
8
Comparison of the restricted mean survival time with the hazard ratio in superiority trials with a time-to-event end point.在具有事件发生时间终点的优效性试验中,受限平均生存时间与风险比的比较。
Pharm Stat. 2018 May;17(3):202-213. doi: 10.1002/pst.1846. Epub 2017 Dec 28.
9
Designing therapeutic cancer vaccine trials with delayed treatment effect.设计具有延迟治疗效果的治疗性癌症疫苗试验。
Stat Med. 2017 Feb 20;36(4):592-605. doi: 10.1002/sim.7157. Epub 2016 Nov 2.
10
Comparison of Time-to-First Event and Recurrent-Event Methods in Randomized Clinical Trials.随机临床试验中首次事件时间和复发性事件方法的比较。
Circulation. 2018 Aug 7;138(6):570-577. doi: 10.1161/CIRCULATIONAHA.117.033065.

引用本文的文献

1
A Biomarker Signature-Guided Clinical Trial Design for Precision Medicine.一种用于精准医学的生物标志物特征引导的临床试验设计。
Stat Med. 2025 May;44(10-12):e70103. doi: 10.1002/sim.70103.
2
The Impact of Urate-Lowering Therapy in Post-Myocardial Infarction Patients: Insights From a Population-Based, Propensity Score-Matched Analysis.降尿酸治疗对心肌梗死后患者的影响:基于人群的倾向评分匹配分析的见解。
Clin Pharmacol Ther. 2022 Mar;111(3):655-663. doi: 10.1002/cpt.2473. Epub 2021 Nov 17.