• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一项模拟研究比较了九种用于评估时间事件结局的随机对照试验中处理效应的检验效能。

A simulation study comparing the power of nine tests of the treatment effect in randomized controlled trials with a time-to-event outcome.

机构信息

MRC Clinical Trials Unit at UCL, Institute of Clinical Trials and Methodology, 90 High Holborn, London, WC1V 6LJ, UK.

出版信息

Trials. 2020 Apr 6;21(1):315. doi: 10.1186/s13063-020-4153-2.

DOI:10.1186/s13063-020-4153-2
PMID:32252820
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7132898/
Abstract

BACKGROUND

The logrank test is routinely applied to design and analyse randomized controlled trials (RCTs) with time-to-event outcomes. Sample size and power calculations assume the treatment effect follows proportional hazards (PH). If the PH assumption is false, power is reduced and interpretation of the hazard ratio (HR) as the estimated treatment effect is compromised. Using statistical simulation, we investigated the type 1 error and power of the logrank (LR)test and eight alternatives. We aimed to identify test(s) that improve power with three types of non-proportional hazards (non-PH): early, late or near-PH treatment effects.

METHODS

We investigated weighted logrank tests (early, LRE; late, LRL), the supremum logrank test (SupLR) and composite tests (joint, J; combined, C; weighted combined, WC; versatile and modified versatile weighted logrank, VWLR, VWLR2) with two or more components. Weighted logrank tests are intended to be sensitive to particular non-PH patterns. Composite tests attempt to improve power across a wider range of non-PH patterns. Using extensive simulations based on real trials, we studied test size and power under PH and under simple departures from PH comprising pointwise constant HRs with a single change point at various follow-up times. We systematically investigated the influence of high or low control-arm event rates on power.

RESULTS

With no preconceived type of treatment effect, the preferred test is VWLR2. Expecting an early effect, tests with acceptable power are SupLR, C, VWLR2, J, LRE and WC. Expecting a late effect, acceptable tests are LRL, VWLR, VWLR2, WC and J. Under near-PH, acceptable tests are LR, LRE, VWLR, C, VWLR2 and SupLR. Type 1 error was well controlled for all tests, showing only minor deviations from the nominal 5%. The location of the HR change point relative to the cumulative proportion of control-arm events considerably affected power.

CONCLUSIONS

Assuming ignorance of the likely treatment effect, the best choice is VWLR2. Several non-standard tests performed well when the correct type of treatment effect was assumed. A low control-arm event rate reduced the power of weighted logrank tests targeting early effects. Test size was generally well controlled. Further investigation of test characteristics with different types of non-proportional hazards of the treatment effect is warranted.

摘要

背景

对数秩检验常用于设计和分析具有时间事件结局的随机对照试验(RCT)。样本量和功效计算假设治疗效果遵循比例风险(PH)。如果 PH 假设不成立,则功效降低,并且危害比(HR)作为估计治疗效果的解释受到影响。本研究通过统计模拟,比较了对数秩(LR)检验和八种替代检验在处理三种非比例风险(非 PH)时的Ⅰ类错误和功效,旨在寻找提高具有早期、晚期或接近 PH 治疗效果的非 PH 类型功效的检验。

方法

我们研究了加权对数秩检验(早期,LRE;晚期,LRL)、最大对数秩检验(SupLR)和复合检验(联合,J;联合,C;加权联合,WC;通用和改良通用加权对数秩,VWLR,VWLR2),它们都有两个或多个组成部分。加权对数秩检验旨在对特定的非 PH 模式敏感。复合检验试图在更广泛的非 PH 模式范围内提高功效。我们基于真实试验进行了广泛的模拟研究,研究了 PH 下和简单的 PH 偏离下(包括在不同随访时间点具有单一变化点的恒 HR)的检验效能和功效。我们系统地研究了高或低对照组事件率对功效的影响。

结果

在没有先验治疗效果类型的情况下,首选检验是 VWLR2。如果预期有早期效果,可接受功效的检验是 SupLR、C、VWLR2、J、LRE 和 WC。如果预期有晚期效果,可接受的检验是 LRL、VWLR、VWLR2、WC 和 J。在接近 PH 情况下,可接受的检验是 LR、LRE、VWLR、C、VWLR2 和 SupLR。所有检验的Ⅰ类错误均得到很好的控制,仅显示出与名义 5%的轻微偏差。HR 变化点相对于对照组事件累积比例的位置极大地影响了功效。

结论

在不知道可能的治疗效果的情况下,最好的选择是 VWLR2。当假设正确的治疗效果类型时,几种非标准检验表现良好。低对照组事件率降低了针对早期效果的加权对数秩检验的功效。检验效能通常得到很好的控制。需要进一步研究不同类型的治疗效果非比例风险的检验特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/a33ebaf1bf41/13063_2020_4153_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/cf4b4dddb8f8/13063_2020_4153_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/f48256fa0d0d/13063_2020_4153_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/8bc87bf38acd/13063_2020_4153_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/9da5777660d6/13063_2020_4153_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/096cfc7c5880/13063_2020_4153_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/4b92505af763/13063_2020_4153_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/a5d50843803a/13063_2020_4153_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/45801a30ff8d/13063_2020_4153_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/9b755db205b4/13063_2020_4153_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/6000c6d2fed3/13063_2020_4153_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/107080a5d788/13063_2020_4153_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/279d3eefdd9a/13063_2020_4153_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/a33ebaf1bf41/13063_2020_4153_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/cf4b4dddb8f8/13063_2020_4153_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/f48256fa0d0d/13063_2020_4153_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/8bc87bf38acd/13063_2020_4153_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/9da5777660d6/13063_2020_4153_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/096cfc7c5880/13063_2020_4153_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/4b92505af763/13063_2020_4153_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/a5d50843803a/13063_2020_4153_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/45801a30ff8d/13063_2020_4153_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/9b755db205b4/13063_2020_4153_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/6000c6d2fed3/13063_2020_4153_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/107080a5d788/13063_2020_4153_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/279d3eefdd9a/13063_2020_4153_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0d1/7132898/a33ebaf1bf41/13063_2020_4153_Fig13_HTML.jpg

相似文献

1
A simulation study comparing the power of nine tests of the treatment effect in randomized controlled trials with a time-to-event outcome.一项模拟研究比较了九种用于评估时间事件结局的随机对照试验中处理效应的检验效能。
Trials. 2020 Apr 6;21(1):315. doi: 10.1186/s13063-020-4153-2.
2
Comparison of survival distributions in clinical trials: A practical guidance.临床试验中生存分布的比较:实用指南。
Clin Trials. 2020 Oct;17(5):507-521. doi: 10.1177/1740774520928614. Epub 2020 Jun 27.
3
Augmenting the logrank test in the design of clinical trials in which non-proportional hazards of the treatment effect may be anticipated.在可能预期治疗效果存在非比例风险的临床试验设计中增强对数秩检验。
BMC Med Res Methodol. 2016 Feb 11;16:16. doi: 10.1186/s12874-016-0110-x.
4
Combined test versus logrank/Cox test in 50 randomised trials.50项随机试验中联合检验与对数秩检验/考克斯检验的比较
Trials. 2019 Mar 18;20(1):172. doi: 10.1186/s13063-019-3251-5.
5
Impact of a non-constant baseline hazard on detection of time-dependent treatment effects: a simulation study.非恒定基线风险对时间依赖性治疗效果检测的影响:一项模拟研究。
BMC Med Res Methodol. 2021 Aug 28;21(1):177. doi: 10.1186/s12874-021-01372-0.
6
An approach to trial design and analysis in the era of non-proportional hazards of the treatment effect.治疗效果非比例风险时代的试验设计与分析方法。
Trials. 2014 Aug 7;15:314. doi: 10.1186/1745-6215-15-314.
7
Using the geometric average hazard ratio in sample size calculation for time-to-event data with composite endpoints.使用复合终点的生存数据在样本量计算中使用几何平均风险比。
BMC Med Res Methodol. 2021 May 6;21(1):99. doi: 10.1186/s12874-021-01286-x.
8
Are non-constant rates and non-proportional treatment effects accounted for in the design and analysis of randomised controlled trials? A review of current practice.随机对照试验的设计和分析中是否考虑了非恒定率和非比例治疗效果?对当前实践的回顾。
BMC Med Res Methodol. 2019 May 16;19(1):103. doi: 10.1186/s12874-019-0749-1.
9
The Average Hazard Ratio - A Good Effect Measure for Time-to-event Endpoints when the Proportional Hazard Assumption is Violated?平均风险比——当比例风险假设不成立时,用于事件发生时间终点的良好效应量度?
Methods Inf Med. 2018 May;57(3):89-100. doi: 10.3414/ME17-01-0058. Epub 2018 May 2.
10
A Comparison of Statistical Methods for Time-To-Event Analyses in Randomized Controlled Trials Under Non-Proportional Hazards.非比例风险下随机对照试验中事件发生时间分析的统计方法比较
Stat Med. 2025 Feb 28;44(5):e70019. doi: 10.1002/sim.70019.

引用本文的文献

1
Sample Size Reestimation in Stochastic Curtailment Tests With Time-to-Events Outcome in the Case of Nonproportional Hazards Utilizing Two Weibull Distributions With Unknown Shape Parameters.在非比例风险情况下,利用两个形状参数未知的威布尔分布,对具有事件发生时间结局的随机截尾试验进行样本量重新估计。
Pharm Stat. 2025 Jan-Feb;24(1):e2429. doi: 10.1002/pst.2429. Epub 2024 Aug 18.
2
Methods for non-proportional hazards in clinical trials: A systematic review.临床试验中非比例风险方法:系统评价。
Stat Methods Med Res. 2024 Jun;33(6):1069-1092. doi: 10.1177/09622802241242325. Epub 2024 Apr 9.
3
Simultaneous inference procedures for the comparison of multiple characteristics of two survival functions.

本文引用的文献

1
Methods for Accommodating Nonproportional Hazards in Clinical Trials: Ready for the Primary Analysis?临床试验中处理非比例风险的方法:准备好进行初步分析了吗?
J Clin Oncol. 2019 Dec 10;37(35):3455-3459. doi: 10.1200/JCO.19.01681. Epub 2019 Oct 24.
2
Combined test versus logrank/Cox test in 50 randomised trials.50项随机试验中联合检验与对数秩检验/考克斯检验的比较
Trials. 2019 Mar 18;20(1):172. doi: 10.1186/s13063-019-3251-5.
3
Nivolumab for Recurrent Squamous-Cell Carcinoma of the Head and Neck.纳武利尤单抗用于复发性头颈部鳞状细胞癌
同时比较两种生存函数多个特征的推断程序。
Stat Methods Med Res. 2024 Apr;33(4):589-610. doi: 10.1177/09622802241231497. Epub 2024 Mar 11.
4
Statistical considerations in long-term efficacy evaluation of anti-cancer therapies.抗癌疗法长期疗效评估中的统计学考量
Front Pharmacol. 2023 Oct 3;14:1265953. doi: 10.3389/fphar.2023.1265953. eCollection 2023.
5
Empirical methods for the validation of time-to-event mathematical models taking into account uncertainty and variability: application to EGFR + lung adenocarcinoma.考虑不确定性和变异性的时间事件数学模型验证的经验方法:在 EGFR+肺腺癌中的应用。
BMC Bioinformatics. 2023 Sep 4;24(1):331. doi: 10.1186/s12859-023-05430-w.
6
Futility monitoring for randomized clinical trials with non-proportional hazards: An optimal conditional power approach.随机临床试验中具有非比例风险的无效监测:最优条件功效方法。
Clin Trials. 2023 Dec;20(6):603-612. doi: 10.1177/17407745231181908. Epub 2023 Jun 27.
7
Exposure to phenytoin associates with a lower risk of post-COVID cognitive deficits: a cohort study.一项队列研究表明,接触苯妥英钠与新冠后认知缺陷风险较低相关。
Brain Commun. 2022 Aug 16;4(4):fcac206. doi: 10.1093/braincomms/fcac206. eCollection 2022.
8
Which test for crossing survival curves? A user's guideline.哪种检验方法适用于比较生存曲线?使用指南。
BMC Med Res Methodol. 2022 Jan 30;22(1):34. doi: 10.1186/s12874-022-01520-0.
9
Impact of a non-constant baseline hazard on detection of time-dependent treatment effects: a simulation study.非恒定基线风险对时间依赖性治疗效果检测的影响:一项模拟研究。
BMC Med Res Methodol. 2021 Aug 28;21(1):177. doi: 10.1186/s12874-021-01372-0.
10
Assessing the Impact of COVID-19 on the Clinical Trial Objective and Analysis of Oncology Clinical Trials-Application of the Estimand Framework.评估2019冠状病毒病对临床试验目标的影响及肿瘤学临床试验分析——估计量框架的应用
Stat Biopharm Res. 2020 Jul 14;12(4):427-437. doi: 10.1080/19466315.2020.1785543.
N Engl J Med. 2016 Nov 10;375(19):1856-1867. doi: 10.1056/NEJMoa1602252. Epub 2016 Oct 8.
4
Comparison of Treatment Effects Measured by the Hazard Ratio and by the Ratio of Restricted Mean Survival Times in Oncology Randomized Controlled Trials.肿瘤学随机对照试验中风险比和受限平均生存时间比测量的治疗效果比较。
J Clin Oncol. 2016 May 20;34(15):1813-9. doi: 10.1200/JCO.2015.64.2488. Epub 2016 Feb 16.
5
Augmenting the logrank test in the design of clinical trials in which non-proportional hazards of the treatment effect may be anticipated.在可能预期治疗效果存在非比例风险的临床试验设计中增强对数秩检验。
BMC Med Res Methodol. 2016 Feb 11;16:16. doi: 10.1186/s12874-016-0110-x.
6
Ovarian cancer screening and mortality in the UK Collaborative Trial of Ovarian Cancer Screening (UKCTOCS): a randomised controlled trial.英国卵巢癌筛查协作试验(UKCTOCS)中的卵巢癌筛查与死亡率:一项随机对照试验
Lancet. 2016 Mar 5;387(10022):945-956. doi: 10.1016/S0140-6736(15)01224-6. Epub 2015 Dec 17.
7
An approach to trial design and analysis in the era of non-proportional hazards of the treatment effect.治疗效果非比例风险时代的试验设计与分析方法。
Trials. 2014 Aug 7;15:314. doi: 10.1186/1745-6215-15-314.
8
Saxagliptin and cardiovascular outcomes in patients with type 2 diabetes mellitus.沙格列汀与 2 型糖尿病患者的心血管结局。
N Engl J Med. 2013 Oct 3;369(14):1317-26. doi: 10.1056/NEJMoa1307684. Epub 2013 Sep 2.
9
Penicillin to prevent recurrent leg cellulitis.青霉素预防腿部蜂窝织炎复发。
N Engl J Med. 2013 May 2;368(18):1695-703. doi: 10.1056/NEJMoa1206300.
10
Flexible parametric proportional-hazards and proportional-odds models for censored survival data, with application to prognostic modelling and estimation of treatment effects.用于删失生存数据的灵活参数比例风险模型和比例优势模型及其在预后建模和治疗效果估计中的应用
Stat Med. 2002 Aug 15;21(15):2175-97. doi: 10.1002/sim.1203.