Center for Medical Statistics, Informatics and Intelligent Systems, Medical University of Vienna, Vienna, Austria.
Statistical Innovation, Data Science & Artificial Intelligence, AstraZeneca, Gothenburg, Sweden.
BMC Med Res Methodol. 2022 Aug 15;22(1):228. doi: 10.1186/s12874-022-01683-w.
Platform trials can evaluate the efficacy of several experimental treatments compared to a control. The number of experimental treatments is not fixed, as arms may be added or removed as the trial progresses. Platform trials are more efficient than independent parallel group trials because of using shared control groups. However, for a treatment entering the trial at a later time point, the control group is divided into concurrent controls, consisting of patients randomised to control when that treatment arm is in the platform, and non-concurrent controls, patients randomised before. Using non-concurrent controls in addition to concurrent controls can improve the trial's efficiency by increasing power and reducing the required sample size, but can introduce bias due to time trends.
We focus on a platform trial with two treatment arms and a common control arm. Assuming that the second treatment arm is added at a later time, we assess the robustness of recently proposed model-based approaches to adjust for time trends when utilizing non-concurrent controls. In particular, we consider approaches where time trends are modeled either as linear in time or as a step function, with steps at time points where treatments enter or leave the platform trial. For trials with continuous or binary outcomes, we investigate the type 1 error rate and power of testing the efficacy of the newly added arm, as well as the bias and root mean squared error of treatment effect estimates under a range of scenarios. In addition to scenarios where time trends are equal across arms, we investigate settings with different time trends or time trends that are not additive in the scale of the model.
A step function model, fitted on data from all treatment arms, gives increased power while controlling the type 1 error, as long as the time trends are equal for the different arms and additive on the model scale. This holds even if the shape of the time trend deviates from a step function when patients are allocated to arms by block randomisation. However, if time trends differ between arms or are not additive to treatment effects in the scale of the model, the type 1 error rate may be inflated.
The efficiency gained by using step function models to incorporate non-concurrent controls can outweigh potential risks of biases, especially in settings with small sample sizes. Such biases may arise if the model assumptions of equality and additivity of time trends are not satisfied. However, the specifics of the trial, scientific plausibility of different time trends, and robustness of results should be carefully considered.
平台试验可以评估几种实验治疗方法与对照相比的疗效。实验治疗方法的数量不是固定的,因为随着试验的进展,可能会增加或删除手臂。由于使用共享对照组,平台试验比独立的平行组试验更有效率。然而,对于后来进入试验的治疗方法,对照组分为同期对照组,由该治疗手臂在平台上时随机分配到对照组的患者组成,以及非同期对照组,由之前随机分配的患者组成。除了同期对照组之外,使用非同期对照组可以通过提高功效和减少所需的样本量来提高试验的效率,但由于时间趋势的原因,可能会引入偏差。
我们专注于一个有两个治疗手臂和一个共同对照组的平台试验。假设第二治疗手臂在稍后时间加入,我们评估了最近提出的基于模型的方法在使用非同期对照组时调整时间趋势的稳健性。特别是,我们考虑了将时间趋势建模为线性或阶跃函数的方法,其中阶跃发生在治疗方法进入或离开平台试验的时间点。对于连续或二分类结局,我们研究了测试新加入手臂疗效的检验的Ⅰ类错误率和功效,以及在一系列情况下治疗效果估计的偏差和均方根误差。除了时间趋势在各手臂间相等的情况外,我们还研究了不同时间趋势或在模型尺度上不是相加的时间趋势的情况。
只要不同手臂的时间趋势相等且在模型尺度上相加,使用在所有治疗手臂的数据上拟合的阶跃函数模型可以增加功效,同时控制Ⅰ类错误率。这在患者通过区组随机化分配到手臂时时间趋势呈阶跃函数形式的情况下也成立。然而,如果时间趋势在手臂间不同或不在模型尺度上与治疗效果相加,那么Ⅰ类错误率可能会增加。
通过使用阶跃函数模型纳入非同期对照组获得的效率可以超过潜在的偏差风险,尤其是在样本量较小的情况下。如果时间趋势的相等和相加的模型假设不成立,可能会出现这种偏差。然而,应该仔细考虑试验的具体情况、不同时间趋势的科学合理性以及结果的稳健性。