Sim Julius
School of Primary, Community and Social Care and Keele Clinical Trials Unit, Keele University, Staffordshire, ST5 5BG UK.
Pilot Feasibility Stud. 2019 Aug 28;5:107. doi: 10.1186/s40814-019-0493-7. eCollection 2019.
Feasibility studies and external pilot studies are used increasingly to inform planning decisions related to a definitive randomized controlled trial. These studies can provide information on process measures, such as consent rates, treatment fidelity and compliance, and methods of outcome measurement. Additionally, they can provide initial parameter estimates for a sample size calculation, such as a standard deviation or the 'success' rate for a binary outcome in the control group. However, the issue of estimating treatment effects in pilot or feasibility studies is controversial.
Between-group estimates of treatment effect from pilot studies are sometimes used to calculate the sample size for a main trial, alongside estimated standard deviations. However, whilst estimating a standard deviation is an empirical matter, a targeted treatment effect should be established in terms of clinical judgement, as a minimum important difference (MID), not through analysis of pilot data. Secondly, between-group effects measured in pilot studies are sometimes used to indicate the magnitude of an effect that might be obtained in a main trial, and a decision on progression made with reference to the associated confidence interval. Such estimates will be imprecise in typically small pilot studies and therefore do not allow a robust decision on a main trial; both a decision to proceed and a decision not to proceed may be made too readily. Thirdly, a within-group change might be estimated from a pilot or a feasibility study in a desire to assess the potential efficacy of a novel intervention prior to testing it in a main trial, but again such estimates are liable to be imprecise and do not allow sound causal inferences.
Treatment effects calculated from pilot or feasibility studies should not be the basis of a sample size calculation for a main trial, as the MID to be detected should be based primarily on clinical judgement rather than statistics. Deciding on progression to a main trial based on these treatment effects is also misguided, as they will normally be imprecise, and may be biased if the pilot or feasibility study is unrepresentative of the main trial.
可行性研究和外部预试验越来越多地用于为与确定性随机对照试验相关的规划决策提供信息。这些研究可以提供关于过程指标的信息,如同意率、治疗保真度和依从性以及结局测量方法。此外,它们可以为样本量计算提供初始参数估计,例如标准差或对照组二元结局的“成功率”。然而,在预试验或可行性研究中估计治疗效果的问题存在争议。
预试验中组间治疗效果估计值有时会与估计的标准差一起用于计算主要试验的样本量。然而,虽然估计标准差是一个实证问题,但目标治疗效果应以临床判断为依据,确定为最小重要差异(MID),而不是通过对预试验数据的分析。其次,预试验中测量的组间效果有时用于表明在主要试验中可能获得的效果大小,并参考相关置信区间做出是否推进试验的决定。在典型的小型预试验中,此类估计将不准确,因此无法就主要试验做出可靠的决定;继续试验和不继续试验的决定都可能过于草率。第三,可能会从预试验或可行性研究中估计组内变化,以便在主要试验中测试新干预措施之前评估其潜在疗效,但同样,此类估计容易不准确,并且无法得出合理的因果推断。
从预试验或可行性研究中计算出的治疗效果不应作为主要试验样本量计算的基础,因为要检测的MID应主要基于临床判断而非统计数据。基于这些治疗效果决定是否推进到主要试验也是错误的,因为它们通常不准确,如果预试验或可行性研究不能代表主要试验,可能会产生偏差。