Hendrickson Rebecca C, Thomas Ronald G, Schork Nicholas J, Raskind Murray A
VISN 20 Northwest Network Mental Illness Research, Education and Clinical Center (MIRECC), VA Puget Sound Health Care System, Seattle, WA, United States.
Department of Psychiatry and Behavioral Sciences, University of Washington School of Medicine, Seattle, WA, United States.
Front Digit Health. 2020 Aug 28;2:13. doi: 10.3389/fdgth.2020.00013. eCollection 2020.
Parallel-group randomized controlled trials (PG-RCTs) are the gold standard for detecting differences in mean improvement across treatment conditions. However, PG-RCTs provide limited information about individuals, making them poorly optimized for quantifying the relationship of a biomarker measured at baseline with treatment response. In N-of-1 trials, an individual subject moves between treatment conditions to determine their specific response to each treatment. Aggregated N-of-1 trials analyze a cohort of such participants, and can be designed to optimize both statistical power and clinical or logistical constraints, such as allowing all participants to begin with an open-label stabilization phase to facilitate the enrollment of more acutely symptomatic participants. Here, we describe a set of statistical simulation studies comparing the power of four different trial designs to detect a relationship between a predictive biomarker measured at baseline and subjects' specific response to the PTSD pharmacotherapeutic agent prazosin. Data was simulated from 4 trial designs: (1) open-label; (2) open-label + blinded discontinuation; (3) traditional crossover; and (4) open label + blinded discontinuation + brief crossover (the N-of-1 design). Designs were matched in length and assessments. The primary outcome, analyzed with a linear mixed effects model, was whether a statistically significant association between biomarker value and response to prazosin was detected with 5% Type I error. Simulations were repeated 1,000 times to determine power and bias, with varied parameters. Trial designs 2 & 4 had substantially higher power with fewer subjects than open label design. Trial design 4 also had higher power than trial design 2. Trial design 4 had slightly lower power than the traditional crossover design, although power declined much more rapidly as carryover was introduced. These results suggest that an aggregated N-of-1 trial design beginning with an open label titration phase may provide superior power over open label or open label and blinded discontinuation designs, and similar power to a traditional crossover design, in detecting an association between a predictive biomarker and the clinical response to the PTSD pharmacotherapeutic prazosin. This is achieved while allowing all participants to spend the first 8 weeks of the trial on open-label active treatment.
平行组随机对照试验(PG-RCT)是检测不同治疗条件下平均改善差异的金标准。然而,PG-RCT提供的个体信息有限,使其在量化基线测量的生物标志物与治疗反应之间的关系方面优化不足。在单病例试验(N-of-1试验)中,个体受试者在不同治疗条件之间转换,以确定其对每种治疗的具体反应。汇总的单病例试验分析一组此类参与者,并且可以设计为优化统计效力以及临床或后勤方面的限制,例如允许所有参与者从开放标签稳定期开始,以促进更多急性症状参与者的入组。在此,我们描述了一组统计模拟研究,比较了四种不同试验设计检测基线测量的预测性生物标志物与受试者对创伤后应激障碍(PTSD)药物治疗剂哌唑嗪的具体反应之间关系的效力。数据是从4种试验设计模拟而来:(1)开放标签;(2)开放标签+盲法停药;(3)传统交叉;以及(4)开放标签+盲法停药+简短交叉(单病例试验设计)。各设计在长度和评估方面进行了匹配。使用线性混合效应模型分析的主要结果是,在I型错误率为5%的情况下,是否检测到生物标志物值与对哌唑嗪反应之间存在统计学显著关联。模拟重复进行1000次以确定效力和偏差,参数各不相同。试验设计2和4在受试者数量较少的情况下具有比开放标签设计更高的效力。试验设计4的效力也高于试验设计2。试验设计4的效力略低于传统交叉设计,尽管随着引入残留效应,效力下降得更快。这些结果表明,从开放标签滴定阶段开始的汇总单病例试验设计,在检测预测性生物标志物与PTSD药物治疗哌唑嗪的临床反应之间的关联时,可能比开放标签或开放标签加盲法停药设计具有更高的效力,并且与传统交叉设计具有相似的效力。这是在允许所有参与者在试验的前8周接受开放标签活性治疗的情况下实现的。