Fred Hutchinson Cancer Research Center and University of Washington School of Medicine, Seattle, WA, USA.
Haematologica. 2009 Oct;94(10):1435-9. doi: 10.3324/haematol.2009.011411.
This paper contends that commonly used clinical trial designs do not reflect clinical reality as viewed by patients or physicians. Specifically, randomized phase III designs focus on improvements that are more significant statistically than medically and put an emphasis on avoiding a false positive result that is more appropriate for diseases that are curable, in contrast to acute leukemias. The resultant large sample sizes needed for each treatment restrict the trial to one or two new treatments, although historical reality suggests the difficulty in knowing, without clinical data, whether these are the best of several new treatments. The p value-based statistics discourage use of data from previous patients in the trial to inform treatment of subsequent patients, contravening patients' assumptions. Standard phase II trials focus on a single outcome, ignoring the complexity of medical practice, and ignore prognostic heterogeneity. Finally, although patients are more interested in whether a new treatment is better than another, rather than whether it is active, randomization between different treatments does not begin until phase II trials have been completed. This paper proposes alternatives based on the Bayesian statistical approach. The thesis that I will develop here is that commonly used clinical trial designs are unrealistic in the sense that they do not correspond well to patients' views of medical practice and greatly over-simplify such practice. By emphasizing Bayesian rather than p value-based statistics and focusing on acute myeloid leukemia, I hope to familiarize physicians with some of the many new published designs that address these problems.
本文认为,常用的临床试验设计不能反映患者或医生眼中的临床实际情况。具体来说,随机 III 期设计侧重于统计学上更显著但医学上意义不大的改善,强调避免假阳性结果,这更适用于可治愈的疾病,与急性白血病形成对比。每种治疗方法都需要大量的样本量,这限制了试验只能涉及一到两种新的治疗方法,尽管从历史现实来看,在没有临床数据的情况下,很难确定这些新治疗方法是否是几种新治疗方法中的最佳方法。基于 p 值的统计数据不鼓励在试验中使用先前患者的数据来告知后续患者的治疗,这与患者的假设相悖。标准的 II 期试验仅关注单一结果,忽略了医疗实践的复杂性,也忽略了预后异质性。最后,尽管患者更关心新治疗方法是否优于其他方法,而不是其是否有效,但直到 II 期试验完成后,才开始对不同治疗方法进行随机分组。本文提出了基于贝叶斯统计方法的替代方案。我将在这里发展的论点是,常用的临床试验设计在不对应患者对医疗实践的看法且大大简化这种实践的意义上是不现实的。通过强调贝叶斯统计而不是基于 p 值的统计,并专注于急性髓系白血病,我希望让医生熟悉解决这些问题的许多新发布的设计。