Department of Radiology, University of Washington, Seattle, 98109-1023, USA.
Acad Radiol. 2012 Feb;19(2):184-90. doi: 10.1016/j.acra.2011.10.008. Epub 2011 Nov 21.
Positron emission tomography (PET) is used to evaluate response to therapy with increasing interest in having PET provide endpoints for clinical trials. Here we demonstrate impacts of PET measurement error and choice of quantification method on clinical trial design.
Sample size was calculated for two-arm randomized trials with percent change in (18)F-fluorodeoxyglucose (FDG) PET uptake as an efficacy endpoint. Two methods of uptake quantification were considered: standardized uptake values (SUVs) and kinetic measures from dynamic imaging. Calculations assumed a 20 percentage point difference in treatment groups' average percent change, and yielded 80% power at α = 0.05. The range of precision (10%-40%) in PET uptake measures was based on review of the literature. The range of SUV sensitivities (50%-100%) relative to kinetic analyses was based on a study of 75 locally advanced breast cancer patients.
Sample sizes increased from 8 to 126 as PET precision worsened from 10% to 40% at full measurement sensitivity to true change. In a subgroup with low initial FDG uptake, a sample size of 126 was required under 20% standard deviation using clinical SUVs. More sophisticated imaging quantification could reduce this sample size to 32.
The dependence of sample size on measurement precision and the sensitivity of imaging measures to true change should be considered in single site and multicenter PET trials to avoid underpowered studies with inconclusive results. Sophisticated PET imaging methods that are more sensitive to changes in uptake may be advantageous in early studies with limited patient numbers.
正电子发射断层扫描(PET)用于评估治疗反应,越来越多的人对将 PET 作为临床试验的终点感兴趣。在这里,我们展示了 PET 测量误差和量化方法选择对临床试验设计的影响。
对于以(18)F-氟脱氧葡萄糖(FDG)PET 摄取百分比变化为疗效终点的双臂随机试验,计算了样本量。考虑了两种摄取量化方法:标准化摄取值(SUVs)和动态成像的动力学测量。计算假设治疗组平均百分比变化的差异为 20 个百分点,α=0.05 时,效力为 80%。PET 摄取测量精度(10%-40%)的范围基于文献综述。SUV 相对于动力学分析的灵敏度范围(50%-100%)基于 75 名局部晚期乳腺癌患者的研究。
随着 PET 精度从 10%恶化至 40%,即从完全测量灵敏度到真实变化,样本量从 8 增加到 126。在初始 FDG 摄取较低的亚组中,使用临床 SUV,在 20%标准差下,需要 126 个样本量。更复杂的成像量化可以将样本量减少到 32。
在单站点和多中心 PET 试验中,应考虑样本量对测量精度的依赖性以及成像测量对真实变化的敏感性,以避免出现样本量不足且结果不确定的研究。在患者数量有限的早期研究中,更灵敏的摄取变化的复杂 PET 成像方法可能具有优势。