Velasquez Linda M, Boellaard Ronald, Kollia Georgia, Hayes Wendy, Hoekstra Otto S, Lammertsma Adriaan A, Galbraith Susan M
Bristol-Myers Squibb Co, Princeton, New Jersey 08543, USA.
J Nucl Med. 2009 Oct;50(10):1646-54. doi: 10.2967/jnumed.109.063347. Epub 2009 Sep 16.
(18)F-FDG PET is often used to monitor tumor response in multicenter oncology clinical trials. This study assessed the repeatability of several semiquantitative standardized uptake values (mean SUV [SUV(mean)], maximum SUV [SUV(max)], peak SUV [SUV(peak)], and the 3-dimensional isocontour at 70% of the maximum pixel value [SUV(70%)]) as measured by repeated baseline (18)F-FDG PET studies in a multicenter phase I oncology trial.
Double-baseline (18)F-FDG PET studies were acquired for 62 sequentially enrolled patients. Tumor metabolic activity was assessed by SUV(mean), SUV(max), SUV(peak), and SUV(70%). The effect on SUV repeatability of compliance with recommended image-acquisition guidelines and quality assurance (QA) standards was assessed. Summary statistics for absolute differences relative to the average of baseline values and repeatability analysis were performed for all patients and for a subgroup that passed QA, in both a multi- and a single-observer setting. Intrasubject precision of baseline measurements was assessed by repeatability coefficients, intrasubject coefficients of variation (CV), and confidence intervals on mean baseline differences for all SUV parameters.
The mean differences between the 2 SUV baseline measurements were small, varying from -2.1% to 1.9%, and the 95% confidence intervals for these mean differences had a maximum half-width of about 5.6% across the SUV parameters assessed. For SUV(max), the intrasubject CV varied from 10.7% to 12.8% for the QA multi- and single-observer datasets and was 16% for the full dataset. The 95% repeatability coefficients ranged from -28.4% to 39.6% for the QA datasets and up to -34.3% to 52.3% for the full dataset.
Repeatability results of double-baseline (18)F-FDG PET scans were similar for all SUV parameters assessed, for both the full and the QA datasets, in both the multi- and the single-observer settings. Centralized quality assurance and analysis of data improved intrasubject CV from 15.9% to 10.7% for averaged SUV(max). Thresholds for metabolic response in the multicenter multiobserver non-QA settings were -34% and 52% and in the range of -26% to 39% with centralized QA. These results support the use of (18)F-FDG PET for tumor assessment in multicenter oncology clinical trials.
(18)F-FDG PET常用于多中心肿瘤学临床试验中监测肿瘤反应。本研究评估了在一项多中心I期肿瘤学试验中,通过重复基线(18)F-FDG PET研究测量的几个半定量标准化摄取值(平均SUV [SUV(mean)]、最大SUV [SUV(max)]、峰值SUV [SUV(peak)]以及最大像素值70%处的三维等轮廓线[SUV(70%)])的可重复性。
对62例连续入组的患者进行了双基线(18)F-FDG PET研究。通过SUV(mean)、SUV(max)、SUV(peak)和SUV(70%)评估肿瘤代谢活性。评估了遵守推荐的图像采集指南和质量保证(QA)标准对SUV可重复性的影响。在多观察者和单观察者设置下,对所有患者以及通过QA的亚组,进行了相对于基线值平均值的绝对差异的汇总统计和可重复性分析。通过可重复性系数、受试者内变异系数(CV)以及所有SUV参数的平均基线差异的置信区间,评估了基线测量的受试者内精密度。
两次SUV基线测量之间的平均差异较小,在-2.1%至1.9%之间,并且在评估的SUV参数中,这些平均差异的95%置信区间的最大半宽约为5.6%。对于SUV(max),在QA多观察者和单观察者数据集中,受试者内CV在10.7%至12.8%之间,在完整数据集中为16%。对于QA数据集,95%可重复性系数在-28.4%至39.6%之间,对于完整数据集高达-34.3%至52.3%。
在多观察者和单观察者设置下,对于完整数据集和QA数据集,所有评估的SUV参数的双基线(18)F-FDG PET扫描的可重复性结果相似。集中式质量保证和数据分析将平均SUV(max)的受试者内CV从15.9%提高到了10.7%。在多中心多观察者非QA设置下,代谢反应的阈值为-34%和52%,在集中式QA时为-26%至39%。这些结果支持在多中心肿瘤学临床试验中使用(18)F-FDG PET进行肿瘤评估。