Marom Edith M, Munden Reginald F, Truong Mylene T, Gladish Gregory W, Podoloff Donald A, Mawlawi Osama, Broemeling Lyle D, Bruzzi John F, Macapinlac Homer A
Departments of Diagnostic Imaging, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA.
J Thorac Imaging. 2006 Aug;21(3):205-12. doi: 10.1097/01.rti.0000213643.49664.4d.
To assess interobserver and intraobserver variabilities in measuring the maximal standardized uptake value (SUV) of non-small-cell lung cancer.
Positron emission tomography-computed tomography examinations of 20 consecutive patients referred for initial evaluation of newly diagnosed non-small-cell lung cancer were retrospectively reviewed by 5 experienced positron emission tomography-computed tomography readers, who independently measured the maximal SUV/body weight of the primary tumors. Interobserver and intraobserver variabilities were assessed by using 4 statistical methods: correlation, regression analysis, Bland-Altman analysis, and analysis of variance. The SUV measurements derived in the study were compared with the SUV measurements documented in the original reports using correlation and regression analysis. The percentages of tumors whose retrospective SUV measurements were more than 20% different and more than 25% different from those in the original report were assessed.
Both interobserver and intraobserver SUV measurements were highly reproducible. Pearson correlation coefficients were greater than 0.95 and 0.94, respectively. Good interobserver and intraobserver agreement was shown with regression analysis (F test P value >0.05), the Bland-Altman analysis, and analysis of variance (F test P value >0.95). The mean original SUV was much less than the mean study SUV (P<0.05). The study SUV differed from the SUV of the original report by more than 20% in 50% of the tumors, and by more than 25% in 45% of the tumors.
There was excellent interobserver and intraobserver agreement in SUVs measured in the study environment but poor agreement between study SUVs and those documented in original reports, which can affect treatment decisions substantially.
评估在测量非小细胞肺癌最大标准化摄取值(SUV)时观察者间和观察者内的变异性。
对20例连续转诊进行新诊断非小细胞肺癌初始评估的患者的正电子发射断层扫描 - 计算机断层扫描检查进行回顾性分析,由5名经验丰富的正电子发射断层扫描 - 计算机断层扫描阅片者独立测量原发肿瘤的最大SUV/体重。采用4种统计方法评估观察者间和观察者内的变异性:相关性分析、回归分析、Bland - Altman分析和方差分析。使用相关性和回归分析将研究中得出的SUV测量值与原始报告中记录的SUV测量值进行比较。评估回顾性SUV测量值与原始报告中测量值相差超过20%和超过25%的肿瘤百分比。
观察者间和观察者内的SUV测量均具有高度可重复性。Pearson相关系数分别大于0.95和0.94。回归分析(F检验P值>0.05)、Bland - Altman分析和方差分析(F检验P值>0.95)均显示观察者间和观察者内具有良好的一致性。原始SUV的平均值远低于研究SUV的平均值(P<0.05)。在50%的肿瘤中,研究SUV与原始报告中的SUV相差超过20%,在45%的肿瘤中相差超过25%。
在研究环境中测量的SUV,观察者间和观察者内具有极好的一致性,但研究SUV与原始报告中记录的SUV之间一致性较差,这可能会对治疗决策产生重大影响。