Klinik für Diagnostische Radiologie, Arnold-Heller-Straße 3, Haus 23, 24105 Kiel, Germany.
Eur J Radiol. 2013 Oct;82(10):e567-73. doi: 10.1016/j.ejrad.2013.05.030. Epub 2013 Jul 1.
To examine technical parameters of measurement accuracy and differences in tumor response classification using RECIST 1.1 and volumetric assessment in three common metastasis types (lung nodules, liver lesions, lymph node metastasis) simultaneously.
56 consecutive patients (32 female) aged 41-82 years with a wide range of metastatic solid tumors were examined with MSCT for baseline and follow up. Images were evaluated by three experienced radiologists using manual measurements and semi-automatic lesion segmentation. Institutional ethics review was obtained and all patients gave written informed consent. Data analysis comprised interobserver variability operationalized as coefficient of variation and categorical response classification according to RECIST 1.1 for both manual and volumetric measures. Continuous data were assessed for statistical significance with Wilcoxon signed-rank test and categorical data with Fleiss kappa.
Interobserver variability was 6.3% (IQR 4.6%) for manual and 4.1% (IQR 4.4%) for volumetrically obtained sum of relevant diameters (p<0.05, corrected). 4-8 patients' response to therapy was classified differently across observers by using volumetry compared to standard manual measurements. Fleiss kappa revealed no significant difference in categorical agreement of response classification between manual (0.7558) and volumetric (0.7623) measurements.
Under standard RECIST thresholds there was no advantage of volumetric compared to manual response evaluation. However volumetric assessment yielded significantly lower interobserver variability. This may allow narrower thresholds for volumetric response classification in the future.
同时检查 RECIST 1.1 和容积评估在三种常见转移类型(肺结节、肝病变、淋巴结转移)中测量准确性和肿瘤反应分类差异的技术参数。
56 例连续患者(32 例女性)年龄 41-82 岁,患有广泛的转移性实体瘤,进行 MSCT 基线和随访检查。由三位有经验的放射科医生使用手动测量和半自动病变分割对图像进行评估。获得机构伦理审查,所有患者均签署书面知情同意书。数据分析包括观察者间变异性,表现为变异系数和根据 RECIST 1.1 对手动和容积测量的分类反应。连续数据采用 Wilcoxon 符号秩检验进行统计显著性评估,分类数据采用 Fleiss kappa。
手动测量的观察者间变异性为 6.3%(IQR 4.6%),容积测量的相关直径总和的观察者间变异性为 4.1%(IQR 4.4%)(p<0.05,校正后)。与标准手动测量相比,容积测量可使 4-8 名患者的治疗反应在不同观察者之间进行分类。Fleiss kappa 显示手动(0.7558)和容积(0.7623)测量的反应分类的分类一致性没有显著差异。
在标准 RECIST 阈值下,与手动反应评估相比,容积评估没有优势。然而,容积评估的观察者间变异性显著降低。这可能允许在未来对容积反应分类采用更窄的阈值。