Department of Nuclear Medicine, University Hospital, LMU Munich, 81377 Munich, Germany.
Department of Nuclear Medicine, University Hospital, LMU Munich, 81377 Munich, Germany; German Cancer Consortium (DKTK), Partner Site Munich, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany; Bavarian Cancer Research Center (BZKF), 91054 Erlangen, Germany.
Z Med Phys. 2023 Feb;33(1):91-102. doi: 10.1016/j.zemedi.2022.12.005. Epub 2023 Jan 27.
Large datasets are required to ensure reliable non-invasive glioma assessment with radiomics-based machine learning methods. This can often only be achieved by pooling images from different centers. Moreover, trained models should perform with high accuracy when applied to data from different centers. In this study, the impact of reconstruction settings and segmentation methods on radiomic features derived from amino acid and TSPO PET images of glioma patients was examined. Additionally, the ability to model and thus reduce feature differences was investigated.
[F]FET and [F]GE-180 PET data were acquired from 19 glioma patients. For each acquisition, 10 reconstruction settings and 9 segmentation methods were included to emulate multicentric data. Statistical robustness measures were calculated before and after ComBat harmonization. Differences between features due to setting variations were assessed using Friedman test, coefficient of variation (CV) and inter-rater reliability measures, including intraclass and Spearman's rank correlation coefficients and Fleiss' Kappa.
According to Friedman analyses, most features (>60%) showed significant differences. Yet, CV and inter-rater reliability measures indicated higher robustness. ComBat resulted in almost complete harmonization (>87%) according to Friedman test and little to no improvement according to CV and inter-rater reliability measures. [F]GE-180 features were more sensitive to reconstruction settings than [F]FET features.
According to Friedman test, feature distributions could be successfully aligned using ComBat. However, depending on settings, changes in patient ranks were observed for some features and could not be eliminated by harmonization. Thus, for clinical utilization it is recommended to exclude affected features.
基于放射组学的机器学习方法需要大型数据集来确保对胶质瘤进行可靠的无创评估。这通常只能通过汇集来自不同中心的图像来实现。此外,训练有素的模型在应用于来自不同中心的数据时应具有高精度。在这项研究中,检查了重建设置和分割方法对胶质瘤患者氨基酸和 TSPO PET 图像衍生的放射特征的影响。此外,还研究了建模能力,从而减少特征差异的能力。
从 19 名胶质瘤患者中采集了[F]FET 和[F]GE-180 PET 数据。对于每次采集,包括 10 种重建设置和 9 种分割方法,以模拟多中心数据。在进行 ComBat 协调之前和之后计算了统计稳健性度量。使用 Friedman 检验、变异系数 (CV) 和组内和 Spearman 等级相关系数以及 Fleiss' Kappa 等内部评估者可靠性度量评估了由于设置变化而导致的特征差异。
根据 Friedman 分析,大多数特征(>60%)存在显著差异。然而,CV 和内部评估者可靠性度量表明稳健性更高。ComBat 根据 Friedman 检验几乎完全协调(>87%),根据 CV 和内部评估者可靠性度量几乎没有改善。[F]GE-180 特征比[F]FET 特征对重建设置更敏感。
根据 Friedman 检验,可以使用 ComBat 成功对齐特征分布。然而,根据设置的不同,对于一些特征观察到患者等级的变化,并且协调无法消除这些变化。因此,建议在临床应用中排除受影响的特征。