Department of Urology, Amsterdam University Medical Centers, Prostate Cancer Network Netherlands, Amsterdam, The Netherlands.
Department of Radiology & Nuclear Medicine, Amsterdam University Medical Centers, Cancer Center Amsterdam, Amsterdam, The Netherlands.
PLoS One. 2023 Nov 9;18(11):e0293672. doi: 10.1371/journal.pone.0293672. eCollection 2023.
Radiomics extracted from prostate-specific membrane antigen (PSMA)-PET modeled with machine learning (ML) may be used for prediction of disease risk. However, validation of previously proposed approaches is lacking. We aimed to optimize and validate ML models based on 18F-DCFPyL-PET radiomics for the prediction of lymph-node involvement (LNI), extracapsular extension (ECE), and postoperative Gleason score (GS) in primary prostate cancer (PCa) patients.
Patients with intermediate- to high-risk PCa who underwent 18F-DCFPyL-PET/CT before radical prostatectomy with pelvic lymph-node dissection were evaluated. The training dataset included 72 patients, the internal validation dataset 24 patients, and the external validation dataset 27 patients. PSMA-avid intra-prostatic lesions were delineated semi-automatically on PET and 480 radiomics features were extracted. Conventional PET-metrics were derived for comparative analysis. Segmentation, preprocessing, and ML methods were optimized in repeated 5-fold cross-validation (CV) on the training dataset. The trained models were tested on the combined validation dataset. Combat harmonization was applied to external radiomics data. Model performance was assessed using the receiver-operating-characteristics curve (AUC).
The CV-AUCs in the training dataset were 0.88, 0.79 and 0.84 for LNI, ECE, and GS, respectively. In the combined validation dataset, the ML models could significantly predict GS with an AUC of 0.78 (p<0.05). However, validation AUCs for LNI and ECE prediction were not significant (0.57 and 0.63, respectively). Conventional PET metrics-based models had comparable AUCs for LNI (0.59, p>0.05) and ECE (0.66, p>0.05), but a lower AUC for GS (0.73, p<0.05). In general, Combat harmonization improved external validation AUCs (-0.03 to +0.18).
In internal and external validation, 18F-DCFPyL-PET radiomics-based ML models predicted high postoperative GS but not LNI or ECE in intermediate- to high-risk PCa. Therefore, the clinical benefit seems to be limited. These results underline the need for external and/or multicenter validation of PET radiomics-based ML model analyses to assess their generalizability.
基于机器学习(ML)的前列腺特异性膜抗原(PSMA)-PET 模型提取的放射组学可用于预测疾病风险。然而,缺乏对先前提出的方法的验证。我们旨在优化和验证基于 18F-DCFPyL-PET 放射组学的 ML 模型,以预测中高危前列腺癌(PCa)患者的淋巴结受累(LNI)、包膜外侵犯(ECE)和术后 Gleason 评分(GS)。
对接受根治性前列腺切除术和盆腔淋巴结清扫术的中高危 PCa 患者进行 18F-DCFPyL-PET/CT 检查。训练数据集包括 72 例患者,内部验证数据集 24 例,外部验证数据集 27 例。在 PET 上半自动勾画 PSMA 活性前列腺内病变,并提取 480 个放射组学特征。为了比较分析,还得出了常规 PET 指标。在训练数据集上重复进行 5 折交叉验证(CV)以优化分割、预处理和 ML 方法。在联合验证数据集上测试训练好的模型。应用 Combat 协调对外部放射组学数据进行处理。使用受试者工作特征曲线(AUC)评估模型性能。
在训练数据集上,CV-AUC 分别为 LNI、ECE 和 GS 的 0.88、0.79 和 0.84。在联合验证数据集上,ML 模型能够显著预测 GS,AUC 为 0.78(p<0.05)。然而,LNI 和 ECE 预测的验证 AUC 不显著(分别为 0.57 和 0.63)。基于常规 PET 指标的模型对 LNI(0.59,p>0.05)和 ECE(0.66,p>0.05)的 AUC 具有可比性,但对 GS(0.73,p<0.05)的 AUC 较低。总体而言,Combat 协调将外部验证 AUC 提高了(-0.03 至 +0.18)。
在内部和外部验证中,基于 18F-DCFPyL-PET 的放射组学 ML 模型预测中高危 PCa 患者的高术后 GS,但不能预测 LNI 或 ECE。因此,临床获益似乎有限。这些结果强调了需要对基于 PET 放射组学的 ML 模型分析进行外部和/或多中心验证,以评估其泛化能力。