Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Y920, Lee Shau Kee Building, Hung Hom, Kowloon, Hong Kong, China.
Department of Biomedical Engineering, Faculty of Engineering, The Hong Kong Polytechnic University, Hong Kong, China.
Sci Rep. 2023 Oct 25;13(1):18263. doi: 10.1038/s41598-023-45477-6.
Image perturbation is a promising technique to assess radiomic feature repeatability, but whether it can achieve the same effect as test-retest imaging on model reliability is unknown. This study aimed to compare radiomic model reliability based on repeatable features determined by the two methods using four different classifiers. A 191-patient public breast cancer dataset with 71 test-retest scans was used with pre-determined 117 training and 74 testing samples. We collected apparent diffusion coefficient images and manual tumor segmentations for radiomic feature extraction. Random translations, rotations, and contour randomizations were performed on the training images, and intra-class correlation coefficient (ICC) was used to filter high repeatable features. We evaluated model reliability in both internal generalizability and robustness, which were quantified by training and testing AUC and prediction ICC. Higher testing performance was found at higher feature ICC thresholds, but it dropped significantly at ICC = 0.95 for the test-retest model. Similar optimal reliability can be achieved with testing AUC = 0.7-0.8 and prediction ICC > 0.9 at the ICC threshold of 0.9. It is recommended to include feature repeatability analysis using image perturbation in any radiomic study when test-retest is not feasible, but care should be taken when deciding the optimal feature repeatability criteria.
图像扰动是评估放射组学特征可重复性的一种很有前途的技术,但它是否能像重复成像一样对模型可靠性产生相同的效果尚不清楚。本研究旨在比较两种方法确定的可重复特征的放射组学模型可靠性,使用了四种不同的分类器。我们使用了一个包含 71 次重复扫描的 191 例公共乳腺癌数据集,其中有 117 个训练样本和 74 个测试样本。我们采集了表观扩散系数图像和手动肿瘤分割,用于放射组学特征提取。对训练图像进行了随机平移、旋转和轮廓随机化,使用组内相关系数(ICC)来筛选高可重复性特征。我们在内部泛化和稳健性方面评估了模型的可靠性,通过训练和测试 AUC 和预测 ICC 来量化。在特征 ICC 阈值较高时,测试性能较高,但在 ICC=0.95 时,测试-重测模型的性能显著下降。在 ICC 阈值为 0.9 时,测试 AUC=0.7-0.8 和预测 ICC>0.9 可以达到相似的最佳可靠性。建议在无法进行重复成像时,在任何放射组学研究中都使用图像扰动进行特征重复性分析,但在决定最佳特征重复性标准时需要谨慎。