Department of Cancer Physiology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, 33612, USA.
Institute of Biomedical Engineering, Bogazici University, Istanbul, 34684, Turkey.
Med Phys. 2019 Nov;46(11):5075-5085. doi: 10.1002/mp.13808. Epub 2019 Sep 23.
Recent efforts have demonstrated that radiomic features extracted from the peritumoral region, the area surrounding the tumor parenchyma, have clinical utility in various cancer types. However, as like any radiomic features, peritumoral features could also be unstable and/or nonreproducible. Hence, the purpose of this study was to assess the stability and reproducibility of computed tomography (CT) radiomic features extracted from the peritumoral regions of lung lesions where stability was defined as the consistency of a feature by different segmentations, and reproducibility was defined as the consistency of a feature to different image acquisitions.
Stability was measured utilizing the "moist run" dataset and reproducibility was measured utilizing the Reference Image Database to Evaluate Therapy Response test-retest dataset. Peritumoral radiomic features were extracted from incremental distances of 3-12 mm outside the tumor segmentation. A total of 264 statistical, histogram, and texture radiomic features were assessed from the selected peritumoral region-of-interests (ROIs). All features (except wavelet texture features) were extracted using standardized algorithms defined by the Image Biomarker Standardisation Initiative. Stability and reproducibility of features were assessed using the concordance correlation coefficient. The clinical utility of stable and reproducible peritumoral features was tested in three previously published lung cancer datasets using overall survival as the endpoint.
Features found to be stable and reproducible, regardless of the peritumoral distances, included statistical, histogram, and a subset of texture features suggesting that these features are less affected by changes (e.g., size or shape) of the peritumoral region due to different segmentations and image acquisitions. The stability and reproducibility of Laws and wavelet texture features were inconsistent across all peritumoral distances. The analyses also revealed that a subset of features were consistently stable irrespective of the initial parameters (e.g., seed point) for a given segmentation algorithm. No significant differences were found in stability for features that were extracted from ROIs bounded by a lung parenchyma mask versus ROIs that were not bounded by a lung parenchyma mask (i.e., peritumoral regions that extended outside of lung parenchyma). After testing the clinical utility of peritumoral features, stable and reproducible features were shown to be more likely to create repeatable models than unstable and nonreproducible features.
This study identified a subset of stable and reproducible CT radiomic features extracted from the peritumoral region of lung lesions. The stable and reproducible features identified in this study could be applied to a feature selection pipeline for CT radiomic analyses. According to our findings, top performing features in survival models were more likely to be stable and reproducible hence, it may be best practice to utilize them to achieve repeatable studies and reduce the chance of overfitting.
最近的研究表明,从肿瘤实质周围的瘤周区域提取的放射组学特征在各种癌症类型中具有临床应用价值。然而,与任何放射组学特征一样,瘤周特征也可能不稳定和/或不可重现。因此,本研究的目的是评估从肺部病变的瘤周区域提取的 CT 放射组学特征的稳定性和可重复性,其中稳定性定义为不同分割的特征一致性,可重复性定义为特征与不同图像采集的一致性。
使用“湿运行”数据集测量稳定性,使用参考图像数据库评估治疗反应测试-再测试数据集测量可重复性。从肿瘤分割外的 3-12mm 增量距离提取瘤周放射组学特征。从选定的瘤周感兴趣区域(ROI)中评估了 264 个统计、直方图和纹理放射组学特征。除了小波纹理特征外,所有特征(除了小波纹理特征外)都使用由图像生物标志物标准化倡议定义的标准化算法提取。使用一致性相关系数评估特征的稳定性和可重复性。使用总生存期作为终点,在三个先前发表的肺癌数据集上测试稳定和可重复的瘤周特征的临床实用性。
无论瘤周距离如何,发现稳定且可重复的特征包括统计、直方图和纹理特征的一个子集,这表明这些特征受不同分割和图像采集引起的瘤周区域变化(例如大小或形状)的影响较小。在所有瘤周距离下,Laws 和小波纹理特征的稳定性和可重复性均不一致。分析还表明,在给定分割算法的初始参数(例如种子点)下,特征的子集是一致稳定的。从受肺实质掩模限制的 ROI 中提取的特征的稳定性与不受肺实质掩模限制的 ROI(即延伸至肺实质之外的瘤周区域)的稳定性没有显著差异。在测试瘤周特征的临床实用性后,稳定和可重复的特征比不稳定和不可重复的特征更有可能创建可重复的模型。
本研究确定了从肺部病变瘤周区域提取的一组稳定且可重复的 CT 放射组学特征。本研究中确定的稳定和可重复的特征可应用于 CT 放射组学分析的特征选择管道。根据我们的发现,在生存模型中表现最好的特征更有可能是稳定和可重复的,因此最好利用它们来实现可重复的研究并降低过度拟合的机会。