Departement of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, 16419, South Korea.
Center for Neuroscience Imaging Research, Institute for Basic Science (IBS), Suwon, 16419, South Korea.
Cancer Imaging. 2019 Jul 26;19(1):54. doi: 10.1186/s40644-019-0239-z.
Radiomics suffers from feature reproducibility. We studied the variability of radiomics features and the relationship of radiomics features with tumor size and shape to determine guidelines for optimal radiomics study.
We dealt with 260 lung nodules (180 for training, 80 for testing) limited to 2 cm or less. We quantified how voxel geometry (isotropic/anisotropic) and the number of histogram bins, factors commonly adjusted in multi-center studies, affect reproducibility. First, features showing high reproducibility between the original and isotropic transformed voxel settings were identified. Second, features showing high reproducibility in various binning settings were identified. Two hundred fifty-two features were computed and features with high intra-correlation coefficient were selected. Features that explained nodule status (benign/malignant) were retained using the least absolute shrinkage selector operator. Common features among different settings were identified, and the final features showing high reproducibility correlated with nodule status were identified. The identified features were used for the random forest classifier to validate the effectiveness of the features. The properties of the uncalculated feature were inspected to suggest a tentative guideline for radiomics studies.
Nine features showing high reproducibility for both the original and isotropic voxel settings were selected and used to classify nodule status (AUC 0.659-0.697). Five features showing high reproducibility among different binning settings were selected and used in classification (AUC 0.729-0.748). Some texture features are likely to be successfully computed if a nodule was larger than 1000 mm.
Features showing high reproducibility among different settings correlated with nodule status were identified.
放射组学存在特征可重复性问题。我们研究了放射组学特征的可变性以及放射组学特征与肿瘤大小和形状的关系,以确定最佳放射组学研究的指南。
我们处理了 260 个肺结节(180 个用于训练,80 个用于测试),结节直径限制在 2cm 或以下。我们量化了体素几何形状(各向同性/各向异性)和直方图箱数(多中心研究中常见的调整因素)如何影响可重复性。首先,确定在原始和各向同性变换体素设置之间具有高可重复性的特征。其次,确定在各种分箱设置中具有高可重复性的特征。计算了 252 个特征,并选择了具有高内相关系数的特征。使用最小绝对收缩选择算子保留了能够解释结节状态(良性/恶性)的特征。确定了不同设置之间共有的特征,并确定了具有高可重复性且与结节状态相关的最终特征。所识别的特征用于随机森林分类器来验证特征的有效性。检查未计算特征的性质,以提出放射组学研究的暂定指南。
选择了 9 个在原始和各向同性体素设置中均具有高可重复性的特征,并用于分类结节状态(AUC 0.659-0.697)。选择了 5 个在不同分箱设置中具有高可重复性的特征用于分类(AUC 0.729-0.748)。如果结节大于 1000mm,则可能成功计算出一些纹理特征。
确定了与结节状态相关的具有高可重复性的特征。