Qiu Qingtao, Duan Jinghao, Duan Zuyun, Meng Xiangjuan, Ma Changsheng, Zhu Jian, Lu Jie, Liu Tonghai, Yin Yong
Department of Radiation Oncology, Shandong Cancer Hospital Affiliated to Shandong University, Jinan 250117, China.
Department of Radiology, Second People's Hospital of Dongying City, Dongying 257335, China.
Quant Imaging Med Surg. 2019 Mar;9(3):453-464. doi: 10.21037/qims.2019.03.02.
The reproducibility and non-redundancy of radiomic features are challenges in accelerating the clinical translation of radiomics. In this study, we focused on the robustness and non-redundancy of radiomic features extracted from computed tomography (CT) scans in hepatocellular carcinoma (HCC) patients with respect to different tumor segmentation methods.
Arterial enhanced CT images were retrospectively randomly obtained from 106 patients. As a training data set, 26 HCC patients were used to calculate the features' reproducibility and redundancy. Another data set (55 HCC patients and 25 healthy volunteers) was used for classification. The GrowCut and GraphCut semiautomatic segmentation methods were implemented in 3D Slicer software by two independent observers, and manual delineation was performed by five abdominal radiation oncologists to acquire the gross tumor volume (GTV). Seventy-one radiomic features were extracted from GTVs using Imaging Biomarker Explorer (IBEX) software, including 17 tumor intensity statistical features, 16 shape features and 38 textural features. For each radiomic feature, intraclass correlation coefficient (ICC) and hierarchical clustering were used to quantify its reproducibility and redundancy. Features with ICC values greater than 0.75 were considered reproducible. To generate the number of non-redundancy feature subgroups, the R statistic method was used. Then, a classification model was built using a support vector machine (SVM) algorithm with 10-fold cross validation, and area under ROC curve (AUC) was used to evaluate the utility of non-redundant feature extraction by hierarchical clustering.
The percentages of excellent reproducible features in the manual delineation group, GraphCut and GrowCut segmentation group were 69% [49], 73% [52] and 79% [56], respectively. Sixty-five percent [46] of the features showed strong robustness for all segmentation methods. The optimal number of cluster subgroup were 9, 13 and 11 for manual delineation, GraphCut and GrowCut segmentation, respectively. The optimal cluster subgroup number was 6 for all groups when the collectively high reproducibility features were selected for clustering. The receiver operating characteristic (ROC) analysis of radiomics classification model with and without feature reduction for healthy liver and HCC had an AUC value of 0.857 and 0.721 respectively.
Our study demonstrates that variations exist in the reproducibility of quantitative imaging features extracted from tumor regions segmented using different methods. The reproducibility and non-redundancy of the radiomic features rely greatly on the tumor segmentation in HCC CT images. We recommend that the most reliable and uniform radiomic features should be selected in the clinical use of radiomics. Classification experiments with feature reduction showed that radiomic features were effective in identifying healthy liver and HCC.
放射组学特征的可重复性和非冗余性是加速放射组学临床转化的挑战。在本研究中,我们关注肝细胞癌(HCC)患者计算机断层扫描(CT)图像中提取的放射组学特征在不同肿瘤分割方法下的稳健性和非冗余性。
回顾性随机获取106例患者的动脉期增强CT图像。作为训练数据集,26例HCC患者用于计算特征的可重复性和冗余性。另一个数据集(55例HCC患者和25名健康志愿者)用于分类。两名独立观察者在3D Slicer软件中采用GrowCut和GraphCut半自动分割方法,并由五名腹部放射肿瘤学家进行手动勾勒以获取肿瘤总体积(GTV)。使用影像生物标志物探索者(IBEX)软件从GTV中提取71个放射组学特征,包括17个肿瘤强度统计特征、16个形状特征和38个纹理特征。对于每个放射组学特征,使用组内相关系数(ICC)和层次聚类来量化其可重复性和冗余性。ICC值大于0.75的特征被认为具有可重复性。采用R统计方法生成非冗余特征亚组的数量。然后,使用支持向量机(SVM)算法构建具有10倍交叉验证的分类模型,并使用ROC曲线下面积(AUC)评估通过层次聚类进行非冗余特征提取的效用。
手动勾勒组、GraphCut和GrowCut分割组中具有良好可重复性特征的百分比分别为69% [49]、73% [52]和79% [56]。65% [46]的特征对所有分割方法均显示出较强的稳健性。手动勾勒、GraphCut和GrowCut分割的最佳聚类亚组数分别为9、13和11。当选择共同具有高可重复性的特征进行聚类时,所有组的最佳聚类亚组数均为6。对有和没有特征约简的健康肝脏和HCC的放射组学分类模型进行的受试者操作特征(ROC)分析,AUC值分别为0.857和0.721。
我们的研究表明,使用不同方法分割肿瘤区域所提取的定量影像特征的可重复性存在差异。放射组学特征的可重复性和非冗余性在很大程度上依赖于HCC CT图像中的肿瘤分割。我们建议在放射组学的临床应用中应选择最可靠和一致的放射组学特征。特征约简的分类实验表明,放射组学特征在识别健康肝脏和HCC方面是有效的。