Hong Duo, Xu Ke, Zhang Lina, Wan Xiaoting, Guo Yan
Department of Radiology, The First Hospital of China Medical University, Shenyang, China.
GE Healthcare, Shanghai, China.
Front Oncol. 2020 Jan 31;10:28. doi: 10.3389/fonc.2020.00028. eCollection 2020.
To develop and validate a radiomic signature to identify EGFR mutations in patients with advanced lung adenocarcinoma. This study involved 201 patients with advanced lung adenocarcinoma (140 in the training cohort and 61 in the validation cohort). A total of 396 features were extracted from manual segmentation based on enhanced and non-enhance CT imaging after image preprocessing. The Lasso algorithm was used for feature selection, 6 machine learning methods were used to construct radiomics models. Receiver operating characteristic (ROC) curve analysis was applied to evaluate the performance of the radiomic signature between different data and methods. A nomogram was developed using clinical factors and the radiomics signature, then it was analyzed based on its discriminatory ability and calibration. Decision curve analysis (DCA) was implemented to evaluate the clinical utility. Ten features for contrast data and eleven features for non-contrast data were selected through LASSO algorithm. The performance of the radiomics signature for contrast images was better than that for non-contrast images in all of the 6 different machine learning methods. Finally, the best radiomics signature was built with logistic regression method based on enhanced CT imaging with an area under the curve (AUC) of 0.851 (95% CI, 0.750 to 0.951) in the validation cohort. A nomogram was developed using the radiomics signature and sex with a C-index of 0.908 (95%CI, 0.862 to 0.954) in the training cohort and 0.835 (95% CI, 0.825 to 0.845) in the validation cohort. It showed good discrimination and calibration (Hosmer-Lemeshow test, = 0.621 for the training cohort and = 0.605 for the validation cohort). Radiomics signature can help to distinguish between EGFR positive and wild type advanced lung adenocarcinomas.
开发并验证一种用于识别晚期肺腺癌患者表皮生长因子受体(EGFR)突变的放射组学特征。本研究纳入了201例晚期肺腺癌患者(训练队列140例,验证队列61例)。在图像预处理后,基于增强和非增强CT成像的手动分割提取了总共396个特征。采用套索算法进行特征选择,使用6种机器学习方法构建放射组学模型。应用受试者操作特征(ROC)曲线分析来评估不同数据和方法之间放射组学特征的性能。使用临床因素和放射组学特征构建列线图,然后基于其区分能力和校准进行分析。实施决策曲线分析(DCA)以评估临床效用。通过套索算法选择了10个增强数据特征和11个非增强数据特征。在所有6种不同的机器学习方法中,放射组学特征对增强图像的性能优于非增强图像。最后,基于增强CT成像,采用逻辑回归方法构建了最佳放射组学特征,验证队列中的曲线下面积(AUC)为0.851(95%CI,0.750至0.951)。使用放射组学特征和性别构建列线图,训练队列中的C指数为0.908(95%CI,0.862至0.954),验证队列中的C指数为0.835(95%CI,0.825至0.845)。它显示出良好的区分能力和校准(Hosmer-Lemeshow检验,训练队列P = 0.621,验证队列P = 0.605)。放射组学特征有助于区分EGFR阳性和野生型晚期肺腺癌。