Dong Mengshi, Hou Gang, Li Shu, Li Nan, Zhang Lina, Xu Ke
Department of Radiology, The First Affiliated Hospital of China Medical University, Shenyang, China.
Institute of Respiratory Disease, The First Affiliated Hospital of China Medical University, Shenyang, China.
Front Oncol. 2021 Jan 8;10:558428. doi: 10.3389/fonc.2020.558428. eCollection 2020.
To establish and validate a radiomics model to estimate the malignancy of mediastinal lymph nodes (LNs) based on contrast-enhanced CT imaging.
In total, 201 pathologically confirmed mediastinal LNs from 129 patients were enrolled and assigned to training and test sets. Radiomics features were extracted from the region of interest (ROI) delineated on venous-phase CT imaging of LN. Feature selection was performed with least absolute shrinkage and selection operator (LASSO) binary logistic regression. Multivariate logistic regression was performed with the backward stepwise elimination. A model was fitted to associate mediastinal LN malignancy with selected features. The performance of the model was assessed and compared to that of five other machine learning algorithms (support vector machine, naive Bayes, random forest, decision tree, K-nearest neighbor) using receiver operating characteristic (ROC) curves. Calibration curves and Hosmer-Lemeshow tests were used to assess the calibration degree. Decision curve analysis (DCA) was used to assess the clinical usefulness of the logistic regression model in both the training and test sets. Stratified analysis was performed for different scanners and slice thicknesses.
Among the six machine learning methods, the logistic regression model with the eight strongest features showed a significant association with mediastinal LN status and the satisfactory diagnostic performance for distinguishing malignant LNs from benign LNs. The accuracy, sensitivity, specificity and area under the ROC curve (AUC) were 0.850/0.803, 0.821/0.806, 0.893/0.800, and 0.922/0.850 in the training/test sets, respectively. The Hosmer-Lemeshow test showed that the P value was > 0.05, indicating good calibration, and the calibration curves showed good agreement between the classifications and actual observations. DCA showed that the model would obtain more benefit when the threshold probability was between 30% and 90% in the test set. Stratified analysis showed that the performance was not affected by different scanners or slice thicknesses. There was no significant difference (DeLong test, P > 0.05) between any two subgroups, which showed the generalization of the radiomics score across different factors.
The model we built could help assist the preoperative estimation of mediastinal LN malignancy based on contrast-enhanced CT imaging, with stability for different scanners and slice thicknesses.
建立并验证一种基于增强CT影像的放射组学模型,以评估纵隔淋巴结(LN)的恶性程度。
共纳入129例患者的201个经病理证实的纵隔LN,并将其分为训练集和测试集。从LN静脉期CT影像上勾勒出的感兴趣区域(ROI)中提取放射组学特征。采用最小绝对收缩和选择算子(LASSO)二元逻辑回归进行特征选择。采用向后逐步淘汰法进行多变量逻辑回归。构建一个模型,将纵隔LN的恶性程度与所选特征相关联。使用受试者工作特征(ROC)曲线评估该模型的性能,并与其他五种机器学习算法(支持向量机、朴素贝叶斯、随机森林、决策树、K近邻)进行比较。使用校准曲线和Hosmer-Lemeshow检验评估校准程度。采用决策曲线分析(DCA)评估逻辑回归模型在训练集和测试集中的临床实用性。对不同扫描仪和层厚进行分层分析。
在六种机器学习方法中,具有八个最强特征的逻辑回归模型与纵隔LN状态显著相关,在区分恶性LN和良性LN方面具有令人满意的诊断性能。训练集/测试集中的准确率、灵敏度、特异性和ROC曲线下面积(AUC)分别为0.850/0.803、0.821/0.806、0.893/0.800和0.922/0.850。Hosmer-Lemeshow检验显示P值>0.05,表明校准良好,校准曲线显示分类与实际观察结果之间具有良好的一致性。DCA显示,在测试集中,当阈值概率在30%至90%之间时,该模型将获得更多益处。分层分析表明,性能不受不同扫描仪或层厚的影响。任意两个亚组之间均无显著差异(DeLong检验,P>0.05),这表明放射组学评分在不同因素之间具有普遍性。
我们构建的模型有助于基于增强CT影像对纵隔LN恶性程度进行术前评估,且对不同扫描仪和层厚具有稳定性。