Park Soyoung, Kim Jong Hee, Cha Yoon Ki, Chung Myung Jin, Woo Jung Han, Park Subin
Department of Health Sciences and Technology, SAIHST, Sungkyunkwan University, Seoul 06351, Republic of Korea.
Department of Radiology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul 06351, Republic of Korea.
Diagnostics (Basel). 2023 Sep 14;13(18):2953. doi: 10.3390/diagnostics13182953.
Axillary lymph node (ALN) status is one of the most critical prognostic factors in patients with breast cancer. However, ALN evaluation with contrast-enhanced CT (CECT) has been challenging. Machine learning (ML) is known to show excellent performance in image recognition tasks. The purpose of our study was to evaluate the performance of the ML algorithm for predicting ALN metastasis by combining preoperative CECT features of both ALN and primary tumor. This was a retrospective single-institutional study of a total of 266 patients with breast cancer who underwent preoperative chest CECT. Random forest (RF), extreme gradient boosting (XGBoost), and neural network (NN) algorithms were used. Statistical analysis and recursive feature elimination (RFE) were adopted as feature selection for ML. The best ML-based ALN prediction model for breast cancer was NN with RFE, which achieved an AUROC of 0.76 ± 0.11 and an accuracy of 0.74 ± 0.12. By comparing NN with RFE model performance with and without ALN features from CECT, NN with RFE model with ALN features showed better performance at all performance evaluations, which indicated the effect of ALN features. Through our study, we were able to demonstrate that the ML algorithm could effectively predict the final diagnosis of ALN metastases from CECT images of the primary tumor and ALN. This suggests that ML has the potential to differentiate between benign and malignant ALNs.
腋窝淋巴结(ALN)状态是乳腺癌患者最重要的预后因素之一。然而,通过对比增强CT(CECT)评估ALN一直具有挑战性。机器学习(ML)在图像识别任务中表现出色。我们研究的目的是通过结合ALN和原发肿瘤的术前CECT特征来评估ML算法预测ALN转移的性能。这是一项对266例接受术前胸部CECT的乳腺癌患者进行的回顾性单机构研究。使用了随机森林(RF)、极端梯度提升(XGBoost)和神经网络(NN)算法。采用统计分析和递归特征消除(RFE)作为ML的特征选择方法。基于ML的乳腺癌最佳ALN预测模型是采用RFE的NN,其曲线下面积(AUROC)为0.76±0.11,准确率为0.74±0.12。通过比较有和没有来自CECT的ALN特征的采用RFE的NN模型性能,具有ALN特征的采用RFE的NN模型在所有性能评估中表现更好,这表明了ALN特征的作用。通过我们的研究,我们能够证明ML算法可以有效地从原发肿瘤和ALN的CECT图像预测ALN转移的最终诊断。这表明ML有区分良性和恶性ALN的潜力。