Yang Qianmei, Liu Cuifang, Wang Yongyue, Dong Guifang, Sun Jinghuan
Department of Ultrasound, The First Affiliated Hospital of Chongqing University of Chinese Medicine, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, 400021, China.
Department of Radiology, The First Affiliated Hospital of Chongqing University of Chinese Medicine, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, 400021, China.
Discov Oncol. 2025 May 8;16(1):704. doi: 10.1007/s12672-025-02493-4.
The aim of this study was to develop and validate a machine learning (ML) based prediction model for sentinel lymph node metastasis in breast cancer to identify patients with a high risk of sentinel lymph node metastasis.
In this machine learning study, we retrospectively collected 225 female breast cancer patients who underwent sentinel lymph node biopsy (SLNB). Feature screening was performed using the logistic regression analysis. Subsequently, five ML algorithms, namely LOGIT, LASSO, XGBOOST, RANDOM FOREST model and GBM model were employed to train and develop an ML model. In addition, model interpretation was performed by the Shapley Additive Explanations (SHAP) analysis to clarify the importance of each feature of the model and its decision basis.
Combined univariate and multivariate logistic regression analysis, identified Multifocal, LVI, Maximum Diameter, Shape US, Maximum Cortical Thickness as significant predictors. We than successfully leveraged machine learning algorithms, particularly the RANDOM FOREST model, to develop a predictive model for sentinel lymph node metastasis in breast cancer. Finally, the SHAP method identified Maximum Diameter and Maximum Cortical Thickness as the primary decision factors influencing the ML model's predictions.
With the integration of pathological and imaging characteristics, ML algorithm can accurately predict sentinel lymph node metastasis in breast cancer patients. The RANDOM FOREST model showed ideal performance. With the incorporation of these models in the clinic, can helpful for clinicians to identify patients at risk of sentinel lymph node metastasis of breast cancer and make more reasonable treatment decisions.
本研究旨在开发并验证一种基于机器学习(ML)的乳腺癌前哨淋巴结转移预测模型,以识别具有前哨淋巴结转移高风险的患者。
在这项机器学习研究中,我们回顾性收集了225例行前哨淋巴结活检(SLNB)的女性乳腺癌患者。使用逻辑回归分析进行特征筛选。随后,采用五种ML算法,即LOGIT、LASSO、XGBOOST、随机森林模型和GBM模型来训练和开发ML模型。此外,通过夏普利值附加解释(SHAP)分析进行模型解释,以阐明模型各特征的重要性及其决策依据。
单因素和多因素逻辑回归分析相结合,确定多灶性、淋巴管浸润、最大直径、超声形态、最大皮质厚度为显著预测因素。然后,我们成功利用机器学习算法,特别是随机森林模型,开发了一种乳腺癌前哨淋巴结转移预测模型。最后,SHAP方法确定最大直径和最大皮质厚度是影响ML模型预测的主要决策因素。
通过整合病理和影像特征,ML算法可以准确预测乳腺癌患者的前哨淋巴结转移。随机森林模型表现出理想的性能。将这些模型应用于临床,有助于临床医生识别有乳腺癌前哨淋巴结转移风险的患者,并做出更合理的治疗决策。