Wang Si-Rui, Cao Chun-Li, Du Ting-Ting, Wang Jin-Li, Li Jun, Li Wen-Xiao, Chen Ming
The Ultrasound Diagnosis Department, The First Affiliated Hospital of Shihezi University, Xinjiang, China.
J Ultrasound Med. 2024 Sep;43(9):1611-1625. doi: 10.1002/jum.16483. Epub 2024 May 29.
This study seeks to construct a machine learning model that merges clinical characteristics with ultrasound radiomic analysis-encompassing both the intratumoral and peritumoral-to predict the status of axillary lymph nodes in patients with early-stage breast cancer.
The study employed retrospective methods, collecting clinical information, ultrasound data, and postoperative pathological results from 321 breast cancer patients (including 224 in the training group and 97 in the validation group). Through correlation analysis, univariate analysis, and Lasso regression analysis, independent risk factors related to axillary lymph node metastasis in breast cancer were identified from conventional ultrasound and immunohistochemical indicators, and a clinical feature model was constructed. Additionally, features were extracted from ultrasound images of the intratumoral and its 1-5 mm peritumoral to establish a radiomics feature formula. Furthermore, by combining clinical features and ultrasound radiomics features, six machine learning models (Logistic Regression, Decision Tree, Support Vector Machine, Extreme Gradient Boosting, Random Forest, and K-Nearest Neighbors) were compared for diagnostic efficacy, and constructing a joint prediction model based on the optimal ML algorithm. The use of Shapley Additive Explanations (SHAP) enhanced the visualization and interpretability of the model during the diagnostic process.
Among the 321 breast cancer patients, 121 had axillary lymph node metastasis, and 200 did not. The clinical feature model had an AUC of 0.779 and 0.777 in the training and validation groups, respectively. Radiomics model analysis showed that the model including the Intratumor +3 mm peritumor area had the best diagnostic performance, with AUCs of 0.847 and 0.844 in the training and validation groups, respectively. The joint prediction model based on the XGBoost algorithm reached AUCs of 0.917 and 0.905 in the training and validation groups, respectively. SHAP analysis indicated that the Rad Score had the highest weight in the prediction model, playing a significant role in predicting axillary lymph node metastasis in breast cancer.
The predictive model, which integrates clinical features and radiomic characteristics using the XGBoost algorithm, demonstrates significant diagnostic value for axillary lymph node metastasis in breast cancer. This model can provide significant references for preoperative surgical strategy selection and prognosis evaluation for breast cancer patients, helping to reduce postoperative complications and improve long-term survival rates. Additionally, the utilization of SHAP enhancing the global and local interpretability of the model.
本研究旨在构建一种机器学习模型,该模型将临床特征与超声影像组学分析(包括肿瘤内和肿瘤周围)相结合,以预测早期乳腺癌患者腋窝淋巴结的状态。
本研究采用回顾性方法,收集了321例乳腺癌患者(包括224例训练组和97例验证组)的临床信息、超声数据和术后病理结果。通过相关性分析、单因素分析和Lasso回归分析,从传统超声和免疫组化指标中确定与乳腺癌腋窝淋巴结转移相关的独立危险因素,并构建临床特征模型。此外,从肿瘤内及其周围1-5mm的超声图像中提取特征,建立影像组学特征公式。此外,通过结合临床特征和超声影像组学特征,比较了六种机器学习模型(逻辑回归、决策树、支持向量机、极端梯度提升、随机森林和K近邻)的诊断效能,并基于最优机器学习算法构建联合预测模型。使用Shapley值加法解释(SHAP)增强了模型在诊断过程中的可视化和可解释性。
在321例乳腺癌患者中,121例有腋窝淋巴结转移,200例无腋窝淋巴结转移。临床特征模型在训练组和验证组中的AUC分别为0.779和0.777。影像组学模型分析表明,包括肿瘤内+3mm肿瘤周围区域的模型具有最佳诊断性能,在训练组和验证组中的AUC分别为0.847和0.844。基于XGBoost算法的联合预测模型在训练组和验证组中的AUC分别达到0.917和0.905。SHAP分析表明,Rad Score在预测模型中的权重最高,在预测乳腺癌腋窝淋巴结转移中起重要作用。
使用XGBoost算法整合临床特征和影像组学特征的预测模型,对乳腺癌腋窝淋巴结转移具有显著的诊断价值。该模型可为乳腺癌患者术前手术策略选择和预后评估提供重要参考,有助于减少术后并发症,提高长期生存率。此外,SHAP的应用增强了模型的全局和局部可解释性。