Sun Zhenguo, Gao Jianxiong, Yu Wenji, Yuan Xiaoshuai, Du Peng, Chen Peng, Wang Yuetao
Department of Nuclear Medicine, The Third Affiliated Hospital of Soochow University, Changzhou, Jiangsu, China.
Department of Nuclear Medicine, The First People's Hospital of Lianyungang/The First Affiliated Hospital of Kangda College of Nanjing Medical University, Lianyungang, Jiangsu, China.
Front Oncol. 2025 May 14;15:1590769. doi: 10.3389/fonc.2025.1590769. eCollection 2025.
Accurately evaluating human epidermal growth factor receptor (HER2) expression status in breast cancer enables clinicians to develop individualized treatment plans and improve patient prognosis. The purpose of this study was to assess the performance of a machine learning (ML) model that was developed using F-FDG PET/CT parameters and clinicopathological features in distinguishing different levels of HER2 expression in breast cancer.
This retrospective study enrolled breast cancer patients who underwent F-FDG PET/CT scans prior to treatment at Lianyungang First People's Hospital (centre 1, n=157) and the Third Affiliated Hospital of Soochow University (centre 2, n=84). Two classification tasks were analysed: distinguishing HER2-zero expression from HER2-low/positive expression (Task 1) and distinguishing HER2-low expression from HER2-positive expression (Task 2). For each task, patients from Centre 1 were randomly divided into training and internal test sets at a 7:3 ratio, whereas patients from Centre 2 served as an external test set. The prediction models included logistic regression (LR), support vector machine (SVM), extreme gradient boosting (XGBoost) and multilayer perceptron (MLP), and SHAP analysis provided model interpretability. Model performance was evaluated via the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV).
XGBoost models exhibited the best predictive performance in both tasks. For Task 1, recursive feature elimination (RFE) was used to select 8 features, excluding pathological features, and the XGBoost model achieved AUCs of 0.888, 0.844 and 0.759 for the training, internal and external testing sets, respectively. The top three features according to the SHAP values were the tumour minimum diameter, mean standardized uptake value (SUVmean) and CTmean. For Task 2, 9 features were selected, including progesterone receptor (PR) status as a pathological feature. The XGBoost model achieved AUCs of 0.920, 0.814 and 0.693 for the training, internal and external testing sets, respectively. The top three features according to the SHAP values were the PR status, maximum tumour diameter and metabolic tumour volume (MTV).
ML models that incorporate F-FDG PET/CT parameters and clinicopathological features can aid in the prediction of different HER2 expression statuses in breast cancer.
准确评估乳腺癌中人表皮生长因子受体(HER2)的表达状态,有助于临床医生制定个体化治疗方案并改善患者预后。本研究旨在评估一种利用F-FDG PET/CT参数和临床病理特征开发的机器学习(ML)模型在区分乳腺癌不同HER2表达水平方面的性能。
这项回顾性研究纳入了在连云港市第一人民医院(中心1,n = 157)和苏州大学附属第三医院(中心2,n = 84)接受治疗前进行F-FDG PET/CT扫描的乳腺癌患者。分析了两项分类任务:区分HER2零表达与HER2低/阳性表达(任务1)以及区分HER2低表达与HER2阳性表达(任务2)。对于每项任务,中心1的患者以7:3的比例随机分为训练集和内部测试集,而中心2的患者作为外部测试集。预测模型包括逻辑回归(LR)、支持向量机(SVM)、极端梯度提升(XGBoost)和多层感知器(MLP),SHAP分析提供了模型的可解释性。通过受试者操作特征曲线下面积(AUC)、准确率、灵敏度、特异度、阳性预测值(PPV)和阴性预测值(NPV)评估模型性能。
XGBoost模型在两项任务中均表现出最佳预测性能。对于任务1,使用递归特征消除(RFE)选择了8个特征(不包括病理特征),XGBoost模型在训练集、内部测试集和外部测试集上的AUC分别为0.888、0.844和0.759。根据SHAP值,排名前三的特征是肿瘤最小直径、平均标准化摄取值(SUVmean)和CTmean。对于任务2,选择了9个特征,包括作为病理特征的孕激素受体(PR)状态。XGBoost模型在训练集、内部测试集和外部测试集上的AUC分别为0.920、0.814和0.693。根据SHAP值,排名前三的特征是PR状态、肿瘤最大直径和代谢肿瘤体积(MTV)。
结合F-FDG PET/CT参数和临床病理特征的ML模型有助于预测乳腺癌中不同的HER2表达状态。