Du Jun, Shi Jun, Sun Dongdong, Wang Yifei, Liu Guanfeng, Chen Jingru, Wang Wei, Zhou Wenchao, Zheng Yushan, Wu Haibo
Department of Pathology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230036, Anhui, China.
Intelligent Pathology Institute, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230036, Anhui, China.
Breast Cancer Res. 2025 Apr 18;27(1):57. doi: 10.1186/s13058-025-01998-8.
Treatment with HER2-targeted therapies is recommended for HER2-positive breast cancer patients with HER2 gene amplification or protein overexpression. Interestingly, recent clinical trials of novel HER2-targeted therapies demonstrated promising efficacy in HER2-low breast cancers, raising the prospect of including a HER2-low category (immunohistochemistry, IHC) score of 1 + or 2 + with non-amplified in-situ hybridization for HER2-targeted treatments, which necessitated the accurate detection and evaluation of HER2 expression in tumors. Traditionally, HER2 protein levels are routinely assessed by IHC in clinical practice, which not only requires significant time consumption and financial investment but is also technically challenging for many basic hospitals in developing countries. Therefore, directly predicting HER2 expression by hematoxylin-eosin (HE) staining should be of significant clinical values, and machine learning may be a potent technology to achieve this goal.
In this study, we developed an artificial intelligence (AI) classification model using whole slide image of HE-stained slides to automatically assess HER2 status.
A publicly available TCGA-BRCA dataset and an in-house USTC-BC dataset were applied to evaluate our AI model and the state-of-the-art method SlideGraph + in terms of accuracy (ACC), the area under the receiver operating characteristic curve (AUC), and F1 score. Overall, our AI model achieved the superior performance in HER2 scoring in both datasets with AUC of 0.795 ± 0.028 and 0.688 ± 0.008 on the USCT-BC and TCGA-BRCA datasets, respectively. In addition, we visualized the results generated from our AI model by attention heatmaps, which proved that our AI model had strong interpretability.
Our AI model is able to directly predict HER2 expression through HE images with strong interpretability, and has a better ACC particularly in HER2-low breast cancers, which provides a method for AI evaluation of HER2 status and helps to perform HER2 evaluation economically and efficiently. It has the potential to assist pathologists to improve diagnosis and assess biomarkers for companion diagnostics.
对于HER2基因扩增或蛋白过表达的HER2阳性乳腺癌患者,推荐使用HER2靶向治疗。有趣的是,近期新型HER2靶向治疗的临床试验显示,其在HER2低表达乳腺癌中疗效显著,这使得对HER2低表达(免疫组化,IHC)评分为1+或2+且原位杂交未扩增的患者采用HER2靶向治疗成为可能,这就需要准确检测和评估肿瘤中的HER2表达。传统上,临床实践中常规通过免疫组化评估HER2蛋白水平,这不仅需要大量时间和资金投入,而且对发展中国家的许多基层医院来说在技术上也具有挑战性。因此,通过苏木精-伊红(HE)染色直接预测HER2表达应具有重要的临床价值,而机器学习可能是实现这一目标的有效技术。
在本研究中,我们利用HE染色玻片的全切片图像开发了一种人工智能(AI)分类模型,以自动评估HER2状态。
应用公开可用的TCGA-BRCA数据集和内部的USTC-BC数据集,从准确率(ACC)、受试者操作特征曲线下面积(AUC)和F1分数方面评估我们的AI模型和最先进的方法SlideGraph +。总体而言,我们的AI模型在两个数据集中的HER2评分方面均表现出色,在USTC-BC和TCGA-BRCA数据集上的AUC分别为0.795±0.028和0.688±0.008。此外,我们通过注意力热图可视化了AI模型生成的结果,证明我们的AI模型具有很强的可解释性。
我们的AI模型能够通过具有强可解释性的HE图像直接预测HER2表达,并且具有更好的ACC,尤其是在HER2低表达乳腺癌中,这为HER2状态的AI评估提供了一种方法,有助于经济高效地进行HER2评估。它有可能协助病理学家改善诊断并评估伴随诊断的生物标志物。