Bhattarai Shristi, Saini Geetanjali, Li Hongxiao, Seth Gaurav, Fisher Timothy B, Janssen Emiel A M, Kiraz Umay, Kong Jun, Aneja Ritu
Department of Clinical and Diagnostic Sciences, School of Health Professions, University of Alabama at Birmingham, Birmingham, AL 35294, USA.
Department of Mathematics and Statistics, Georgia State University, Atlanta, GA 30302, USA.
Diagnostics (Basel). 2023 Dec 28;14(1):74. doi: 10.3390/diagnostics14010074.
Neoadjuvant chemotherapy (NAC) is the standard treatment for early-stage triple negative breast cancer (TNBC). The primary endpoint of NAC is a pathological complete response (pCR). NAC results in pCR in only 30-40% of TNBC patients. Tumor-infiltrating lymphocytes (TILs), Ki67 and phosphohistone H3 (pH3) are a few known biomarkers to predict NAC response. Currently, systematic evaluation of the combined value of these biomarkers in predicting NAC response is lacking. In this study, the predictive value of markers derived from H&E and IHC stained biopsy tissue was comprehensively evaluated using a supervised machine learning (ML)-based approach. Identifying predictive biomarkers could help guide therapeutic decisions by enabling precise stratification of TNBC patients into responders and partial or non-responders.
Serial sections from core needle biopsies ( = 76) were stained with H&E and immunohistochemically for the Ki67 and pH3 markers, followed by whole-slide image (WSI) generation. The serial section stains in H&E stain, Ki67 and pH3 markers formed WSI triplets for each patient. The resulting WSI triplets were co-registered with H&E WSIs serving as the reference. Separate mask region-based CNN (MRCNN) models were trained with annotated H&E, Ki67 and pH3 images for detecting tumor cells, stromal and intratumoral TILs (sTILs and tTILs), Ki67, and pH3 cells. Top image patches with a high density of cells of interest were identified as hotspots. Best classifiers for NAC response prediction were identified by training multiple ML models and evaluating their performance by accuracy, area under curve, and confusion matrix analyses.
Highest prediction accuracy was achieved when hotspot regions were identified by tTIL counts and each hotspot was represented by measures of tTILs, sTILs, tumor cells, Ki67, and pH3 features. Regardless of the hotspot selection metric, a complementary use of multiple histological features (tTILs, sTILs) and molecular biomarkers (Ki67 and pH3) resulted in top ranked performance at the patient level.
Overall, our results emphasize that prediction models for NAC response should be based on biomarkers in combination rather than in isolation. Our study provides compelling evidence to support the use of ML-based models to predict NAC response in patients with TNBC.
新辅助化疗(NAC)是早期三阴性乳腺癌(TNBC)的标准治疗方法。NAC的主要终点是病理完全缓解(pCR)。NAC仅使30%-40%的TNBC患者达到pCR。肿瘤浸润淋巴细胞(TILs)、Ki67和磷酸化组蛋白H3(pH3)是一些已知的预测NAC反应的生物标志物。目前,缺乏对这些生物标志物联合预测NAC反应价值的系统评估。在本研究中,使用基于监督机器学习(ML)的方法全面评估了来自苏木精和伊红(H&E)染色及免疫组化(IHC)染色活检组织的标志物的预测价值。识别预测性生物标志物有助于通过将TNBC患者精确分层为反应者和部分或无反应者来指导治疗决策。
对76例粗针活检的连续切片进行H&E染色以及Ki67和pH3标志物的免疫组化染色,随后生成全切片图像(WSI)。H&E染色、Ki67和pH3标志物的连续切片染色为每位患者形成WSI三联体。将所得的WSI三联体与作为参考的H&E WSI进行配准。使用注释的H&E、Ki67和pH3图像训练基于单独掩码区域的卷积神经网络(MRCNN)模型,以检测肿瘤细胞、基质和瘤内TILs(基质TILs和肿瘤内TILs)、Ki67和pH3细胞。将具有高密度感兴趣细胞的顶级图像块识别为热点。通过训练多个ML模型并通过准确性、曲线下面积和混淆矩阵分析评估其性能,确定用于预测NAC反应的最佳分类器。
当通过肿瘤内TILs计数识别热点区域且每个热点由肿瘤内TILs、基质TILs、肿瘤细胞、Ki67和pH3特征的测量值表示时,实现了最高的预测准确性。无论热点选择指标如何,在患者水平上,多种组织学特征(肿瘤内TILs、基质TILs)和分子生物标志物(Ki67和pH3)的互补使用都产生了排名靠前的性能。
总体而言,我们的结果强调,NAC反应的预测模型应基于生物标志物的组合而非单一使用。我们的研究提供了有力证据支持使用基于ML的模型来预测TNBC患者的NAC反应。