Sung You-Na, Lee Hyeseong, Kim Eunsu, Jung Woon Yong, Sohn Jin-Hee, Lee Yoo Jin, Keum Bora, Ahn Sangjeong, Lee Sung Hak
Department of Pathology, Korea University Anam Hospital, College of Medicine, Korea University Seoul, South Korea.
Department of Hospital Pathology, Seoul St. Mary's Hospital, College of Medicine, The Catholic University Seoul, South Korea.
Am J Cancer Res. 2024 Jul 15;14(7):3513-3522. doi: 10.62347/RJBH6076. eCollection 2024.
In early gastric cancer (EGC), the presence of lymph node metastasis (LNM) is a crucial factor for determining the treatment options. Endoscopic resection is used for treatment of EGC with minimal risk of LNM. However, owing to the lack of definitive criteria for identifying patients who require additional surgery, some patients undergo unnecessary additional surgery. Considering that histopathologic patterns are significant factor for predicting lymph node metastasis in gastric cancer, we aimed to develop a machine learning algorithm which can predict LNM status using hematoxylin and eosin (H&E)-stained images. The images were obtained from several institutions. Our pipeline comprised two sequential approaches including a feature extractor and a risk classifier. For the feature extractor, a segmentation network (DeepLabV3+) was trained on 243 WSIs across three datasets to differentiate each histological subtype. The risk classifier was trained with XGBoost using 70 morphological features inferred from the trained feature extractor. The trained segmentation network, the feature extractor, achieved high performance, with pixel accuracies of 0.9348 and 0.8939 for the internal and external datasets in patch level, respectively. The risk classifier achieved an overall AUC of 0.75 in predicting LNM status. Remarkably, one of the datasets also showed a promising result with an AUC of 0.92. This is the first multi-institution study to develop machine learning algorithm for predicting LNM status in patients with EGC using H&E-stained histopathology images. Our findings have the potential to improve the selection of patients who require surgery among those with EGC showing high-risk histological features.
在早期胃癌(EGC)中,淋巴结转移(LNM)的存在是决定治疗方案的关键因素。内镜切除术用于治疗LNM风险最小的EGC。然而,由于缺乏确定哪些患者需要额外手术的明确标准,一些患者接受了不必要的额外手术。考虑到组织病理学模式是预测胃癌淋巴结转移的重要因素,我们旨在开发一种机器学习算法,该算法可以使用苏木精和伊红(H&E)染色图像预测LNM状态。这些图像来自多个机构。我们的流程包括两个连续的方法,即特征提取器和风险分类器。对于特征提取器,在三个数据集中的243张全切片图像(WSIs)上训练一个分割网络(DeepLabV3+),以区分每种组织学亚型。风险分类器使用从训练好的特征提取器推断出的70个形态学特征,通过XGBoost进行训练。训练好的分割网络,即特征提取器,表现出高性能,在补丁级别上,内部和外部数据集的像素准确率分别为0.9348和0.8939。风险分类器在预测LNM状态时的总体曲线下面积(AUC)为0.75。值得注意的是,其中一个数据集也显示出了令人鼓舞的结果,AUC为0.92。这是第一项利用H&E染色的组织病理学图像开发用于预测EGC患者LNM状态的机器学习算法的多机构研究。我们的研究结果有可能改善在具有高风险组织学特征的EGC患者中选择需要手术的患者的情况。