Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam, Korea.
Department of Pathology, Haeundae Paik Hospital, Inje University College of Medicine, Busan, Korea.
Cancer Res Treat. 2023 Oct;55(4):1240-1249. doi: 10.4143/crt.2022.1330. Epub 2023 Mar 21.
To identify important features of lymph node metastasis (LNM) and develop a prediction model for early gastric cancer (EGC) using a gradient boosting machine (GBM) method.
The clinicopathologic data of 2556 patients with EGC who underwent gastrectomy were used as training set and the internal validation set (set 1) at a ratio of 8:2. Additionally, 548 patients with EGC who underwent endoscopic submucosal dissection (ESD) as the initial treatment were included in the external validation set (set 2). The GBM model was constructed, and its performance was compared with that of the Japanese guidelines.
LNM was identified in 12.6% (321/2556) of the gastrectomy group (training set & set 1) and 4.3% (24/548) of the ESD group (set 2). In the GBM analysis, the top five features that most affected LNM were lymphovascular invasion, depth, differentiation, size, and location. The accuracy, sensitivity, specificity, and the area under the receiver operating characteristics of set 1 were 0.566, 0.922, 0.516, and 0.867, while those of set 2 were 0.810, 0.958, 0.803, and 0.944, respectively. When the sensitivity of GBM was adjusted to that of Japanese guidelines (beyond the expanded criteria in set 1 [0.922] and eCuraC-2 in set 2 [0.958]), the specificities of GBM in sets 1 and 2 were 0.516 (95% confidence interval, 0.502-0.523) and 0.803 (0.795-0.805), while those of the Japanese guidelines were 0.502 (0.488-0.509) and 0.788 (0.780-0.790), respectively.
The GBM model showed good performance comparable with the eCura system in predicting LNM risk in EGCs.
利用梯度提升机(GBM)方法,确定淋巴结转移(LNM)的重要特征,并为早期胃癌(EGC)建立预测模型。
采用 2556 例行胃切除术的 EGC 患者的临床病理资料作为训练集和内部验证集(集 1),比例为 8:2。此外,纳入 548 例行内镜黏膜下剥离术(ESD)作为初始治疗的 EGC 患者作为外部验证集(集 2)。构建 GBM 模型,并与日本指南进行比较。
在胃切除术组(训练集和集 1)中,LNM 的检出率为 12.6%(321/2556),在 ESD 组(集 2)中为 4.3%(24/548)。在 GBM 分析中,对 LNM 影响最大的前五个特征是淋巴管浸润、深度、分化、大小和位置。集 1 的准确性、敏感度、特异度和受试者工作特征曲线下面积分别为 0.566、0.922、0.516 和 0.867,集 2 分别为 0.810、0.958、0.803 和 0.944。当 GBM 的敏感度调整至日本指南的标准(超过集 1 中的扩展标准[0.922]和集 2 中的 eCuraC-2[0.958])时,集 1 和集 2 中 GBM 的特异度分别为 0.516(95%置信区间,0.502-0.523)和 0.803(0.795-0.805),而日本指南的特异度分别为 0.502(0.488-0.509)和 0.788(0.780-0.790)。
GBM 模型在预测 EGC 中 LNM 风险方面表现良好,与 eCura 系统相当。