Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini, 4, 20090, Pieve Emanuele, MI, Italy; Otorhinolaryngology Unit, IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano, MI, Italy.
Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini, 4, 20090, Pieve Emanuele, MI, Italy; Otorhinolaryngology Unit, IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano, MI, Italy.
Eur J Surg Oncol. 2023 Sep;49(9):106965. doi: 10.1016/j.ejso.2023.06.017. Epub 2023 Jun 27.
Indications for elective treatment of the neck in patients with major salivary gland cancers are still debated. Our purpose was to develop a machine learning (ML) model able to generate a predictive algorithm to identify lymph node metastases (LNM) in patients with major salivary gland cancer (SGC).
A Retrospective study was performed with data obtained from the Surveillance, Epidemiology, and End Results (SEER) program. Patients diagnosed with a major SGC between 1988 and 2019 were included. Two 2-class supervised ML decision models (random forest, RF; extreme gradient boosting, XGB) were used to predict the presence of LNM, implementing thirteen demographics and clinical variables collected from the SEER database. A permutation feature importance (PFI) score was computed using the testing dataset to identify the most important variables used in model prediction.
A total of 10 350 patients (males: 52%; mean age: 59.9 ± 17.2 years) were included in the study. The RF and the XGB prediction models showed an overall accuracy of 0.68. Both models showed a high specificity (RF: 0.90; XGB: 0.83) and low sensitivity (RF: 0.27; XGB: 0.38) in identifying LNM. According, a high negative predictive value (RF: 0.70; XGB: 0.72) and a low positive predictive value (RF: 0.58; XGB: 0.56) were measured. T classification and tumor size were the most important features in the construction of the prediction algorithms.
Classification performance of the ML algorithms showed high specificity and negative predictive value that allow to preoperatively identify patients with a lower risk of LNM.
Based on data from the Surveillance, Epidemiology, and End Results (SEER) program, our study showed that machine learning algorithms owns a high specificity and negative predictive value, allowing to preoperatively identify patients with a lower risk of lymph node metastasis.
在患有大涎腺癌的患者中,选择性治疗颈部的适应证仍存在争议。我们的目的是开发一种机器学习(ML)模型,以生成一种预测算法,以识别大涎腺癌(SGC)患者的淋巴结转移(LNM)。
对 1988 年至 2019 年期间从监测,流行病学和结果(SEER)计划中获得的数据进行了回顾性研究。纳入了诊断为大涎腺癌的患者。使用两种 2 类监督 ML 决策模型(随机森林,RF;极端梯度增强,XGB)来预测 LNM 的存在,实现了从 SEER 数据库中收集的 13 个人口统计学和临床变量。使用测试数据集计算置换特征重要性(PFI)评分,以识别用于模型预测的最重要变量。
共有 10350 名患者(男性:52%;平均年龄:59.9±17.2 岁)纳入研究。RF 和 XGB 预测模型的总体准确性为 0.68。两种模型在识别 LNM 方面均具有较高的特异性(RF:0.90;XGB:0.83)和较低的敏感性(RF:0.27;XGB:0.38)。因此,高阴性预测值(RF:0.70;XGB:0.72)和低阳性预测值(RF:0.58;XGB:0.56)。T 分类和肿瘤大小是构建预测算法的最重要特征。
ML 算法的分类性能显示出较高的特异性和阴性预测值,可术前识别 LNM 风险较低的患者。
基于监测,流行病学和结果(SEER)计划的数据,我们的研究表明,机器学习算法具有较高的特异性和阴性预测值,可术前识别 LNM 风险较低的患者。