Zhu Jiang, Zheng Jinxin, Li Longfei, Huang Rui, Ren Haoyu, Wang Denghui, Dai Zhijun, Su Xinliang
Department of Endocrine and Breast Surgery, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China.
Department of Breast Surgery, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China.
Front Med (Lausanne). 2021 Mar 9;8:635771. doi: 10.3389/fmed.2021.635771. eCollection 2021.
While there are no clear indications of whether central lymph node dissection is necessary in patients with T1-T2, non-invasive, clinically uninvolved central neck lymph nodes papillary thyroid carcinoma (PTC), this study seeks to develop and validate models for predicting the risk of central lymph node metastasis (CLNM) in these patients based on machine learning algorithms. This is a retrospective study comprising 1,271 patients with T1-T2 stage, non-invasive, and clinically node negative (cN0) PTC who underwent surgery at the Department of Endocrine and Breast Surgery of The First Affiliated Hospital of Chongqing Medical University from February 1, 2016, to December 31, 2018. We applied six machine learning (ML) algorithms, including Logistic Regression (LR), Gradient Boosting Machine (GBM), Extreme Gradient Boosting (XGBoost), Random Forest (RF), Decision Tree (DT), and Neural Network (NNET), coupled with preoperative clinical characteristics and intraoperative information to develop prediction models for CLNM. Among all the samples, 70% were randomly selected to train the models while the remaining 30% were used for validation. Indices like the area under the receiver operating characteristic (AUROC), sensitivity, specificity, and accuracy were calculated to test the models' performance. The results showed that ~51.3% (652 out of 1,271) of the patients had pN1 disease. In multivariate logistic regression analyses, gender, tumor size and location, multifocality, age, and Delphian lymph node status were all independent predictors of CLNM. In predicting CLNM, six ML algorithms posted AUROC of 0.70-0.75, with the extreme gradient boosting (XGBoost) model standing out, registering 0.75. Thus, we employed the best-performing ML algorithm model and uploaded the results to a self-made online risk calculator to estimate an individual's probability of CLNM (https://jin63.shinyapps.io/ML_CLNM/). With the incorporation of preoperative and intraoperative risk factors, ML algorithms can achieve acceptable prediction of CLNM with Xgboost model performing the best. Our online risk calculator based on ML algorithm may help determine the optimal extent of initial surgical treatment for patients with T1-T2 stage, non-invasive, and clinically node negative PTC.
虽然对于T1-T2期、非侵袭性、临床未累及中央颈部淋巴结的甲状腺乳头状癌(PTC)患者是否有必要进行中央淋巴结清扫尚无明确指征,但本研究旨在基于机器学习算法开发并验证用于预测这些患者中央淋巴结转移(CLNM)风险的模型。这是一项回顾性研究,纳入了2016年2月1日至2018年12月31日在重庆医科大学附属第一医院内分泌与乳腺外科接受手术的1271例T1-T2期、非侵袭性且临床淋巴结阴性(cN0)的PTC患者。我们应用了六种机器学习(ML)算法,包括逻辑回归(LR)、梯度提升机(GBM)、极端梯度提升(XGBoost)、随机森林(RF)、决策树(DT)和神经网络(NNET),结合术前临床特征和术中信息来开发CLNM预测模型。在所有样本中,随机选择70%用于训练模型,其余30%用于验证。计算受试者工作特征曲线下面积(AUROC)、敏感性、特异性和准确性等指标来测试模型性能。结果显示,约51.3%(1271例中的652例)患者存在pN1疾病。在多因素逻辑回归分析中,性别、肿瘤大小和位置、多灶性、年龄和Delphian淋巴结状态均为CLNM的独立预测因素。在预测CLNM方面,六种ML算法的AUROC为0.70-0.75,其中极端梯度提升(XGBoost)模型表现突出,为0.75。因此,我们采用了性能最佳的ML算法模型,并将结果上传至自制的在线风险计算器,以估计个体发生CLNM的概率(https://jin63.shinyapps.io/ML_CLNM/)。结合术前和术中风险因素,ML算法能够对CLNM实现可接受的预测,其中Xgboost模型表现最佳。我们基于ML算法的在线风险计算器可能有助于确定T1-T2期、非侵袭性且临床淋巴结阴性PTC患者初始手术治疗的最佳范围。