Zhang Chuhan, Xu Shun, Jiang Youhong, Jiang Changrui, Li Shangxin, Wang Zhitong, Dong Yan, Jin Feng, Zhao Dan, Zhao Yating
Department of Thoracic Surgery, The First Hospital of China Medical University, Shenyang 10001, China.
Cancer Institute, The First Hospital of China Medical University, Shenyang 10001, China.
J Oncol. 2022 Sep 25;2022:4008113. doi: 10.1155/2022/4008113. eCollection 2022.
Lymph node metastasis (LNM) is the main route of metastasis in lung adenocarcinoma (LA), and preoperative prediction of LNM in early LA is key for accurate medical treatment. We aimed to establish a preoperative prediction model of LNM of early LA through clinical data mining to reduce unnecessary lymph node dissection, reduce surgical injury, and shorten the operation time.
We retrospectively collected imaging data and clinical features of 1121 patients with early LA who underwent video-assisted thoracic surgery at the First Hospital of China Medical University from 2004 to 2021. Logistic regression analysis was used to select variables and establish the preoperative diagnosis model using random forest classifier (RFC). The prediction results from the test set were used to evaluate the prediction performance of the model.
Combining the results of logistic analysis and practical clinical application experience, nine clinical features were included. In the random forest classifier model, when the number of nodes was three and the -tree value is 500, we obtained the best prediction model (accuracy = 0.9769), with a positive prediction rate of 90% and a negative prediction rate of 98.69%.
We established a preoperative prediction model for LNM of early LA using a machine learning random forest method combined with clinical and imaging features. More excellent predictors may be obtained by refining imaging features.
淋巴结转移(LNM)是肺腺癌(LA)的主要转移途径,早期LA术前预测LNM是精准治疗的关键。我们旨在通过临床数据挖掘建立早期LA的LNM术前预测模型,以减少不必要的淋巴结清扫,降低手术创伤,缩短手术时间。
回顾性收集2004年至2021年在中国医科大学附属第一医院接受电视辅助胸腔手术的1121例早期LA患者的影像数据和临床特征。采用逻辑回归分析选择变量,并使用随机森林分类器(RFC)建立术前诊断模型。用测试集的预测结果评估模型的预测性能。
结合逻辑分析结果和实际临床应用经验,纳入9项临床特征。在随机森林分类器模型中,当节点数为3且树值为500时,得到最佳预测模型(准确率=0.9769),阳性预测率为90%,阴性预测率为98.69%。
我们采用机器学习随机森林方法结合临床和影像特征,建立了早期LA的LNM术前预测模型。通过细化影像特征可能获得更优的预测指标。