Suppr超能文献

开发和验证一种机器学习模型,以预测肾癌淋巴结转移的风险。

Development and validation of a machine learning model to predict the risk of lymph node metastasis in renal carcinoma.

机构信息

Department of Neuro Rehabilitation, Shaanxi Provincial Rehabilitation Hospital, Xi 'an, China.

Department of Cardiac Surgery, Fuwai Hospital Chinese Academy of Medical Sciences, Shenzhen, Shenzhen, China.

出版信息

Front Endocrinol (Lausanne). 2022 Nov 18;13:1054358. doi: 10.3389/fendo.2022.1054358. eCollection 2022.

Abstract

SIMPLE SUMMARY

Studies have shown that about 30% of kidney cancer patients will have metastasis, and lymph node metastasis (LNM) may be related to a poor prognosis. Our retrospective study aims to provide a reliable machine learning-based model to predict the occurrence of LNM in kidney cancer. We screened the pathological grade, liver metastasis, M staging, primary site, T staging, and tumor size from the training group (n=39016) formed by the SEER database and the validation group (n=771) formed by the medical center. Independent predictors of LNM in cancer patients. Using six different algorithms to build a prediction model, it is found that the prediction performance of the XGB model in the training group and the validation group is significantly better than any other machine learning model. The results show that prediction tools based on machine learning can accurately predict the probability of LNM in patients with kidney cancer and have satisfactory clinical application prospects.

BACKGROUND

Lymph node metastasis (LNM) is associated with the prognosis of patients with kidney cancer. This study aimed to provide reliable machine learning-based (ML-based) models to predict the probability of LNM in kidney cancer.

METHODS

Data on patients diagnosed with kidney cancer were extracted from the Surveillance, Epidemiology and Outcomes (SEER) database from 2010 to 2017, and variables were filtered by least absolute shrinkage and selection operator (LASSO), univariate and multivariate logistic regression analyses. Statistically significant risk factors were used to build predictive models. We used 10-fold cross-validation in the validation of the model. The area under the receiver operating characteristic curve (AUC) was used to assess the performance of the model. Correlation heat maps were used to investigate the correlation of features using permutation analysis to assess the importance of predictors. Probability density functions (PDFs) and clinical utility curves (CUCs) were used to determine clinical utility thresholds.

RESULTS

The training cohort of this study included 39,016 patients, and the validation cohort included 771 patients. In the two cohorts, 2544 (6.5%) and 66 (8.1%) patients had LNM, respectively. Pathological grade, liver metastasis, M stage, primary site, T stage, and tumor size were independent predictive factors of LNM. In both model validation, the XGB model significantly outperformed any of the machine learning models with an AUC value of 0.916.A web calculator (https://share.streamlit.io/liuwencai4/renal_lnm/main/renal_lnm.py) were built based on the XGB model. Based on the PDF and CUC, we suggested 54.6% as a threshold probability for guiding the diagnosis of LNM, which could distinguish about 89% of LNM patients.

CONCLUSIONS

The predictive tool based on machine learning can precisely indicate the probability of LNM in kidney cancer patients and has a satisfying application prospect in clinical practice.

摘要

简单总结

研究表明,约 30%的肾癌患者会发生转移,而淋巴结转移(LNM)可能与预后不良有关。我们的回顾性研究旨在提供一种可靠的基于机器学习的模型,以预测肾癌患者发生 LNM 的可能性。我们从 SEER 数据库(n=39016)形成的训练组和医疗中心(n=771)形成的验证组中筛选出病理分级、肝转移、M 分期、原发部位、T 分期和肿瘤大小等作为癌症患者 LNM 的独立预测因子。使用六种不同的算法构建预测模型,结果发现 XGB 模型在训练组和验证组中的预测性能明显优于任何其他机器学习模型。结果表明,基于机器学习的预测工具可以准确预测肾癌患者发生 LNM 的概率,具有令人满意的临床应用前景。

背景

淋巴结转移(LNM)与肾癌患者的预后有关。本研究旨在提供基于机器学习的(ML 为基础)模型,以预测肾癌患者发生 LNM 的概率。

方法

从 2010 年至 2017 年,从监测、流行病学和结果(SEER)数据库中提取诊断为肾癌的患者数据,并通过最小绝对值收缩和选择算子(LASSO)、单变量和多变量逻辑回归分析对变量进行过滤。使用统计学上显著的风险因素构建预测模型。我们在模型验证中使用了 10 折交叉验证。接收器工作特征曲线(AUC)下的面积用于评估模型的性能。使用置换分析绘制特征相关热图,以评估预测因子的重要性。概率密度函数(PDF)和临床实用曲线(CUC)用于确定临床实用阈值。

结果

本研究的训练队列包括 39016 名患者,验证队列包括 771 名患者。在这两个队列中,分别有 2544 名(6.5%)和 66 名(8.1%)患者发生 LNM。病理分级、肝转移、M 期、原发部位、T 期和肿瘤大小是 LNM 的独立预测因素。在两个模型验证中,XGB 模型的 AUC 值均显著优于任何一种机器学习模型,分别为 0.916。基于 XGB 模型,我们构建了一个网络计算器(https://share.streamlit.io/liuwencai4/renal_lnm/main/renal_lnm.py)。基于 PDF 和 CUC,我们建议 54.6%作为指导 LNM 诊断的概率阈值,该阈值可区分约 89%的 LNM 患者。

结论

基于机器学习的预测工具可以精确指示肾癌患者发生 LNM 的概率,在临床实践中具有令人满意的应用前景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f035/9716136/8df557ba0bf6/fendo-13-1054358-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验