Department of Infectious Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052 Henan, China.
Department of Gastroenterology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052 Henan, China.
Dis Markers. 2022 May 18;2022:1586074. doi: 10.1155/2022/1586074. eCollection 2022.
A more accurate prediction of liver metastasis (LM) in pancreatic cancer (PC) would help improve clinical therapeutic effects and follow-up strategies for the management of this disease. This study was to assess various prediction models to evaluate the risk of LM based on machine learning algorithms.
We retrospectively reviewed clinicopathological characteristics of PC patients from the Surveillance, Epidemiology, and End Results database from 2010 to 2018. The logistic regression, extreme gradient boosting, support vector, random forest (RF), and deep neural network machine algorithms were used to establish models to predict the risk of LM in PC patients. Specificity, sensitivity, and receiver operating characteristic (ROC) curves were used to determine the discriminatory capacity of the prediction models.
A total of 47,919 PC patients were identified; 15,909 (33.2%) of which developed LM. After iterative filtering, a total of nine features were included to establish the risk model for LM based on machine learning. The RF showed the most promising results in the prediction of complications among the models (ROC 0.871 for training and 0.832 for test sets). In risk stratification analysis, the LM rate and 5-year cancer-specific survival (CSS) in the high-risk group were worse than those in the intermediate- and low-risk groups. Surgery, radiotherapy, and chemotherapy were found to significantly improve the CSS in the high- and intermediate-risk groups.
In this study, the RF model constructed could accurately predict the risk of LM in PC patients, which has the potential to provide clinicians with more personalized clinical decision-making recommendations.
更准确地预测胰腺癌(PC)的肝转移(LM)将有助于改善该疾病的临床治疗效果和随访策略。本研究旨在评估基于机器学习算法的各种预测模型,以评估 LM 的风险。
我们回顾性分析了 2010 年至 2018 年期间来自监测、流行病学和最终结果数据库的 PC 患者的临床病理特征。逻辑回归、极端梯度提升、支持向量、随机森林(RF)和深度神经网络机器算法被用于建立预测 PC 患者 LM 风险的模型。特异性、敏感性和接收器工作特征(ROC)曲线用于确定预测模型的区分能力。
共确定了 47919 例 PC 患者,其中 15909 例(33.2%)发生了 LM。经过迭代筛选,共纳入了 9 个特征,基于机器学习建立了 LM 风险模型。RF 在模型预测并发症方面表现出最有前途的结果(训练集 ROC 为 0.871,测试集为 0.832)。在风险分层分析中,高危组的 LM 发生率和 5 年癌症特异性生存率(CSS)均差于中危组和低危组。手术、放疗和化疗被发现显著改善了高危组和中危组的 CSS。
在这项研究中,构建的 RF 模型可以准确预测 PC 患者 LM 的风险,这有可能为临床医生提供更个性化的临床决策建议。