Li Han, Yang Bo, Wang Chenjie, Li Bo, Han Lei, Jiang Yi, Song Yanqiong, Wen Lianbin, Rao Mingyue, Zhang Jianwen, Li Xueting, He Kun, Han Yunwei
Department of Oncology, The Affiliated Hospital of Southwest Medical University, Luzhou, China.
Department of General Surgery (Hepatobiliary Surgery), The Affiliated Hospital of Southwest Medical University, Luzhou, China.
Front Pharmacol. 2024 Sep 19;15:1452201. doi: 10.3389/fphar.2024.1452201. eCollection 2024.
In this retrospective study, we aimed to identify key risk factors and establish an interpretable model for HCC with a diameter ≥ 5 cm using Lasso regression for effective risk stratification and clinical decision-making.
In this study, 843 patients with advanced hepatocellular carcinoma (HCC) and tumor diameter ≥ 5 cm were included. Using Lasso regression to screen multiple characteristic variables, cox proportional hazard regression and random survival forest models (RSF) were established. By comparing the area under the curve (AUC), the optimal model was selected. The model was visualized, and the order of interpretable importance was determined. Finally, risk stratification was established to identify patients at high risk.
Lasso regression identified 8 factors as characteristic risk factors. Subsequent analysis revealed that the lasso-cox model had AUC values of 0.773, 0.758, and 0.799, while the lasso-RSF model had AUC values of 0.734, 0.695, and 0.741, respectively. Based on these results, the lasso-cox model was chosen as the superior model. Interpretability assessments using SHAP values indicated that the most significant characteristic risk factors, in descending order of importance, were tumor number, BCLC stage, alkaline phosphatase (ALP), ascites, albumin (ALB), and aspartate aminotransferase (AST). Additionally, through risk score stratification and subgroup analysis, it was observed that the median OS of the low-risk group was significantly better than that of the middle- and high-risk groups.
We have developed an interpretable predictive model for middle and late HCC with tumor diameter ≥ 5 cm using lasso-cox regression analysis. This model demonstrates excellent prediction performance and can be utilized for risk stratification.
在这项回顾性研究中,我们旨在识别关键风险因素,并使用套索回归建立一个可解释的模型,用于直径≥5厘米的肝癌患者,以进行有效的风险分层和临床决策。
本研究纳入了843例晚期肝细胞癌(HCC)且肿瘤直径≥5厘米的患者。使用套索回归筛选多个特征变量,建立了Cox比例风险回归模型和随机生存森林模型(RSF)。通过比较曲线下面积(AUC),选择最佳模型。对模型进行可视化,并确定可解释重要性的顺序。最后,建立风险分层以识别高危患者。
套索回归确定了8个因素为特征性风险因素。随后的分析显示,套索 - Cox模型的AUC值分别为0.773、0.758和0.799,而套索 - RSF模型的AUC值分别为0.734、0.695和0.741。基于这些结果,选择套索 - Cox模型作为 superior 模型。使用SHAP值进行的可解释性评估表明,按重要性降序排列,最显著的特征性风险因素为肿瘤数量、BCLC分期、碱性磷酸酶(ALP)、腹水、白蛋白(ALB)和天冬氨酸氨基转移酶(AST)。此外,通过风险评分分层和亚组分析,观察到低风险组的中位总生存期明显优于中风险组和高风险组。
我们使用套索 - Cox回归分析为肿瘤直径≥5厘米的中晚期肝癌患者开发了一个可解释的预测模型。该模型具有出色的预测性能,可用于风险分层。