Bezjak Miran, Kocman Branislav, Jadrijević Stipislav, Filipec Kanižaj Tajana, Antonijević Miro, Dalbelo Bašić Bojana, Mikulić Danko
Division of Abdominal Surgery and Organ Transplantation, Department of Surgery, University Hospital Merkur, Zagreb, Croatia.
Division of Gastroenterology, Department of Internal Medicine, University Hospital Merkur, Zagreb, Croatia.
Ann Transl Med. 2023 Aug 30;11(10):345. doi: 10.21037/atm-22-6469. Epub 2023 Jun 29.
Hepatocellular carcinoma (HCC) is one of the leading indications for liver transplantation (LT) however, selection criteria remain controversial. We aimed to identify survival factors and predictors for tumour recurrence using machine learning (ML) methods. We also compared ML models to the Cox regression model.
Thirty pretransplant donor and recipient general and tumour specific parameters were analysed from 170 patients who underwent orthotopic liver transplantation for HCC between March 2013 and December 2019 at the University Hospital Merkur, Zagreb. Survival rates were calculated using the Kaplan-Meier method and multivariate analysis was performed using the Cox proportional hazards regression model. Data was also processed through Coxnet (a regularized Cox regression model), Random Survival Forest (RSF), Survival Support Vector Machine (SVM) and Survival Gradient Boosting models, which included pre-processing, variable selection, imputation of missing data, training and cross-validation of the models. The cross-validated concordance index (CI) was used as an evaluation metric and to determine the best performing model.
Kaplan-Meier curves for 5-year survival time showed survival probability of 80% for recipient survival and 82% for graft survival. The 5-year HCC recurrence was observed in 19% of patients. The best predictive accuracy was observed in the RSF model with CI of 0.72, followed by the Survival SVM model (CI 0.70). Overall ML models outperform the Cox regression model with respect to their limitations. Random Forest analysis provided several relevant outcome predictors: alpha fetoprotein (AFP), donor C-reactive protein (CRP), recipient age and neutrophil to lymphocyte ratio (NLR). Cox multivariate analysis showed similarities with RSF models in identifying detrimental variables. Some variables such as donor age and number of transarterial chemoembolization treatments (TACE) were pointed out, but these were not influential in our RSF model.
Using ML methods in addition to classical statistical analysis, it is possible to develop sufficient prognostic models, which, compared to established risk scores, could help us quantify survival probability and make changes in organ utilization.
肝细胞癌(HCC)是肝移植(LT)的主要适应证之一,然而,选择标准仍存在争议。我们旨在使用机器学习(ML)方法确定生存因素和肿瘤复发的预测因素。我们还将ML模型与Cox回归模型进行了比较。
分析了2013年3月至2019年12月在萨格勒布市梅尔库大学医院接受原位肝移植治疗HCC的170例患者的30个移植前供体和受体的一般及肿瘤特异性参数。使用Kaplan-Meier方法计算生存率,并使用Cox比例风险回归模型进行多变量分析。数据还通过Coxnet(一种正则化Cox回归模型)、随机生存森林(RSF)、生存支持向量机(SVM)和生存梯度提升模型进行处理,包括预处理、变量选择、缺失数据插补、模型训练和交叉验证。交叉验证一致性指数(CI)用作评估指标,以确定表现最佳的模型。
5年生存时间的Kaplan-Meier曲线显示,受体生存率为80%,移植物生存率为82%。19%的患者出现了5年HCC复发。在RSF模型中观察到最佳预测准确性,CI为0.72,其次是生存SVM模型(CI为0.70)。总体而言,ML模型在局限性方面优于Cox回归模型。随机森林分析提供了几个相关的结果预测因素:甲胎蛋白(AFP)、供体C反应蛋白(CRP)、受体年龄和中性粒细胞与淋巴细胞比率(NLR)。Cox多变量分析在识别有害变量方面与RSF模型显示出相似性。指出了一些变量,如供体年龄和经动脉化疗栓塞治疗(TACE)的次数,但这些在我们的RSF模型中没有影响。
除了经典统计分析外,使用ML方法可以开发出足够的预后模型,与既定的风险评分相比,这些模型可以帮助我们量化生存概率并改变器官利用情况。