Department of Interventional Vascular Surgery, Affiliated Hospital (Clinical College) of Xiangnan University, Chenzhou, P. R. China; Key Laboratory of Medical Imaging and Artifical Intelligence of Hunan Province, Chenzhou, P. R. China.
Department of Oncology, Affiliated Hospital (Clinical College) of Xiangnan University, Chenzhou, P. R. China.
Turk J Gastroenterol. 2023 Jul;34(7):760-770. doi: 10.5152/tjg.2023.22357.
BACKGROUND/AIMS: Hepatocellular carcinoma, a highly malignant tumor, is difficult to diagnose, treat, and predict the prognosis. Notch signaling pathway can affect hepatocellular carcinoma. We aimed to predict the occurrence of hepatocellular carcinoma based on Notch signal-related genes using machine learning algorithms.
We downloaded hepatocellular carcinoma data from the Cancer Genome Atlas and Gene Expression Omnibus databases and used machine learning methods to screen the hub Notch signal-related genes. Machine learning classification was used to construct a prediction model for the classification and diagnosis of hepatocellular carcinoma cancer. Bioinformatics methods were applied to explore the expression of these hub genes in the hepatocellular carcinoma tumor immune microenvironment.
We identified 4 hub genes, namely, LAMA4, POLA2, RAD51, and TYMS, which were used as the final variables, and found that AdaBoostClassifie was the best algorithm for the classification and diagnosis model of hepatocellular carcinoma. The area under curve, accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score of this model in the training set were 0.976, 0.881, 0.877, 0.977, 0.996, 0.500, and 0.932; respectively. The area under curves were 0.934, 0.863, 0.881, 0.886, 0.981, 0.489, and 0.926. The area under curve in the external validation set was 0.934. Immune cell infiltration was related to the expression of 4 hub genes. Patients in the low-risk group of hepatocellular carcinoma were more likely to have an immune escape.
The Notch signaling pathway was closely related to the occurrence and development of hepatocellular carcinoma. The hepatocellular carcinoma classification and diagnosis model established based on this had a high degree of reliability and stability.
背景/目的:肝细胞癌是一种高度恶性的肿瘤,其诊断、治疗和预后预测都很困难。Notch 信号通路可以影响肝细胞癌。我们旨在使用机器学习算法基于 Notch 信号相关基因预测肝细胞癌的发生。
我们从癌症基因组图谱和基因表达综合数据库中下载了肝细胞癌数据,并使用机器学习方法筛选出与 Notch 信号相关的关键基因。使用机器学习分类构建肝细胞癌分类和诊断预测模型。应用生物信息学方法探讨这些关键基因在肝细胞癌肿瘤免疫微环境中的表达。
我们确定了 4 个关键基因,即 LAMA4、POLA2、RAD51 和 TYMS,将其作为最终变量,并发现 AdaBoostClassifier 是肝细胞癌分类和诊断模型的最佳算法。该模型在训练集中的曲线下面积、准确性、敏感度、特异性、阳性预测值、阴性预测值和 F1 评分分别为 0.976、0.881、0.877、0.977、0.996、0.500 和 0.932;外部验证集中的曲线下面积分别为 0.934、0.863、0.881、0.886、0.981、0.489 和 0.926。外部验证集的曲线下面积为 0.934。免疫细胞浸润与 4 个关键基因的表达有关。肝细胞癌低危组患者更容易发生免疫逃逸。
Notch 信号通路与肝细胞癌的发生发展密切相关。基于此建立的肝细胞癌分类和诊断模型具有较高的可靠性和稳定性。