Comprehensive Cancer Centre of Drum Tower Hospital, Medical School of Nanjing University, Clinical Cancer Institute of Nanjing University, Nanjing, 210008, Jiangsu Province, China.
Shanghai Biotecan Pharmaceuticals Co., Ltd., Pudong New District, Shanghai, China.
Sci Rep. 2020 Mar 10;10(1):4435. doi: 10.1038/s41598-020-61298-3.
Hepatocellular carcinoma (HCC) is a common malignant tumor in China. In the present study, we aimed to construct and verify a prediction model of recurrence in HCC patients using databases (TCGA, AMC and Inserm) and machine learning methods and obtain the gene signature that could predict early relapse of HCC. Statistical methods, such as feature selection, survival analysis and Chi-Square test in R software, were used to analyze and select mutant genes related to disease free survival (DFS), race and vascular invasion. In addition, whole-exome sequencing was performed on 10 HCC patients recruited from our center, and the sequencing results were compared with the databases. Using the databases and machine learning methods, the prediction model of recurrence was constructed and optimized, and the selected mutant genes were verified in the test group. The accuracy of prediction was 74.19%. Moreover, these 10 patients from our center were used to verify these mutant genes and the prediction model, and a success rate of 80% was achieved. Collectively, we discovered recurrence-related genes and established recurrence prediction model of recurrence for HCC patients, which could provide significant guidance for clinical prediction of recurrence.
肝细胞癌 (HCC) 是中国常见的恶性肿瘤。本研究旨在利用数据库 (TCGA、AMC 和 Inserm) 和机器学习方法构建并验证 HCC 患者复发的预测模型,并获得可预测 HCC 早期复发的基因特征。使用 R 软件中的特征选择、生存分析和卡方检验等统计方法,分析和筛选与无病生存 (DFS)、种族和血管侵犯相关的突变基因。此外,对从我们中心招募的 10 名 HCC 患者进行了全外显子测序,并将测序结果与数据库进行了比较。利用数据库和机器学习方法构建并优化了复发预测模型,并在实验组中验证了所选的突变基因。预测的准确性为 74.19%。此外,我们使用中心的这 10 名患者来验证这些突变基因和预测模型,成功率达到 80%。总之,我们发现了与复发相关的基因,并建立了 HCC 患者的复发预测模型,可为临床预测复发提供重要指导。