Li Wenli, Lu Jianjun, Ma Zhanzhong, Zhao Jiafeng, Liu Jun
Department of Clinical Laboratory, Yue Bei People's Hospital, Shantou University Medical College, Shaoguan, China.
Department of Reproductive Medicine Center, The Affiliated Yue Bei People's Hospital of Shantou University Medical College, Shaoguan, China.
Front Genet. 2020 Jan 14;10:1323. doi: 10.3389/fgene.2019.01323. eCollection 2019.
Nowadays, clinical treatment outcomes of patients with hepatocellular carcinoma (HCC) have been improved. However, due to the complexity of the molecular mechanisms, the recurrence rate and mortality in HCC inpatients are still at a high level. Therefore, there is an urgent need in screening biomarkers of HCC to show therapeutic effects and improve the prognosis. In this study, we aim to establish a gene signature that can predict the prognosis of HCC patients by downloading and analyzing RNA sequencing data and clinical information from three independent public databases. Firstly, we applied the limma R package to analyze biomarkers by the genetic data and clinical information downloaded from the Gene Expression Omnibus database (GEO), and then used the least absolute shrinkage and selection operator (LASSO) Cox regression and survival analysis to establish a gene signature and a prediction model by data from the Cancer Genome Atlas (TCGA). Besides, messenger RNA (mRNA) and protein expressions of the six-gene signature were explored using Oncomine, Human Protein Atlas (HPA) and the International Cancer Genome Consortium (ICGC). A total of 8,306 differentially expressed genes (DEGs) were obtained between HCC ( = 115) and normal tissues ( = 52). Top 5,000 significant genes were selected and subjected to the weighted correlation network analysis (WGCNA), which constructed nine gene co-expression modules that assign these genes to different modules by cluster dendrogram trees. By analyzing the most significant module (red module), six genes (SQSTM1, AHSA1, VNN2, SMG5, SRXN1, and GLS) were screened by univariate, LASSO, and multivariate Cox regression analysis. By a survival analysis with the HCC data in TCGA, we established a nomogram based on the six-gene signature and multiple clinicopathological features. The six-gene signature was then validated as an independent prognostic factor in independent HCC cohort from ICGC. Receiver operating characteristic (ROC) curve analysis confirmed the predictive capacity of the six-gene signature and nomogram. Besides, overexpression of the six genes at the mRNA and protein levels was validated using Oncomine and HPA, respectively. The predictive six-gene signature and nomograms established in this study can assist clinicians in selecting personalized treatment for patients with HCC.
如今,肝细胞癌(HCC)患者的临床治疗效果已有改善。然而,由于分子机制的复杂性,HCC住院患者的复发率和死亡率仍处于较高水平。因此,迫切需要筛选HCC的生物标志物以显示治疗效果并改善预后。在本研究中,我们旨在通过下载和分析来自三个独立公共数据库的RNA测序数据和临床信息,建立一种能够预测HCC患者预后的基因特征。首先,我们应用limma R包,通过从基因表达综合数据库(GEO)下载的基因数据和临床信息来分析生物标志物,然后使用最小绝对收缩和选择算子(LASSO)Cox回归和生存分析,利用来自癌症基因组图谱(TCGA)的数据建立基因特征和预测模型。此外,使用Oncomine、人类蛋白质图谱(HPA)和国际癌症基因组联盟(ICGC)探索了六基因特征的信使RNA(mRNA)和蛋白质表达。在HCC(n = 115)和正常组织(n = 52)之间共获得8306个差异表达基因(DEG)。选择前5000个显著基因并进行加权基因共表达网络分析(WGCNA),该分析构建了九个基因共表达模块,通过聚类树状图将这些基因分配到不同模块。通过分析最显著的模块(红色模块),通过单变量、LASSO和多变量Cox回归分析筛选出六个基因(SQSTM1、AHSA1、VNN2、SMG5、SRXN1和GLS)。通过对TCGA中的HCC数据进行生存分析,我们基于六基因特征和多个临床病理特征建立了列线图。然后,在来自ICGC的独立HCC队列中,六基因特征被验证为独立的预后因素。受试者工作特征(ROC)曲线分析证实了六基因特征和列线图的预测能力。此外,分别使用Oncomine和HPA验证了这六个基因在mRNA和蛋白质水平上的过表达。本研究中建立的预测性六基因特征和列线图可帮助临床医生为HCC患者选择个性化治疗。