Dai Gong-Peng, Wang Li-Ping, Wen Yu-Qing, Ren Xue-Qun, Zuo Shu-Guang
Department of General Surgery, Huaihe Hospital of Henan University, Kaifeng, Henan 475001, P.R. China.
Center for Translational Medicine, Huaihe Hospital of Henan University, Kaifeng, Henan 475001, P.R. China.
Oncol Lett. 2020 Jan;19(1):388-398. doi: 10.3892/ol.2019.11068. Epub 2019 Nov 7.
Colorectal cancer (CRC) is a life-threatening disease with a poor prognosis. Therefore, it is crucial to identify molecular prognostic biomarkers for CRC. The present study aimed to identify potential key genes that could be used to predict the prognosis of patients with CRC. Three CRC microarray datasets (GSE20916, GSE73360 and GSE44861) were downloaded from the Gene Expression Omnibus (GEO) database, and one dataset was obtained from The Cancer Genome Atlas (TCGA) database. The three GEO datasets were analyzed to detect differentially expressed genes (DEGs) using the BRB-ArrayTools software. Functional and pathway enrichment analyses of these DEGs were performed using the Database for Annotation, Visualization and Integrated Discovery tool. A protein-protein interaction (PPI) network of DEGs was constructed, hub genes were extracted, and modules of the PPI network were analyzed. To investigate the prognostic values of the hub genes in CRC, data from the CRC datasets of TCGA were used to perform the survival analyses based on the sample splitting method and Cox regression model. Correlation among the hub genes was evaluated using Spearman's correlation analysis. In the three GEO datasets, a total of 105 common DEGs were identified, including 51 down- and 54 up-regulated genes in CRC compared with normal colorectal tissues. A PPI network consisting of 100 DEGs and 551 edges was constructed, and 44 nodes were identified as hub genes. Among these 44 genes, the four hub genes TIMP metallopeptidase inhibitor 1 (TIMP1), solute carrier family 4 member 4 (SLC4A4), aldo-keto reductase family 1 member B10 (AKR1B10) and ATP binding cassette subfamily E member 1 (ABCE1) were associated with overall survival (OS) in patients with CRC. Three significant modules were extracted from the PPI network. The hub gene TIMP1 was present in Module 1, ABCE1 was involved in Module 2 and SLC4A4 was identified in Module 3. Univariate analysis revealed that TIMP1, SLC4A4, AKR1B10 and ABCE1 were associated with the OS of patients with CRC. Multivariate analysis demonstrated that SLC4A4 may be an independent prognostic factor associated with OS. Furthermore, the results from correlation analysis revealed that there was no correlation between TIMP1, SLC4A4 and ABCE1, whereas AKR1B10 was positively correlated with SLC4A4. In conclusion, the four key genes TIMP1, SLC4A4, AKR1B10 and ABCE1 associated with the OS of patients with CRC were identified by integrated bioinformatics analysis. These key genes may be used as prognostic biomarkers to predict the survival of patients with CRC, and may therefore represent novel therapeutic targets for CRC.
结直肠癌(CRC)是一种预后不良的危及生命的疾病。因此,识别CRC的分子预后生物标志物至关重要。本研究旨在识别可用于预测CRC患者预后的潜在关键基因。从基因表达综合数据库(GEO)下载了三个CRC微阵列数据集(GSE20916、GSE73360和GSE44861),并从癌症基因组图谱(TCGA)数据库获得了一个数据集。使用BRB-ArrayTools软件分析这三个GEO数据集以检测差异表达基因(DEG)。使用注释、可视化和综合发现数据库工具对这些DEG进行功能和通路富集分析。构建了DEG的蛋白质-蛋白质相互作用(PPI)网络,提取了枢纽基因,并分析了PPI网络的模块。为了研究枢纽基因在CRC中的预后价值,使用TCGA的CRC数据集的数据基于样本拆分方法和Cox回归模型进行生存分析。使用Spearman相关分析评估枢纽基因之间的相关性。在这三个GEO数据集中,共鉴定出105个常见的DEG,与正常结直肠组织相比,CRC中有51个基因下调和54个基因上调。构建了一个由100个DEG和551条边组成的PPI网络,并将44个节点鉴定为枢纽基因。在这44个基因中,四个枢纽基因金属蛋白酶组织抑制因子1(TIMP1)、溶质载体家族4成员4(SLC4A4)、醛糖还原酶家族1成员B10(AKR1B10)和ATP结合盒亚家族E成员1(ABCEI)与CRC患者的总生存期(OS)相关。从PPI网络中提取了三个重要模块。枢纽基因TIMP1存在于模块1中,ABCE1参与模块2,SLC4A4在模块3中被鉴定。单因素分析显示,TIMP1、SLC4A4、AKR1B10和ABCE1与CRC患者的OS相关。多因素分析表明,SLC4A4可能是与OS相关的独立预后因素。此外,相关分析结果显示,TIMP1、SLC4A4和ABCE1之间无相关性,而AKR1B10与SLC4A4呈正相关。总之,通过综合生物信息学分析鉴定了与CRC患者OS相关的四个关键基因TIMP1、SLC4A4、AKR1B10和ABCE1。这些关键基因可作为预后生物标志物来预测CRC患者的生存情况,因此可能代表CRC的新型治疗靶点。