Medical College, Anhui University of Science and Technology, Huainan, Anhui, 232001, People's Republic of China.
Department of Medical Laboratory, The Affiliated Huaian No. 1 People's Hospital of Nanjing Medical University, Huai'an, Jiangsu, 223300, People's Republic of China.
World J Surg Oncol. 2021 Aug 12;19(1):240. doi: 10.1186/s12957-021-02360-w.
Oral cancer (OC) is a common and dangerous malignant tumor with a low survival rate. However, the micro level mechanism has not been explained in detail.
Gene and miRNA expression micro array data were extracted from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) and miRNAs (DE miRNAs) were identified by R software. Gene Ontology (GO) enrichment and Kyoto Encyclopedia of genes and genomes (KEGG) pathway analysis were used to assess the potential molecular mechanisms of DEGs. Cytoscape software was utilized to construct protein-protein interaction (PPI) network and miRNA-gene network. Central genes were screened out with the participation of gene degree, molecular complex detection (MCODE) plugin, and miRNA-gene network. Then, the identified genes were checked by The Cancer Genome Atlas (TCGA) gene expression profile, Kaplan-Meier data, Oncomine, and the Human Protein Atlas database. Receiver operating characteristic (ROC) curve was drawn to predict the diagnostic efficiency of crucial gene level in normal and tumor tissues. Univariate and multivariate Cox regression were used to analyze the effect of dominant genes and clinical characteristics on the overall survival rate of OC patients.
Gene expression data of gene expression profiling chip(GSE9844, GSE30784, and GSE74530) were obtained from GEO database, including 199 tumor and 63 non-tumor samples. We identified 298 gene mutations, including 200 upregulated and 98 downregulated genes. GO functional annotation analysis showed that DEGs were enriched in extracellular structure and extracellular matrix containing collagen. In addition, KEGG pathway enrichment analysis demonstrated that the DEGs were significantly enriched in IL-17 signaling pathway and PI3K-Akt signaling pathway. Then, we detected three most relevant modules in PPI network. Central genes (CXCL8, DDX60, EIF2AK2, GBP1, IFI44, IFI44L, IFIT1, IL6, MMP9,CXCL1, CCL20, RSAD2, and RTP4) were screened out with the participation of MCODE plugin, gene degree, and miRNA-gene network. TCGA gene expression profile and Kaplan-Meier analysis showed that high expression of CXCL8, DDX60, IL6, and RTP4 was associated with poor prognosis in OC patients, while patients with high expression of IFI44L and RSAD2 had a better prognosis. The elevated expression of CXCL8, DDX60, IFI44L, RSAD2, and RTP44 in OC was verified by using Oncomine database. ROC curve showed that the mRNA levels of these five genes had a helpful diagnostic effect on tumor tissue. The Human Protein Atlas database showed that the protein expressions of DDX60, IFI44L, RSAD2, and RTP44 in tumor tissues were higher than those in normal tissues. Finally, univariate and multivariate Cox regression showed that DDX60, IFI44L, RSAD2, and RTP44 were independent prognostic indicators of OC.
This study revealed the potential biomarkers and relevant pathways of OC from publicly available GEO database, and provided a theoretical basis for elucidating the diagnosis, treatment, and prognosis of OC.
口腔癌(OC)是一种常见且危险的恶性肿瘤,其生存率较低。然而,其微观水平的机制尚未详细阐明。
从基因表达综合数据库(GEO)中提取基因和 miRNA 表达微阵列数据。使用 R 软件鉴定差异表达基因(DEGs)和 miRNA(DE miRNAs)。通过基因本体论(GO)富集和京都基因与基因组百科全书(KEGG)途径分析来评估 DEGs 的潜在分子机制。使用 Cytoscape 软件构建蛋白质-蛋白质相互作用(PPI)网络和 miRNA-基因网络。通过基因度、分子复合物检测(MCODE)插件和 miRNA-基因网络参与筛选出核心基因。然后,通过癌症基因组图谱(TCGA)基因表达谱、Kaplan-Meier 数据、Oncomine 和人类蛋白质图谱数据库对鉴定出的基因进行验证。绘制受试者工作特征(ROC)曲线以预测关键基因在正常和肿瘤组织中的诊断效率。使用单变量和多变量 Cox 回归分析优势基因和临床特征对 OC 患者总生存率的影响。
从 GEO 数据库中获得了基因表达谱芯片(GSE9844、GSE30784 和 GSE74530)的基因表达数据,包括 199 个肿瘤样本和 63 个非肿瘤样本。我们鉴定出 298 个基因突变,包括 200 个上调基因和 98 个下调基因。GO 功能注释分析表明,DEGs 富集在外泌体结构和含有胶原蛋白的细胞外基质中。此外,KEGG 途径富集分析表明,DEGs 显著富集在 IL-17 信号通路和 PI3K-Akt 信号通路中。然后,我们在 PPI 网络中检测到三个最相关的模块。通过 MCODE 插件、基因度和 miRNA-基因网络的参与,筛选出核心基因(CXCL8、DDX60、EIF2AK2、GBP1、IFI44、IFI44L、IFIT1、IL6、MMP9、CXCL1、CCL20、RSAD2 和 RTP4)。TCGA 基因表达谱和 Kaplan-Meier 分析表明,OC 患者中 CXCL8、DDX60、IL6 和 RTP4 的高表达与预后不良相关,而 IFI44L 和 RSAD2 高表达的患者预后较好。Oncomine 数据库验证了 OC 中 CXCL8、DDX60、IFI44L、RSAD2 和 RTP4 的高表达。ROC 曲线表明,这五个基因的 mRNA 水平对肿瘤组织具有有益的诊断作用。人类蛋白质图谱数据库显示,肿瘤组织中 DDX60、IFI44L、RSAD2 和 RTP44 的蛋白表达高于正常组织。最后,单变量和多变量 Cox 回归表明,DDX60、IFI44L、RSAD2 和 RTP44 是 OC 的独立预后指标。
本研究从公共可用的 GEO 数据库中揭示了 OC 的潜在生物标志物和相关途径,为阐明 OC 的诊断、治疗和预后提供了理论依据。