Li Yang, Wang Xu, Shi Liangliang, Xu Jianming, Sun Bin
Department of Gastroenterology, the First Affiliated Hospital of Anhui Medical University, Hefei 230001, China.
Department of Gastroenterology, Nanjing Drum Tower Hospital, the Affiliated Hospital of Nanjing University Medical School, Nanjing University, Nanjing 210008, China.
Transl Cancer Res. 2020 Jan;9(1):85-94. doi: 10.21037/tcr.2019.11.11.
Esophageal squamous cell carcinoma (ESCC) is one of the most common malignant neoplasms of the digestive tract worldwide. The lack of key molecular biomarkers is associated with the poor prognosis in ESCC patients. The present study was aimed to identify candidate genes for diagnostic, prognostic, and therapeutic applications in ESCC by bioinformatics.
Two datasets of ESCC (GSE20347 and GSE70409) from gene expression omnibus (GEO) were analyzed using GEO2R online tool to identify the differentially expressed genes (DEGs). Subsequently, functions and pathways enrichment analyses of DEGs and their protein-protein interaction (PPI) network analyses were performed. When key DEGs were identified, their relationship with ESCC prognosis was further validated.
There were 134 commonly changed DEGs (33 up-regulated and 101 down-regulated) from GSE20347 and GSE70409 datasets were identified using integrated bioinformatical analysis. Gene ontology (GO) and pathway enrichment analysis was performed to annotate genes and gene products, highlight biological processes (BPs) and systemic functional information. Through the PPI network and cluster analysis, two clusters containing 21 key DEGs were detected and 14 of them were validated based on TCGA and GTEx data. Among these key DEGs, and were significantly associated with the prognosis in ESCC cases.
In conclusion, a total of 14 key DEGs and outcome in ESCC were identified by integrated bioinformatics analyses. and might be novel potential diagnostic and prognostic biomarkers in ESCC.
食管鳞状细胞癌(ESCC)是全球最常见的消化道恶性肿瘤之一。关键分子生物标志物的缺乏与ESCC患者的不良预后相关。本研究旨在通过生物信息学鉴定ESCC诊断、预后和治疗应用的候选基因。
使用GEO2R在线工具分析来自基因表达综合数据库(GEO)的两个ESCC数据集(GSE20347和GSE70409),以鉴定差异表达基因(DEG)。随后,对DEG进行功能和通路富集分析以及蛋白质-蛋白质相互作用(PPI)网络分析。当鉴定出关键DEG时,进一步验证它们与ESCC预后的关系。
通过综合生物信息学分析,从GSE20347和GSE70409数据集中鉴定出134个共同变化的DEG(33个上调和101个下调)。进行基因本体(GO)和通路富集分析以注释基因和基因产物,突出生物学过程(BP)和系统功能信息。通过PPI网络和聚类分析,检测到两个包含21个关键DEG的聚类,其中14个基于TCGA和GTEx数据得到验证。在这些关键DEG中,[具体基因1]和[具体基因2]与ESCC病例的预后显著相关。
总之,通过综合生物信息学分析鉴定出ESCC中的14个关键DEG及其结果。[具体基因1]和[具体基因2]可能是ESCC中新型潜在的诊断和预后生物标志物。