Wang Yan, Wang Ying-Shao, Hu Nai-Bo, Teng Guang-Shuai, Zhou Yuan, Bai Jie
Department of Hematology, The Second Hospital of Tianjin Medical University, Tianjin 300211, China.
State Key Laboratory of Experimental Hematology, Institute of Hematology and Blood Diseases Hospital, National Clinical Research Center for Blood Diseases, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China.
Zhongguo Shi Yan Xue Ye Xue Za Zhi. 2022 Jun;30(3):804-812. doi: 10.19746/j.cnki.issn.1009-2137.2022.03.023.
To screen differentially expressed gene (DEG) related to myelodysplastic syndrome (MDS) based on Gene Expression Omnibus (GEO) database, and explore the core genes and pathogenesis of MDS by analyzing the biological functions and related signaling pathways of DEG.
The expression profiles of GSE4619, GSE19429, GSE58831 including MDS patients and normal controls were downloaded from GEO database. The gene expression analysis tool (GEO2R) of GEO database was used to screen DEG according to | log FC (fold change) |≥1 and P<0.01. David online database was used to annotate gene ontology function (GO). Metascape online database was used to enrich and analyze differential genes in Kyoto Encyclopedia of Genes and Genomes (KEGG). The protein-protein interaction network (PPI) was constructed by using STRING database. CytoHubba and Mcode plug-ins of Cytoscape were used to analyze the key gene clusters and hub genes. R language was used to diagnose hub genes and draw the ROC curve. GSEA enrichment analysis was performed on GSE19429 according to the expression of LEF1.
A total of 74 co-DEG were identified, including 14 up-regulated genes and 60 down regulated genes. GO enrichment analysis indicated that BP of down regulated genes was mainly enriched in the transcription and regulation of RNA polymerase II promoter, negative regulation of cell proliferation, and immune response. CC of down regulated genes was mainly enriched in the nucleus, transcription factor complexes, and adhesion spots. MF was mainly enriched in protein binding, DNA binding, and β-catenin binding. KEGG pathway was enriched in primary immunodeficiency, Hippo signaling pathway, cAMP signaling pathway, transcriptional mis-regulation in cancer and hematopoietic cell lineage. BP of up-regulated genes was mainly enriched in type I interferon signaling pathway and viral response. CC was mainly enriched in cytoplasm. MF was mainly enriched in RNA binding. Ten hub genes and three important gene clusters were screened by STRING database and Cytoscape software. The functions of the three key gene clusters were closely related to immune regulation. ROC analysis showed that the hub genes had a good diagnostic significance for MDS. GSEA analysis indicated that LEF1 may affect the normal function of hematopoietic stem cells by regulating inflammatory reaction, which further revealed the pathogenesis of MDS.
Bioinformatics can effectively screen the core genes and key signaling pathways of MDS, which provides a new strategy for the diagnosis and treatment of MDS.
基于基因表达综合数据库(GEO)筛选与骨髓增生异常综合征(MDS)相关的差异表达基因(DEG),并通过分析DEG的生物学功能及相关信号通路,探讨MDS的核心基因及发病机制。
从GEO数据库下载包含MDS患者和正常对照的GSE4619、GSE19429、GSE58831基因表达谱。利用GEO数据库的基因表达分析工具(GEO2R),根据|log FC(倍数变化)|≥1且P<0.01筛选DEG。使用David在线数据库注释基因本体功能(GO)。利用Metascape在线数据库对京都基因与基因组百科全书(KEGG)中的差异基因进行富集分析。通过STRING数据库构建蛋白质-蛋白质相互作用网络(PPI)。使用Cytoscape的CytoHubba和Mcode插件分析关键基因簇和枢纽基因。用R语言对枢纽基因进行诊断并绘制ROC曲线。根据LEF1的表达对GSE19429进行基因集富集分析(GSEA)。
共鉴定出74个共DEG,其中上调基因14个,下调基因60个。GO富集分析表明,下调基因的生物学过程(BP)主要富集于RNA聚合酶II启动子的转录和调控、细胞增殖的负调控以及免疫应答。下调基因的细胞组分(CC)主要富集于细胞核、转录因子复合物和黏着斑。分子功能(MF)主要富集于蛋白质结合、DNA结合和β-连环蛋白结合。KEGG通路富集于原发性免疫缺陷、Hippo信号通路、cAMP信号通路、癌症中的转录失调和造血细胞谱系。上调基因的BP主要富集于I型干扰素信号通路和病毒应答。CC主要富集于细胞质。MF主要富集于RNA结合。通过STRING数据库和Cytoscape软件筛选出10个枢纽基因和3个重要基因簇。三个关键基因簇的功能与免疫调节密切相关。ROC分析表明,枢纽基因对MDS具有良好的诊断意义。GSEA分析表明,LEF1可能通过调节炎症反应影响造血干细胞的正常功能,进一步揭示了MDS的发病机制。
生物信息学可有效筛选MDS的核心基因和关键信号通路,为MDS的诊断和治疗提供新策略。