Li Meng, Mei Yulin, Pan Aijun
Department of Critical Care Medicine, the Affiliated Provincial Hospital of Anhui Medical University, Hefei 230001, Anhui, China.
Wannan Medical College, Wuhu 241002, Anhui, China. Corresponding author: Pan Aijun, Email:
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Oct;36(10):1025-1032. doi: 10.3760/cma.j.cn121430-20240524-00457.
To explore the characteristics of key ferroptosis-related genes as therapeutic targets for sepsis based on bioinformatics analysis, and describe their immune characteristics.
The transcriptome datasets GSE57065, GSE9960, GSE28750, and GSE137340 were downloaded from the Gene Expression Omnibus (GEO) database, immune-related gene (IRG) were obtained from ImmPort and InnateDB databases, and ferroptosis-related gene (FRG) were downloaded from the FerrDb database. The datasets GSE57065, GSE9960, and GSE28750 were integrated into an analysis dataset by the surrogate variable analysis (SVA) package and analyzed this analysis dataset by using the "limma" package to obtain differentially expressed gene (DEG), then the intersection set of DEG, FRG, and IRG were considered as ferroptosis and immune-related DEG (FImDEG). Gene ontology (GO) functional annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed using "ClusterProfiler" to understand the biological function of FImDEG. The key genes were screened by protein-protein interaction (PPI) network, least absolute shrinkage and selection operator (LASSO) regression algorithms, and support vector machine (SVM) analyses, and Logistic regression model was built based on above key genes. Receiver operator characteristics curve (ROC curve) was plotted to evaluate the diagnostic efficacy of the key genes alone or combinative. The degree of infiltration of 22 immune cells was assessed using the "CIBERSORT" package, and the correlation between the expressions of key genes and infiltration degree of immune cells was analyzed. Dataset GSE137340 was used to verify these key genes.
A dataset consisting of 146 sepsis samples and 61 healthy control samples was obtained by processing the database and removing batch effect. A total of 4 537 DEG were obtained, including 2 066 up-regulated genes and 2 471 down-regulated genes. 2 519 IRG and 855 FRG were obtained from the relevant database. Using the intersection of DEG, IRG and FRG, 34 FImDEG were obtained, including 20 up-regulated genes and 14 down-regulated genes. GO functional annotation showed that the biological functions of 34 FImDEG were mainly inhibition of transferase activity, regulation of DNA-binding transcription factor activity and cell response to stimulation. In terms of molecular function, it was mainly related to RNA polymerase II-specific DNA-binding transcription factor binding and various protein ligase binding. Changes in cell composition occurred mainly in promyelocytic leukemia protein and chromatin silencing complexes. Enrichment analysis of KEGG pathway showed that the major pathways involved in 34 FImDEG included cell aging, expression of programmed death-ligand 1 (PD-L1) and programmed death-1 (PD-1) checkpoint pathways in cancer, interleukin-17 (IL-17) signaling pathway, lipid and atherosclerosis, and NOD-like receptor signaling pathway. Four key genes, including cytochrome b-245 β chain (CYBB), mitogen-activated protein kinase 14 (MAPK14), prostaglandin-endoperoxide synthase 2 (PTGS2) and V-relreticuloendotheliosis viral oncogene homology A (RELA), were screened through PPI network and LASSO and SVM machine learning. ROC curve analysis showed that the area under ROC curve (AUC) of the four key genes for diagnosing sepsis was all greater than 0.65, and the AUC of MAPK14 was 0.911. Logistic regression model was constructed based on four key genes, and the AUC was 0.956. Immunoinfiltration analysis showed that compared with healthy control samples, the infiltration degree of neutrophils and macrophages M0 was significantly increased in sepsis samples, while the infiltration degree of resting natural killer cell (NK cell), naive CD4 T cell and CD8 T cell was significantly lowered. Correlation analysis showed that the positive correlation between MAPK14 expression and the infiltration degree of neutrophils was the highest. Validation results in the GSE137340 dataset showed that compared with healthy control samples, the expressions of CYBB and MAPK14 in sepsis samples were significantly up-regulated, however, the expressions of PTGS2 and RELA were significantly down-regulated, similar to the expression trend in the above analysis dataset.
Four key genes, including CYBB, MAPK14, PTGS2, and RELA, in the development of sepsis were identified through bioinformatics analysis, which play an important role in the immune process, and MAPK14 may be an important target for immune intervention.
基于生物信息学分析探索关键铁死亡相关基因作为脓毒症治疗靶点的特征,并描述其免疫特征。
从基因表达综合数据库(GEO)下载转录组数据集GSE57065、GSE9960、GSE28750和GSE137340,从ImmPort和InnateDB数据库获取免疫相关基因(IRG),从FerrDb数据库下载铁死亡相关基因(FRG)。使用替代变量分析(SVA)软件包将数据集GSE57065、GSE9960和GSE28750整合为一个分析数据集,并使用“limma”软件包对该分析数据集进行分析以获得差异表达基因(DEG),然后将DEG、FRG和IRG的交集视为铁死亡和免疫相关差异表达基因(FImDEG)。使用“ClusterProfiler”进行基因本体(GO)功能注释和京都基因与基因组百科全书(KEGG)富集分析,以了解FImDEG的生物学功能。通过蛋白质-蛋白质相互作用(PPI)网络、最小绝对收缩和选择算子(LASSO)回归算法以及支持向量机(SVM)分析筛选关键基因,并基于上述关键基因构建逻辑回归模型。绘制受试者工作特征曲线(ROC曲线)以评估关键基因单独或联合诊断的效能。使用“CIBERSORT”软件包评估22种免疫细胞浸润程度,并分析关键基因表达与免疫细胞浸润程度之间的相关性。使用数据集GSE137340验证这些关键基因。
通过处理数据库并消除批次效应,获得了一个由146个脓毒症样本和61个健康对照样本组成的数据集。共获得4537个DEG,包括2066个上调基因和2471个下调基因。从相关数据库中获得2519个IRG和855个FRG。利用DEG、IRG和FRG的交集,获得34个FImDEG, 包括20个上调基因和14个下调基因。GO功能注释显示,34个FImDEG的生物学功能主要为转移酶活性抑制、DNA结合转录因子活性调节和细胞对刺激的反应。在分子功能方面,主要与RNA聚合酶II特异性DNA结合转录因子结合和各种蛋白质连接酶结合有关。细胞组成变化主要发生在早幼粒细胞白血病蛋白和染色质沉默复合物中。KEGG通路富集分析显示,34个FImDEG涉及的主要通路包括细胞衰老、癌症中程序性死亡配体1(PD-L1)和程序性死亡1(PD-1)检查点通路的表达、白细胞介素-17(IL-17)信号通路、脂质与动脉粥样硬化以及NOD样受体信号通路。通过PPI网络以及LASSO和SVM机器学习筛选出4个关键基因,包括细胞色素b-245β链(CYBB)、丝裂原活化蛋白激酶14(MAPK14)、前列腺素内过氧化物合酶2(PTGS2)和V-rel网状内皮增生病毒癌基因同源物A(RELA)。ROC曲线分析显示,4个关键基因诊断脓毒症的ROC曲线下面积(AUC)均大于0.65,其中MAPK14的AUC为0.911。基于4个关键基因构建逻辑回归模型,AUC为0.956。免疫浸润分析显示,与健康对照样本相比,脓毒症样本中中性粒细胞和M0巨噬细胞浸润程度显著增加,而静息自然杀伤细胞(NK细胞)、初始CD4 T细胞和CD8 T细胞浸润程度显著降低。相关性分析显示,MAPK14表达与中性粒细胞浸润程度的正相关性最高。GSE137340数据集中的验证结果显示,与健康对照样本相比,脓毒症样本中CYBB和MAPK14的表达显著上调,然而,PTGS2和RELA的表达显著下调,与上述分析数据集中的表达趋势相似。
通过生物信息学分析确定了脓毒症发生发展中的4个关键基因,包括CYBB、MAPK14、PTGS2和RELA,它们在免疫过程中起重要作用,MAPK14可能是免疫干预的重要靶点。