Hu Shiyu, Cai Jingjing, Chen Sizhan, Wang Yang, Ren Lijie
Neurology Department of Shenzhen Second People's Hospital, First Affiliated Hospital of Shenzhen University Health Science Center, Shenzhen, China.
Biochem Biophys Rep. 2023 Dec 7;37:101595. doi: 10.1016/j.bbrep.2023.101595. eCollection 2024 Mar.
Ischemic stroke (IS) is one of most common causes of disability in adults worldwide. However, there is still a lack of effective and reliable diagnostic markers and therapeutic targets in IS. Furthermore, immune cell dysfunction plays an important role in the pathogenesis of IS. Hence, in-depth research on immune-related targets in progressive IS is urgently needed.
Expression profile data from patients with IS were downloaded from the Gene Expression Omnibus (GEO) database. Then, differential expression analysis and weighted gene coexpression network analysis (WGCNA) were performed to identify the significant modules and differentially expressed genes (DEGs). Key genes were obtained and used in functional enrichment analyses by overlapping module genes and DEGs. Next, hub candidate genes were identified by utilizing three machine learning algorithms: least absolute shrinkage and selection operator (LASSO), random forest, and support vector machine-recursive feature elimination (SVM-RFE). Subsequently, a diagnostic model was constructed based on the hub genes, and receiver operating characteristic (ROC) curves were constructed to validate the performances of the predictive models and candidate genes. Finally, the immune cell infiltration landscape of IS was explored with the CIBERSORT deconvolution algorithm.
A total of 40 key DEGs were identified based on the intersection of the DEGs and module genes, and we found that these genes were mainly enriched in the regulation of lipolysis in adipocytes, neutrophil extracellular trap formation and complement and coagulation cascades. Based on the results from three advanced machine learning algorithms, we obtained 7 hub candidate genes (ABCA1, ARG1, C5AR1, CKAP4, HMFN0839, SDCBP and TLN1) as diagnostic biomarkers of IS and developed a reliable nomogram with high predictive performance (AUC = 0.987). In addition, immune cell infiltration dysregulation was implicated in IS, and compared with those in the normal group, IS patients had increased fractions of gamma delta T cells, monocytes, M0 macrophages, M2 macrophages and neutrophils and clearly lower percentages of naive B cells, CD8 T cells, CD4 memory T cells, follicular helper T cells, regulatory T cells (Tregs) and resting dendritic cells. Furthermore, correlation analysis indicated a significant correlation between the hub genes and immune cells in progressive IS.
In conclusion, our study identified 7 hub genes as diagnostic biomarkers and established a reliable model to predict the occurrence of IS. Meanwhile, we explored the immune cell infiltration pattern and investigated the relationship between candidate genes and immune cells in the pathogenesis of IS. Hence, our study provides new insights into the diagnosis and treatment of IS.
缺血性中风(IS)是全球成年人残疾的最常见原因之一。然而,IS 仍缺乏有效且可靠的诊断标志物和治疗靶点。此外,免疫细胞功能障碍在 IS 的发病机制中起重要作用。因此,迫切需要对进展性 IS 中与免疫相关的靶点进行深入研究。
从基因表达综合数据库(GEO)下载 IS 患者的表达谱数据。然后,进行差异表达分析和加权基因共表达网络分析(WGCNA),以识别显著模块和差异表达基因(DEG)。通过重叠模块基因和 DEG 获得关键基因,并用于功能富集分析。接下来,利用三种机器学习算法:最小绝对收缩和选择算子(LASSO)、随机森林和支持向量机递归特征消除(SVM-RFE)来识别枢纽候选基因。随后,基于枢纽基因构建诊断模型,并构建受试者工作特征(ROC)曲线以验证预测模型和候选基因的性能。最后,用 CIBERSORT 反卷积算法探索 IS 的免疫细胞浸润情况。
基于 DEG 与模块基因的交集共鉴定出 40 个关键 DEG,我们发现这些基因主要富集于脂肪细胞中脂解的调节、中性粒细胞胞外陷阱形成以及补体和凝血级联反应。基于三种先进机器学习算法的结果,我们获得了 7 个枢纽候选基因(ABCA1、ARG1、C5AR1、CKAP4、HMFN0839、SDCBP 和 TLN1)作为 IS 的诊断生物标志物,并开发了一种具有高预测性能的可靠列线图(AUC = 0.987)。此外,免疫细胞浸润失调与 IS 有关,与正常组相比,IS 患者的γδT 细胞、单核细胞、M0 巨噬细胞、M2 巨噬细胞和中性粒细胞比例增加,而幼稚 B 细胞、CD8 T 细胞、CD4 记忆 T 细胞、滤泡辅助性 T 细胞、调节性 T 细胞(Tregs)和静息树突状细胞的百分比明显降低。此外,相关性分析表明进展性 IS 中的枢纽基因与免疫细胞之间存在显著相关性。
总之,我们的研究确定了 7 个枢纽基因作为诊断生物标志物,并建立了一个可靠的模型来预测 IS 的发生。同时,我们探索了免疫细胞浸润模式,并研究了候选基因与 IS 发病机制中免疫细胞之间的关系。因此,我们的研究为 IS 的诊断和治疗提供了新的见解。