Wang Hao, Len Linghan, Hu Li, Hu Yingchun
Clinical Medical College, Southwest Medical University, Luzhou, People's Republic of China.
Department of Emergency Medicine, The Affiliated Hospital of Southwest Medical University, Luzhou, People's Republic of China.
Sci Rep. 2025 Jan 10;15(1):1557. doi: 10.1038/s41598-025-85799-1.
This research aimed to identify novel indicators for sepsis by analyzing RNA sequencing data from peripheral blood samples obtained from sepsis patients (n = 23) and healthy controls (n = 10). 5148 differentially expressed genes were identified using the DESeq2 technique and 5636 differentially expressed genes were identified by the limma method(|Log2 Fold Change|≥2, FDR < 0.05). A total of 1793 immune-related genes were identified from the ImmPort database, with 358 genes identified in both groups. Next, a Biological association network was constructed, and five key hub genes (CD4, HLA-DOB, HLA-DRB1, HLA-DRA, AHNAK) were identified using a combination of three topological analysis algorithms (MCC, Closeness, and MNC) and four machine learning algorithms (Random Forest, LASSO regression, SVM, and XGBoost). immune cell distribution showed that the key genes correlated with multiple immune cell infiltrations. Gene Set Enrichment Analysis (GSEA) revealed that the key genes involved multiple immune response and inflammation-related signaling pathways. Subsequently, diagnostic models were constructed using four machine learning algorithms (Logistic regression, AdaBoost, KNN, and XGBoost) based on the identified key genes. Models with the highest performance were then selected. Ultimately, single-cell sequencing data revealed that the identified key genes were expressed in various immune cells, while Quantitative PCR (qPCR) tests confirmed their reduced expression in the sepsis group.
本研究旨在通过分析脓毒症患者(n = 23)和健康对照者(n = 10)外周血样本的RNA测序数据,确定脓毒症的新指标。使用DESeq2技术鉴定出5148个差异表达基因,通过limma方法鉴定出5636个差异表达基因(|Log2倍数变化|≥2,FDR < 0.05)。从ImmPort数据库中总共鉴定出1793个免疫相关基因,两组中均鉴定出358个基因。接下来,构建了一个生物关联网络,并使用三种拓扑分析算法(MCC、紧密性和MNC)和四种机器学习算法(随机森林、LASSO回归、支持向量机和XGBoost)的组合,鉴定出五个关键枢纽基因(CD4、HLA - DOB、HLA - DRB1、HLA - DRA、AHNAK)。免疫细胞分布显示关键基因与多种免疫细胞浸润相关。基因集富集分析(GSEA)表明关键基因涉及多种免疫反应和炎症相关信号通路。随后,基于鉴定出的关键基因,使用四种机器学习算法(逻辑回归、AdaBoost、KNN和XGBoost)构建诊断模型。然后选择性能最高的模型。最终,单细胞测序数据显示鉴定出的关键基因在各种免疫细胞中表达,而定量PCR(qPCR)测试证实它们在脓毒症组中的表达降低。