Suppr超能文献

结合机器学习与单细胞测序以识别脓毒症中的关键免疫基因。

Combining machine learning and single-cell sequencing to identify key immune genes in sepsis.

作者信息

Wang Hao, Len Linghan, Hu Li, Hu Yingchun

机构信息

Clinical Medical College, Southwest Medical University, Luzhou, People's Republic of China.

Department of Emergency Medicine, The Affiliated Hospital of Southwest Medical University, Luzhou, People's Republic of China.

出版信息

Sci Rep. 2025 Jan 10;15(1):1557. doi: 10.1038/s41598-025-85799-1.

Abstract

This research aimed to identify novel indicators for sepsis by analyzing RNA sequencing data from peripheral blood samples obtained from sepsis patients (n = 23) and healthy controls (n = 10). 5148 differentially expressed genes were identified using the DESeq2 technique and 5636 differentially expressed genes were identified by the limma method(|Log2 Fold Change|≥2, FDR < 0.05). A total of 1793 immune-related genes were identified from the ImmPort database, with 358 genes identified in both groups. Next, a Biological association network was constructed, and five key hub genes (CD4, HLA-DOB, HLA-DRB1, HLA-DRA, AHNAK) were identified using a combination of three topological analysis algorithms (MCC, Closeness, and MNC) and four machine learning algorithms (Random Forest, LASSO regression, SVM, and XGBoost). immune cell distribution showed that the key genes correlated with multiple immune cell infiltrations. Gene Set Enrichment Analysis (GSEA) revealed that the key genes involved multiple immune response and inflammation-related signaling pathways. Subsequently, diagnostic models were constructed using four machine learning algorithms (Logistic regression, AdaBoost, KNN, and XGBoost) based on the identified key genes. Models with the highest performance were then selected. Ultimately, single-cell sequencing data revealed that the identified key genes were expressed in various immune cells, while Quantitative PCR (qPCR) tests confirmed their reduced expression in the sepsis group.

摘要

本研究旨在通过分析脓毒症患者(n = 23)和健康对照者(n = 10)外周血样本的RNA测序数据,确定脓毒症的新指标。使用DESeq2技术鉴定出5148个差异表达基因,通过limma方法鉴定出5636个差异表达基因(|Log2倍数变化|≥2,FDR < 0.05)。从ImmPort数据库中总共鉴定出1793个免疫相关基因,两组中均鉴定出358个基因。接下来,构建了一个生物关联网络,并使用三种拓扑分析算法(MCC、紧密性和MNC)和四种机器学习算法(随机森林、LASSO回归、支持向量机和XGBoost)的组合,鉴定出五个关键枢纽基因(CD4、HLA - DOB、HLA - DRB1、HLA - DRA、AHNAK)。免疫细胞分布显示关键基因与多种免疫细胞浸润相关。基因集富集分析(GSEA)表明关键基因涉及多种免疫反应和炎症相关信号通路。随后,基于鉴定出的关键基因,使用四种机器学习算法(逻辑回归、AdaBoost、KNN和XGBoost)构建诊断模型。然后选择性能最高的模型。最终,单细胞测序数据显示鉴定出的关键基因在各种免疫细胞中表达,而定量PCR(qPCR)测试证实它们在脓毒症组中的表达降低。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/740e/11718265/f406b4a6067e/41598_2025_85799_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验