Suppr超能文献

基于机器学习对与多生命阶段脓毒症人群细胞死亡和免疫抑制相关生物标志物的筛选。

Machine learning based screening of biomarkers associated with cell death and immunosuppression of multiple life stages sepsis populations.

作者信息

Yang Jie, Ou Fanyan, Li Binbin, Zeng Lixiong, Chen Qiuli, Gan Houyu, Yu Jianing, Guo Qian, Feng Jihua, Zhang Jianfeng

机构信息

Clinical Medical Research Center, The Second Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China.

Department of Clinical Pathology, The Second Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China.

出版信息

Sci Rep. 2025 Aug 19;15(1):30302. doi: 10.1038/s41598-025-14600-0.

Abstract

Sepsis is a condition resulting from the uncontrolled immune response to infection, leading to widespread inflammatory damage and potentially fatal organ dysfunction. Currently, there is a lack of specific prevention and treatment strategies for sepsis across different age groups. Programmed Cell Death (PCD) can regulate the enrichment of effector immune cells or regulatory immune cells, providing a new perspective for immunotherapy. Within the framework of computational biology and machine learning strategies, and against the backdrop of global multicenter sepsis cohort data, this study aims to deeply mine and screen specific biomarkers related to the immune microenvironment and programmed cell death in populations across different life stages (neonates, children, and adults). This will provide foundational data for precision treatment and drug development in artificial intelligence-assisted sepsis diagnosis and treatment management. Gene expression data from sepsis patients across global multicenter populations, including China, Europe, and the United States, were obtained from the Gene Expression Omnibus (GEO) database, and differentially expressed genes (DEGs) were identified. A literature review was conducted to obtain 18 PCD-related genes, which were intersected with DEGs to identify DEGs associated with specific types of PCD. Nine machine learning algorithms (Logistic Regression LR, Decision Tree DT, Gradient Boosting Machine GBM, K-Nearest Neighbors KNN, LASSO, Principal Component Analysis PCA, Random Forest RF, Support Vector Machine SVM, and XGBoost) were applied to training and testing datasets with 10-fold cross-validation to select three optimized algorithm models. The SHAP algorithm was further used to quantify the contribution of each gene based on cell death features to the prediction of sepsis. Key PCD patterns were identified based on model evaluation metrics (Accuracy, Precision, Recall, F1 score, and Receiver Operating Characteristic Curve ROC), and their associated DEGs were obtained through intersection, followed by immune-related analysis of DEGs. The study included a total of 1507 sepsis cases and 484 controls globally, with 90 neonatal cases and 95 controls, 527 children cases and 101 controls, and 890 adult cases and 288 controls. The best model for predicting sepsis across different populations was GBM.The key PCD patterns selected by machine learning for different age groups were Pyroptosis (neonates), Ferroptosis (children), and Autophagy (adults). (1) In neonatal sepsis, the models constructed by GBM, XGBoost, and RF algorithms performed the best, and identified 5 key DEGs associated with Pyroptosis (CHMP7, NLRC4, AIM2, GZMB, PRKACA), with NLRC4 showing the best predictive ability (AUC = 0.902, P < 0.05), significantly positively correlated with neutrophils and negatively correlated with CD8 + T cells. (2) In the children sepsis population, models constructed using the Gradient Boosting Machine (GBM), Support Vector Machine (SVM), and Least Absolute Shrinkage and Selection Operator (LASSO) algorithms demonstrated the best performance. Six key DEGs associated with Ferroptosis were identified (AKR1C3, GCLM, PEBP1, CARS, MAP1LC3B, SCL11A2), among which MAP1LC3B, playing a role in mitochondrial reactive oxygen species energy metabolism, showed the strongest predictive ability (AUC = 0.883, P < 0.05). It was significantly positively correlated with M0-type macrophages and significantly negatively correlated with activated CD4 + memory T cells. (3) In the adult sepsis population, models constructed using GBM, SVM, and LASSO algorithms showed the best performance. Three key DEGs associated with Autophagy were identified (TSPO, HTRA2, USP10), with TSPO, which mediates oxidative stress regulation, iron homeostasis, and cholesterol transport, showing the strongest predictive ability (AUC = 0.825, P < 0.05). It was significantly positively correlated with M1-type macrophages and significantly negatively correlated with CD8 + T cells. This study, through the integrated application of computational biology and machine learning algorithms, discovered biomarkers of PCD patterns that affect cytokine storm-mediated inflammation and immunosuppressive effects in sepsis populations across different age groups (neonates, children, and adults). These findings have specific clinical application and drug development value, providing a scientific basis for the global application of artificial intelligence-assisted sepsis diagnosis and treatment management.

摘要

脓毒症是一种因对感染的免疫反应失控而导致的病症,会引发广泛的炎症损伤以及潜在致命的器官功能障碍。目前,针对不同年龄组的脓毒症缺乏特异性的预防和治疗策略。程序性细胞死亡(PCD)可调节效应免疫细胞或调节性免疫细胞的富集,为免疫治疗提供了新视角。在计算生物学和机器学习策略的框架下,以全球多中心脓毒症队列数据为背景,本研究旨在深入挖掘和筛选不同生命阶段(新生儿、儿童和成人)人群中与免疫微环境和程序性细胞死亡相关的特异性生物标志物。这将为人工智能辅助脓毒症诊断和治疗管理中的精准治疗及药物研发提供基础数据。从基因表达综合数据库(GEO)获取了来自包括中国、欧洲和美国在内的全球多中心人群的脓毒症患者基因表达数据,并鉴定出差异表达基因(DEGs)。进行文献综述以获取18个与PCD相关的基因,将其与DEGs进行交叉分析,以鉴定与特定类型PCD相关的DEGs。应用九种机器学习算法(逻辑回归LR、决策树DT、梯度提升机GBM、K近邻KNN、套索回归、主成分分析PCA、随机森林RF、支持向量机SVM和XGBoost)对训练集和测试集进行10折交叉验证,以选择三种优化算法模型。进一步使用SHAP算法基于细胞死亡特征量化每个基因对脓毒症预测的贡献。基于模型评估指标(准确率、精确率、召回率、F1分数和受试者工作特征曲线ROC)确定关键的PCD模式,并通过交叉分析获得其相关的DEGs,随后对DEGs进行免疫相关分析。该研究全球共纳入1507例脓毒症病例和484例对照者,其中新生儿病例90例、对照者95例,儿童病例527例、对照者101例,成人病例890例、对照者288例。预测不同人群脓毒症的最佳模型是GBM。机器学习为不同年龄组选择的关键PCD模式分别是焦亡(新生儿)、铁死亡(儿童)和自噬(成人)。(1)在新生儿脓毒症中,由GBM、XGBoost和RF算法构建的模型表现最佳,鉴定出5个与焦亡相关的关键DEGs(CHMP7、NLRC4、AIM2、GZMB、PRKACA);其中NLRC4的预测能力最佳(AUC = 0.902,P < 0.05),与中性粒细胞显著正相关,与CD8 + T细胞显著负相关。(2)在儿童脓毒症人群中,使用梯度提升机(GBM)、支持向量机(SVM)和最小绝对收缩和选择算子(LASSO)算法构建的模型表现最佳。鉴定出6个与铁死亡相关的关键DEGs(AKR1C3、GCLM、PEBP1、CARS、MAP1LC3B、SCL11A2);其中在线粒体活性氧能量代谢中起作用的MAP1LC3B预测能力最强(AUC = 0.883,P < 0.05)。它与M0型巨噬细胞显著正相关,与活化的CD4 + 记忆T细胞显著负相关。(3)在成人脓毒症人群中,使用GBM、SVM和LASSO算法构建的模型表现最佳。鉴定出3个与自噬相关的关键DEGs(TSPO、HTRA2、USP10);其中介导氧化应激调节、铁稳态和胆固醇转运的TSPO预测能力最强(AUC = 0.825,P < 0.05)。它与M1型巨噬细胞显著正相关,与CD8 + T细胞显著负相关。本研究通过计算生物学和机器学习算法的综合应用,发现了影响不同年龄组(新生儿、儿童和成人)脓毒症人群中细胞因子风暴介导的炎症和免疫抑制作用的PCD模式生物标志物。这些发现具有特定的临床应用和药物研发价值,为人工智能辅助脓毒症诊断和治疗管理的全球应用提供了科学依据。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验