Chen Juan Juan, Lu Zhang Ze, Jing Yu Xin, Nong Xing Mei, Qin Yi, Huang Jin Yang, Lin Na, Wei Jie
Department of Pediatrics, Affiliated Hospital of Youjiang Medical University for Nationalities, Baise, Guangxi, China.
Department of Urology, Baise People's Hospital, Baise, Guangxi, China.
Front Immunol. 2025 Jul 3;16:1609183. doi: 10.3389/fimmu.2025.1609183. eCollection 2025.
Respiratory syncytial virus (RSV) is a leading cause of severe lower respiratory infections in children, yet biomarkers for assessing disease severity remain limited. Herein, we investigated the differential expression biomarkers between RSV infected hospitalized patients, healthy groups and RSV infected outpatients.
Two publicly available transcriptomic datasets (GSE77087 and GSE188427) were retrieved from the Gene Expression Omnibus (GEO) database. The GSE77087 dataset comprised peripheral blood samples from 81 children with confirmed RSV infection (61 hospitalized and 20 outpatient) and 23 healthy controls. The GSE188427 dataset included 147 RSV-infected children (113 hospitalized and 34 outpatient) and 51 healthy controls. Genes with |log2 fold change (logFC)| > 0 and false discovery rate (FDR) < 0.05 were considered differentially expressed. Overlapping DEGs between the two datasets were identified using the VennDiagram package. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were conducted on the intersecting DEGs via the clusterProfiler package, with terms deemed significant at FDR < 0.05.The CIBERSORT algorithm was applied to estimate the relative proportions of 22 immune cell types in 228 RSV-infected samples. Potential drug interactions for hug genes were predicted using the Drug-Gene Interaction Database (DGIdb). Competing endogenous RNA (ceRNA) networks were constructed using the SpongeScan database to identify lncRNAs interacting with the target miRNAs. Networks were visualized using Cytoscape (v3.10.1).Finally, Machine Learning-Based Biomarker Selection and hub gene identification and validation.
Differential gene expression analysis revealed 81 overlapping genes between hospitalized and outpatient RSV-infected children. Machine learning models, particularly SVM (area under the curve, AUC = 0.950), prioritized CD79A and GADD45A as key predictors of hospitalization. CD79A was significantly downregulated in severe cases, correlating with impaired B-cell responses and cytotoxic immunity, while GADD45A, upregulated in severe infections, linked to oxidative stress and neutrophil-driven inflammation. Immune cell profiling highlighted distinct infiltration patterns, with severe cases showing elevated naïve B cells and M0 macrophages versus activated NK cells and M1 macrophages in mild cases. Clinical validation in 92 children confirmed CD79A suppression and GADD45A elevation in severe RSV (p < 0.001), aligning with younger age, lower weight, and respiratory distress. Functional enrichment implicated endoplasmic reticulum stress and neutrophil extracellular traps in disease progression. Drug-target predictions and ceRNA networks further revealed therapeutic potential.
These findings establish CD79A and GADD45A as clinically actionable biomarkers for RSV severity, offering insights into immune dysregulation and guiding personalized management strategies.
呼吸道合胞病毒(RSV)是儿童严重下呼吸道感染的主要病因,但用于评估疾病严重程度的生物标志物仍然有限。在此,我们研究了RSV感染的住院患者、健康组和RSV感染的门诊患者之间的差异表达生物标志物。
从基因表达综合数据库(GEO)中检索了两个公开可用的转录组数据集(GSE77087和GSE188427)。GSE77087数据集包括来自81名确诊RSV感染儿童(61名住院患者和20名门诊患者)的外周血样本以及23名健康对照。GSE188427数据集包括147名RSV感染儿童(113名住院患者和34名门诊患者)和51名健康对照。|log2倍数变化(logFC)|>0且错误发现率(FDR)<0.05的基因被认为是差异表达基因。使用VennDiagram软件包识别两个数据集之间的重叠差异表达基因。通过clusterProfiler软件包对相交的差异表达基因进行基因本体(GO)和京都基因与基因组百科全书(KEGG)通路富集分析,FDR<0.05的术语被视为显著。应用CIBERSORT算法估计228个RSV感染样本中22种免疫细胞类型的相对比例。使用药物-基因相互作用数据库(DGIdb)预测hub基因的潜在药物相互作用。使用SpongeScan数据库构建竞争性内源性RNA(ceRNA)网络,以识别与靶标miRNA相互作用的lncRNA。使用Cytoscape(v3.10.1)可视化网络。最后,基于机器学习的生物标志物选择以及hub基因的鉴定和验证。
差异基因表达分析揭示了住院和门诊RSV感染儿童之间有81个重叠基因。机器学习模型,特别是支持向量机(曲线下面积,AUC = 0.950),将CD79A和GADD45A列为住院的关键预测因子。CD79A在严重病例中显著下调,与B细胞反应受损和细胞毒性免疫相关,而GADD45A在严重感染中上调,与氧化应激和中性粒细胞驱动的炎症相关。免疫细胞分析突出了不同的浸润模式,严重病例中幼稚B细胞和M0巨噬细胞升高,而轻度病例中活化的NK细胞和M1巨噬细胞升高。对92名儿童的临床验证证实,严重RSV患者中CD79A受到抑制,GADD45A升高(p<0.001),这与年龄较小、体重较低和呼吸窘迫一致。功能富集表明内质网应激和中性粒细胞胞外陷阱参与疾病进展。药物靶点预测和ceRNA网络进一步揭示了治疗潜力。
这些发现确立了CD79A和GADD45A作为RSV严重程度的临床可操作生物标志物,为免疫失调提供了见解,并指导个性化管理策略。