Department of Pulmonary and Critical Care Medicine, Zhongshan Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, China.
National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China.
J Cell Mol Med. 2024 Sep;28(18):e70107. doi: 10.1111/jcmm.70107.
This retrospective transcriptomic study leveraged bioinformatics and machine learning algorithms to identify novel gene biomarkers and explore immune cell infiltration profiles associated with chronic obstructive pulmonary disease (COPD). Utilizing an integrated analysis of metadata encompassing six gene expression omnibus (GEO) microarray datasets, 987 differentially expressed genes were identified. Further gene ontology and pathway enrichment analyses revealed the enrichment of these genes across various biological processes and pathways. Moreover, a systematic integration of two machine learning algorithms along with pathway-gene correlations identified six candidate biomarkers, which were validated in a separate cohort comprising six additional microarray datasets, ultimately identifying ADD3 and GNAS as diagnostic biomarkers for COPD. Subsequently, the diagnostic efficacy of ADD3 and GNAS was assessed, and the impact of their expression levels on overall survival was further evaluated and quantified in the validation cohort. Examination of immune cell subtype infiltration found increased proportions of cytotoxic CD8 T cells, resting and activated NK cells, along with decreased M0 and M2 macrophages, in COPD versus control samples. Correlation analyses also uncovered significant associations between ADD3 and GNAS expression and infiltration of various immune cell types. In conclusion, this study elucidates crucial COPD diagnostic biomarkers and immune cell profiles which may illuminate the immunopathological drivers of COPD progression, representing personalized therapeutic targets warranting further investigation.
本回顾性转录组学研究利用生物信息学和机器学习算法,鉴定了与慢性阻塞性肺疾病(COPD)相关的新型基因生物标志物和免疫细胞浸润特征。通过对包含六个基因表达综合(GEO)微阵列数据集的元数据进行综合分析,鉴定了 987 个差异表达基因。进一步的基因本体论和途径富集分析显示,这些基因在各种生物学过程和途径中均有富集。此外,两种机器学习算法与途径-基因相关性的系统整合,确定了六个候选生物标志物,这些标志物在包含另外六个微阵列数据集的独立队列中得到了验证,最终确定 ADD3 和 GNAS 是 COPD 的诊断生物标志物。随后,评估了 ADD3 和 GNAS 的诊断效果,并在验证队列中进一步评估和量化了其表达水平对总生存期的影响。对免疫细胞亚型浸润的检查发现,与对照样本相比,COPD 样本中细胞毒性 CD8 T 细胞、静止和激活的 NK 细胞的比例增加,而 M0 和 M2 巨噬细胞的比例减少。相关性分析还揭示了 ADD3 和 GNAS 表达与各种免疫细胞类型浸润之间的显著关联。总之,本研究阐明了 COPD 的关键诊断生物标志物和免疫细胞特征,这可能阐明了 COPD 进展的免疫病理驱动因素,代表了值得进一步研究的个性化治疗靶点。