Zhang Yufeng, Wang Cong, Xia Qingqing, Jiang Weilong, Zhang Huizhe, Amiri-Ardekani Ehsan, Hua Haibing, Cheng Yi
Department of Pulmonary and Critical Care Medicine, Jiangyin Hospital of Traditional Chinese Medicine, Jiangyin Hospital Affiliated to Nanjing University of Chinese Medicine, Jiangyin, Jiangsu, China.
Department of Respiratory Medicine, Yancheng Hospital of Traditional Chinese Medicine, Yancheng Hospital Affiliated to Nanjing University of Chinese Medicine, Yancheng, Jiangsu, China.
Front Med (Lausanne). 2023 Feb 13;10:1001813. doi: 10.3389/fmed.2023.1001813. eCollection 2023.
This study aimed to identify candidate gene biomarkers associated with immune infiltration in idiopathic pulmonary fibrosis (IPF) based on machine learning algorithms.
Microarray datasets of IPF were extracted from the Gene Expression Omnibus (GEO) database to screen for differentially expressed genes (DEGs). The DEGs were subjected to enrichment analysis, and two machine learning algorithms were used to identify candidate genes associated with IPF. These genes were verified in a validation cohort from the GEO database. Receiver operating characteristic (ROC) curves were plotted to assess the predictive value of the IPF-associated genes. The cell-type identification by estimating relative subsets of RNA transcripts (CIBERSORT) algorithm was used to evaluate the proportion of immune cells in IPF and normal tissues. Additionally, the correlation between the expression of IPF-associated genes and the infiltration levels of immune cells was examined.
A total of 302 upregulated and 192 downregulated genes were identified. Functional annotation, pathway enrichment, Disease Ontology and gene set enrichment analyses revealed that the DEGs were related to the extracellular matrix and immune responses. COL3A1, CDH3, CEBPD, and GPIHBP1 were identified as candidate biomarkers using machine learning algorithms, and their predictive value was verified in a validation cohort. Additionally, ROC analysis revealed that the four genes had high predictive accuracy. The infiltration levels of plasma cells, M0 macrophages and resting dendritic cells were higher and those of resting natural killer (NK) cells, M1 macrophages and eosinophils were lower in the lung tissues of patients with IPF than in those of healthy individuals. The expression of the abovementioned genes was correlated with the infiltration levels of plasma cells, M0 macrophages and eosinophils.
COL3A1, CDH3, CEBPD, and GPIHBP1 are candidate biomarkers of IPF. Plasma cells, M0 macrophages and eosinophils may be involved in the development of IPF and may serve as immunotherapeutic targets in IPF.
本研究旨在基于机器学习算法识别与特发性肺纤维化(IPF)免疫浸润相关的候选基因生物标志物。
从基因表达综合数据库(GEO)中提取IPF的微阵列数据集,以筛选差异表达基因(DEGs)。对DEGs进行富集分析,并使用两种机器学习算法识别与IPF相关的候选基因。这些基因在来自GEO数据库的验证队列中得到验证。绘制受试者工作特征(ROC)曲线以评估IPF相关基因的预测价值。使用通过估计RNA转录本相对子集进行细胞类型鉴定(CIBERSORT)算法评估IPF和正常组织中免疫细胞的比例。此外,检测IPF相关基因的表达与免疫细胞浸润水平之间的相关性。
共鉴定出302个上调基因和192个下调基因。功能注释、通路富集、疾病本体和基因集富集分析表明,DEGs与细胞外基质和免疫反应相关。使用机器学习算法将COL3A1、CDH3、CEBPD和GPIHBP1鉴定为候选生物标志物,并在验证队列中验证了它们的预测价值。此外,ROC分析表明这四个基因具有较高的预测准确性。与健康个体相比,IPF患者肺组织中浆细胞、M0巨噬细胞和静息树突状细胞的浸润水平较高,而静息自然杀伤(NK)细胞、M1巨噬细胞和嗜酸性粒细胞的浸润水平较低。上述基因的表达与浆细胞、M0巨噬细胞和嗜酸性粒细胞的浸润水平相关。
COL3A1、CDH3、CEBPD和GPIHBP1是IPF的候选生物标志物。浆细胞、M0巨噬细胞和嗜酸性粒细胞可能参与IPF的发生发展,并可能作为IPF的免疫治疗靶点。