Department of Epidemiology and Health Statistics, School of Public Health, Chongqing Medical University, Yixue Road, Chongqing 400016, China.
Genes (Basel). 2022 Sep 8;13(9):1602. doi: 10.3390/genes13091602.
Although many biomarkers associated with coronavirus disease 2019 (COVID-19) were found, a novel signature relevant to immune cells has not been developed. In this work, the "CIBERSORT" algorithm was used to assess the fraction of immune infiltrating cells in GSE152641 and GSE171110. Key modules associated with important immune cells were selected by the "WGCNA" package. The "GO" enrichment analysis was used to reveal the biological function associated with COVID-19. The "Boruta" algorithm was used to screen candidate genes, and the "LASSO" algorithm was used for collinearity reduction. A novel gene signature was developed based on multivariate logistic regression analysis. Subsequently, M0 macrophages (PR = 0.948 in GSE152641 and PR = 0.981 in GSE171110) and neutrophils (PR = 0.892 in GSE152641 and PR = 0.960 in GSE171110) were considered as important immune cells. Forty-three intersected genes from two modules were selected, which mainly participated in some immune-related activities. Finally, a three-gene signature comprising CLEC4D, DUSP13, and UNC5A that can accurately distinguish COVID-19 patients and healthy controls in three datasets was constructed. The ROC was 0.974 in the training set, 0.946 in the internal test set, and 0.709 in the external test set. In conclusion, we constructed a three-gene signature to identify COVID-19, and CLEC4D, DUSP13, and UNC5A may be potential biomarkers for COVID-19 patients.
尽管已经发现了许多与 2019 年冠状病毒病(COVID-19)相关的生物标志物,但尚未开发出与免疫细胞相关的新型特征。在这项工作中,使用“CIBERSORT”算法评估了 GSE152641 和 GSE171110 中免疫浸润细胞的分数。使用“WGCNA”软件包选择与重要免疫细胞相关的关键模块。使用“GO”富集分析揭示与 COVID-19 相关的生物学功能。使用“Boruta”算法筛选候选基因,并使用“LASSO”算法进行共线性减少。基于多元逻辑回归分析开发了一种新的基因特征。随后,M0 巨噬细胞(在 GSE152641 中的 PR = 0.948,在 GSE171110 中的 PR = 0.981)和中性粒细胞(在 GSE152641 中的 PR = 0.892,在 GSE171110 中的 PR = 0.960)被认为是重要的免疫细胞。从两个模块中选择了 43 个相交基因,这些基因主要参与了一些免疫相关的活动。最后,构建了一个包含 CLEC4D、DUSP13 和 UNC5A 的三基因特征,该特征可以在三个数据集准确区分 COVID-19 患者和健康对照者。在训练集中 ROC 为 0.974,内部测试集为 0.946,外部测试集为 0.709。总之,我们构建了一个三基因特征来识别 COVID-19,CLEC4D、DUSP13 和 UNC5A 可能是 COVID-19 患者的潜在生物标志物。