Huang Ke-Ke, Zheng Hui-Lei, Li Shuo, Zeng Zhi-Yu
Department of Cardiology, Institute of Cardiovascular Diseases, the First Affiliated Hospital, Guangxi Medical University, Nanning, China.
Department of Health Management, the First Affiliated Hospital, Guangxi Medical University, Nanning, China.
J Thorac Dis. 2022 Jul;14(7):2621-2634. doi: 10.21037/jtd-22-632.
Coronary artery disease (CAD) is a multifactorial disease and its pathogenesis remains unclear. We aimed to explore the optimal feature genes (OFGs) for CAD and to investigate the function of immune cell infiltration of CAD. It will be helpful for better understanding of the pathogenesis and the development of genetic prediction of CAD.
Datasets related to CAD were obtained from the Gene Expression Omnibus (GEO) database. Cases from the datasets met diagnostic criteria including clinical symptoms, electrocardiographic (ECG) and angiographic evidence. We identified differentially expressed genes (DEGs) and conducted functional enrichment analysis. OFGs were obtained from the least absolute shrinkage and selection operator (LASSO) algorithm, support vector machine recursive feature elimination (SVM-RFE) algorithm, and random forest (RF) algorithm. CIBERSORT was used to compare immune infiltration between CAD patients and normal controls, and the correlation between OFGs and immune cells was analyzed.
DEGs were involved in the interleukin (IL)-17 signaling pathway, nuclear factor (NF)-kappa B signaling pathway, and tumor necrosis factor (TNF) signaling pathway. Gene Ontology (GO) analysis revealed DEGs were enriched in lipopolysaccharide (LPS), tertiary granule, and pattern recognition receptor activity. Disease Ontology (DO) analysis suggested DEGs were enriched in lung disease, arteriosclerotic cardiovascular disease (CVD). Matrix metalloproteinase 9 (MMP9), Pellino E3 ubiquitin protein ligase 1 (PELI1), thrombomodulin (THBD), and zinc finger protein 36 (ZFP36) were screened by the intersection of OFGs obtained from LASSO, SVM-REF, and RF algorithms. CAD patients had a lower proportion of memory B cells (P=0.019), CD8 T cells (P<0.001), resting memory CD4 T cells (P<0.001), regulatory T cells (P=0.028), and gamma delta T cells (P<0.001) than normal controls, while the proportion of activated memory CD4 T cells (P=0.014), resting natural killer (NK) cells (P<0.001), monocytes (P<0.001), M0 macrophages (P=0.023), activated mast cells (P<0.001), and neutrophils (P<0.001) in CAD patients were higher than normal controls. MMP9, PELI1, THBD, and ZFP36 were correlated with immune cells.
MMP9, PELI1, THBD, and ZFP36 may be predicted biomarkers for CAD. The OFGs and association between OFGs and immune infiltration may provide potential biomarkers for CAD prediction along with the better assessment of the disease.
冠状动脉疾病(CAD)是一种多因素疾病,其发病机制尚不清楚。我们旨在探索CAD的最佳特征基因(OFG),并研究CAD免疫细胞浸润的功能。这将有助于更好地理解CAD的发病机制以及遗传预测的发展。
从基因表达综合数据库(GEO)获取与CAD相关的数据集。数据集中的病例符合诊断标准,包括临床症状、心电图(ECG)和血管造影证据。我们鉴定了差异表达基因(DEG)并进行了功能富集分析。通过最小绝对收缩和选择算子(LASSO)算法、支持向量机递归特征消除(SVM-RFE)算法和随机森林(RF)算法获得OFG。使用CIBERSORT比较CAD患者和正常对照之间的免疫浸润,并分析OFG与免疫细胞之间的相关性。
DEG参与白细胞介素(IL)-17信号通路、核因子(NF)-κB信号通路和肿瘤坏死因子(TNF)信号通路。基因本体(GO)分析显示DEG在脂多糖(LPS)、三级颗粒和模式识别受体活性中富集。疾病本体(DO)分析表明DEG在肺部疾病、动脉粥样硬化性心血管疾病(CVD)中富集。通过LASSO、SVM-REF和RF算法获得的OFG的交集筛选出基质金属蛋白酶9(MMP9)、佩利诺E3泛素蛋白连接酶1(PELI1)、血栓调节蛋白(THBD)和锌指蛋白36(ZFP36)。与正常对照相比,CAD患者的记忆B细胞(P=0.019)、CD8 T细胞(P<0.001)、静息记忆CD4 T细胞(P<0.001)、调节性T细胞(P=0.028)和γδT细胞(P<0.001)比例较低,而CAD患者中活化记忆CD4 T细胞(P=0.014)、静息自然杀伤(NK)细胞(P<0.001)、单核细胞(P<0.001)、M0巨噬细胞(P=0.023)、活化肥大细胞(P<0.001)和中性粒细胞(P<0.001)的比例高于正常对照。MMP9、PELI1、THBD和ZFP36与免疫细胞相关。
MMP9、PELI1、THBD和ZFP36可能是CAD的预测生物标志物。OFG以及OFG与免疫浸润之间的关联可能为CAD预测提供潜在的生物标志物,并更好地评估该疾病。