Zhu Yihao, Yang Xiaojing, Zu Yao
International Research Center for Marine Biosciences, Ministry of Science and Technology, Shanghai Ocean University, Shanghai, China.
Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources, Ministry of Education, Shanghai Ocean University, Shanghai, China.
Front Cell Dev Biol. 2022 Dec 5;10:1089915. doi: 10.3389/fcell.2022.1089915. eCollection 2022.
The etiologies and pathogenesis of dilated cardiomyopathy (DCM) with heart failure (HF) remain to be defined. Thus, exploring specific diagnosis biomarkers and mechanisms is urgently needed to improve this situation. In this study, three gene expression profiling datasets (GSE29819, GSE21610, GSE17800) and one single-cell RNA sequencing dataset (GSE95140) were obtained from the Gene Expression Omnibus (GEO) database. GSE29819 and GSE21610 were combined into the training group, while GSE17800 was the test group. We used the weighted gene co-expression network analysis (WGCNA) and identified fifteen driver genes highly associated with DCM with HF in the module. We performed the least absolute shrinkage and selection operator (LASSO) on the driver genes and then constructed five machine learning classifiers (random forest, gradient boosting machine, neural network, eXtreme gradient boosting, and support vector machine). Random forest was the best-performing classifier established on five Lasso-selected genes, which was utilized to select out NPPA, OMD, and PRELP for diagnosing DCM with HF. Moreover, we observed the up-regulation mRNA levels and robust diagnostic accuracies of NPPA, OMD, and PRELP in the training group and test group. Single-cell RNA-seq analysis further demonstrated their stable up-regulation expression patterns in various cardiomyocytes of DCM patients. Besides, through gene set enrichment analysis (GSEA), we found TGF-β signaling pathway, correlated with NPPA, OMD, and PRELP, was the underlying mechanism of DCM with HF. Overall, our study revealed NPPA, OMD, and PRELP serving as diagnostic biomarkers for DCM with HF, deepening the understanding of its pathogenesis.
扩张型心肌病(DCM)伴心力衰竭(HF)的病因和发病机制尚待明确。因此,迫切需要探索特异性诊断生物标志物和机制以改善这种状况。在本研究中,从基因表达综合数据库(GEO)中获取了三个基因表达谱数据集(GSE29819、GSE21610、GSE17800)和一个单细胞RNA测序数据集(GSE95140)。将GSE29819和GSE21610合并为训练组,而GSE17800作为测试组。我们使用加权基因共表达网络分析(WGCNA)并在该模块中鉴定出15个与DCM伴HF高度相关的驱动基因。我们对这些驱动基因进行了最小绝对收缩和选择算子(LASSO)分析,然后构建了五个机器学习分类器(随机森林、梯度提升机、神经网络、极限梯度提升和支持向量机)。随机森林是基于五个Lasso选择基因建立的表现最佳的分类器,用于筛选出NPPA、OMD和PRELP以诊断DCM伴HF。此外,我们观察到训练组和测试组中NPPA、OMD和PRELP的mRNA水平上调以及强大的诊断准确性。单细胞RNA测序分析进一步证明了它们在DCM患者各种心肌细胞中的稳定上调表达模式。此外,通过基因集富集分析(GSEA),我们发现与NPPA、OMD和PRELP相关的TGF-β信号通路是DCM伴HF的潜在机制。总体而言,我们的研究揭示了NPPA、OMD和PRELP可作为DCM伴HF的诊断生物标志物,加深了对其发病机制的理解。