Suppr超能文献

加权基因共表达网络分析(WGCNA)与机器学习的综合分析确定了扩张型心肌病伴心力衰竭的诊断生物标志物。

Integrated analysis of WGCNA and machine learning identified diagnostic biomarkers in dilated cardiomyopathy with heart failure.

作者信息

Zhu Yihao, Yang Xiaojing, Zu Yao

机构信息

International Research Center for Marine Biosciences, Ministry of Science and Technology, Shanghai Ocean University, Shanghai, China.

Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources, Ministry of Education, Shanghai Ocean University, Shanghai, China.

出版信息

Front Cell Dev Biol. 2022 Dec 5;10:1089915. doi: 10.3389/fcell.2022.1089915. eCollection 2022.

Abstract

The etiologies and pathogenesis of dilated cardiomyopathy (DCM) with heart failure (HF) remain to be defined. Thus, exploring specific diagnosis biomarkers and mechanisms is urgently needed to improve this situation. In this study, three gene expression profiling datasets (GSE29819, GSE21610, GSE17800) and one single-cell RNA sequencing dataset (GSE95140) were obtained from the Gene Expression Omnibus (GEO) database. GSE29819 and GSE21610 were combined into the training group, while GSE17800 was the test group. We used the weighted gene co-expression network analysis (WGCNA) and identified fifteen driver genes highly associated with DCM with HF in the module. We performed the least absolute shrinkage and selection operator (LASSO) on the driver genes and then constructed five machine learning classifiers (random forest, gradient boosting machine, neural network, eXtreme gradient boosting, and support vector machine). Random forest was the best-performing classifier established on five Lasso-selected genes, which was utilized to select out NPPA, OMD, and PRELP for diagnosing DCM with HF. Moreover, we observed the up-regulation mRNA levels and robust diagnostic accuracies of NPPA, OMD, and PRELP in the training group and test group. Single-cell RNA-seq analysis further demonstrated their stable up-regulation expression patterns in various cardiomyocytes of DCM patients. Besides, through gene set enrichment analysis (GSEA), we found TGF-β signaling pathway, correlated with NPPA, OMD, and PRELP, was the underlying mechanism of DCM with HF. Overall, our study revealed NPPA, OMD, and PRELP serving as diagnostic biomarkers for DCM with HF, deepening the understanding of its pathogenesis.

摘要

扩张型心肌病(DCM)伴心力衰竭(HF)的病因和发病机制尚待明确。因此,迫切需要探索特异性诊断生物标志物和机制以改善这种状况。在本研究中,从基因表达综合数据库(GEO)中获取了三个基因表达谱数据集(GSE29819、GSE21610、GSE17800)和一个单细胞RNA测序数据集(GSE95140)。将GSE29819和GSE21610合并为训练组,而GSE17800作为测试组。我们使用加权基因共表达网络分析(WGCNA)并在该模块中鉴定出15个与DCM伴HF高度相关的驱动基因。我们对这些驱动基因进行了最小绝对收缩和选择算子(LASSO)分析,然后构建了五个机器学习分类器(随机森林、梯度提升机、神经网络、极限梯度提升和支持向量机)。随机森林是基于五个Lasso选择基因建立的表现最佳的分类器,用于筛选出NPPA、OMD和PRELP以诊断DCM伴HF。此外,我们观察到训练组和测试组中NPPA、OMD和PRELP的mRNA水平上调以及强大的诊断准确性。单细胞RNA测序分析进一步证明了它们在DCM患者各种心肌细胞中的稳定上调表达模式。此外,通过基因集富集分析(GSEA),我们发现与NPPA、OMD和PRELP相关的TGF-β信号通路是DCM伴HF的潜在机制。总体而言,我们的研究揭示了NPPA、OMD和PRELP可作为DCM伴HF的诊断生物标志物,加深了对其发病机制的理解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a929/9760806/27c081ded0be/fcell-10-1089915-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验