Suppr超能文献

机器学习算法辅助识别与中风后抑郁相关的生物学特征。

Machine learning algorithms assisted identification of post-stroke depression associated biological features.

作者信息

Zhang Xintong, Wang Xiangyu, Wang Shuwei, Zhang Yingjie, Wang Zeyu, Yang Qingyan, Wang Song, Cao Risheng, Yu Binbin, Zheng Yu, Dang Yini

机构信息

Department of Rehabilitation Medicine, The First Affiliated Hospital of Nanjing Medical University, Nanjing, Jiangsu, China.

Department of Rehabilitation Medicine, The Affiliated Lianyungang Oriental Hospital of Kangda College of Nanjing Medical University, Lianyungang, Jiangsu, China.

出版信息

Front Neurosci. 2023 Mar 8;17:1146620. doi: 10.3389/fnins.2023.1146620. eCollection 2023.

Abstract

OBJECTIVES

Post-stroke depression (PSD) is a common and serious psychiatric complication which hinders functional recovery and social participation of stroke patients. Stroke is characterized by dynamic changes in metabolism and hemodynamics, however, there is still a lack of metabolism-associated effective and reliable diagnostic markers and therapeutic targets for PSD. Our study was dedicated to the discovery of metabolism related diagnostic and therapeutic biomarkers for PSD.

METHODS

Expression profiles of GSE140275, GSE122709, and GSE180470 were obtained from GEO database. Differentially expressed genes (DEGs) were detected in GSE140275 and GSE122709. Functional enrichment analysis was performed for DEGs in GSE140275. Weighted gene co-expression network analysis (WGCNA) was constructed in GSE122709 to identify key module genes. Moreover, correlation analysis was performed to obtain metabolism related genes. Interaction analysis of key module genes, metabolism related genes, and DEGs in GSE122709 was performed to obtain candidate hub genes. Two machine learning algorithms, least absolute shrinkage and selection operator (LASSO) and random forest, were used to identify signature genes. Expression of signature genes was validated in GSE140275, GSE122709, and GSE180470. Gene set enrichment analysis (GSEA) was applied on signature genes. Based on signature genes, a nomogram model was constructed in our PSD cohort (27 PSD patients vs. 54 controls). ROC curves were performed for the estimation of its diagnostic value. Finally, correlation analysis between expression of signature genes and several clinical traits was performed.

RESULTS

Functional enrichment analysis indicated that DEGs in GSE140275 enriched in metabolism pathway. A total of 8,188 metabolism associated genes were identified by correlation analysis. WGCNA analysis was constructed to obtain 3,471 key module genes. A total of 557 candidate hub genes were identified by interaction analysis. Furthermore, two signature genes (SDHD and FERMT3) were selected using LASSO and random forest analysis. GSEA analysis found that two signature genes had major roles in depression. Subsequently, PSD cohort was collected for constructing a PSD diagnosis. Nomogram model showed good reliability and validity. AUC values of receiver operating characteristic (ROC) curve of SDHD and FERMT3 were 0.896 and 0.964. ROC curves showed that two signature genes played a significant role in diagnosis of PSD. Correlation analysis found that SDHD ( = 0.653, < 0.001) and FERM3 ( = 0.728, < 0.001) were positively related to the Hamilton Depression Rating Scale 17-item (HAMD) score.

CONCLUSION

A total of 557 metabolism associated candidate hub genes were obtained by interaction with DEGs in GSE122709, key modules genes, and metabolism related genes. Based on machine learning algorithms, two signature genes (SDHD and FERMT3) were identified, they were proved to be valuable therapeutic and diagnostic biomarkers for PSD. Early diagnosis and prevention of PSD were made possible by our findings.

摘要

目的

卒中后抑郁(PSD)是一种常见且严重的精神并发症,阻碍了卒中患者的功能恢复和社会参与。卒中的特点是代谢和血流动力学的动态变化,然而,PSD仍然缺乏与代谢相关的有效且可靠的诊断标志物和治疗靶点。我们的研究致力于发现PSD与代谢相关的诊断和治疗生物标志物。

方法

从基因表达综合数据库(GEO)获取GSE140275、GSE122709和GSE180470的表达谱。在GSE140275和GSE122709中检测差异表达基因(DEGs)。对GSE140275中的DEGs进行功能富集分析。在GSE122709中构建加权基因共表达网络分析(WGCNA)以识别关键模块基因。此外,进行相关性分析以获得与代谢相关的基因。对GSE122709中的关键模块基因、与代谢相关的基因和DEGs进行相互作用分析以获得候选枢纽基因。使用两种机器学习算法,即最小绝对收缩和选择算子(LASSO)和随机森林,来识别特征基因。在GSE140275、GSE122709和GSE180470中验证特征基因的表达。对特征基因应用基因集富集分析(GSEA)。基于特征基因,在我们的PSD队列(27例PSD患者与54例对照)中构建列线图模型。绘制ROC曲线以评估其诊断价值。最后,进行特征基因表达与几个临床特征之间的相关性分析。

结果

功能富集分析表明,GSE140275中的DEGs富集于代谢途径。通过相关性分析共鉴定出8188个与代谢相关的基因。构建WGCNA分析以获得3471个关键模块基因。通过相互作用分析共鉴定出557个候选枢纽基因。此外,使用LASSO和随机森林分析选择了两个特征基因(SDHD和FERMT3)。GSEA分析发现这两个特征基因在抑郁症中起主要作用。随后,收集PSD队列以构建PSD诊断。列线图模型显示出良好的可靠性和有效性。SDHD和FERMT3的受试者工作特征(ROC)曲线的AUC值分别为0.896和0.964。ROC曲线表明这两个特征基因在PSD诊断中起重要作用。相关性分析发现,SDHD(r = 0.653,P < 0.001)和FERM3(r = 0.728,P < 0.001)与汉密尔顿抑郁量表17项(HAMD)评分呈正相关。

结论

通过与GSE122709中的DEGs、关键模块基因和与代谢相关的基因相互作用,共获得557个与代谢相关的候选枢纽基因。基于机器学习算法,鉴定出两个特征基因(SDHD和FERMT3),它们被证明是PSD有价值的治疗和诊断生物标志物。我们的研究结果使PSD的早期诊断和预防成为可能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b95/10030717/177223e43da9/fnins-17-1146620-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验