基于机器学习和分子对接模拟的缺血性脑卒中诊断和治疗的综合分析。
Integrative Analysis of Machine Learning and Molecule Docking Simulations for Ischemic Stroke Diagnosis and Therapy.
机构信息
Department of Medical Cell Biology and Genetics, Guangdong Key Laboratory of Genomic Stability and Disease Prevention, Shenzhen Key Laboratory of Anti-Aging and Regenerative Medicine, and Shenzhen Engineering Laboratory of Regenerative Technologies for Orthopaedic Diseases, Health Sciences Center, Shenzhen University, Shenzhen 518060, China.
Brain Research Centre, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China.
出版信息
Molecules. 2023 Nov 22;28(23):7704. doi: 10.3390/molecules28237704.
Due to the narrow therapeutic window and high mortality of ischemic stroke, it is of great significance to investigate its diagnosis and therapy. We employed weighted gene coexpression network analysis (WGCNA) to ascertain gene modules related to stroke and used the maSigPro R package to seek the time-dependent genes in the progression of stroke. Three machine learning algorithms were further employed to identify the feature genes of stroke. A nomogram model was built and applied to evaluate the stroke patients. We analyzed single-cell RNA sequencing (scRNA-seq) data to discern microglia subclusters in ischemic stroke. The RNA velocity, pseudo time, and gene set enrichment analysis (GSEA) were performed to investigate the relationship of microglia subclusters. Connectivity map (CMap) analysis and molecule docking were used to screen a therapeutic agent for stroke. A nomogram model based on the feature genes showed a clinical net benefit and enabled an accurate evaluation of stroke patients. The RNA velocity and pseudo time analysis showed that microglia subcluster 0 would develop toward subcluster 2 within 24 h from stroke onset. The GSEA showed that the function of microglia subcluster 0 was opposite to that of subcluster 2. AZ_628, which screened from CMap analysis, was found to have lower binding energy with Mmp12, Lgals3, Fam20c, Capg, Pkm2, Sdc4, and Itga5 in microglia subcluster 2 and maybe a therapeutic agent for the poor development of microglia subcluster 2 after stroke. Our study presents a nomogram model for stroke diagnosis and provides a potential molecule agent for stroke therapy.
由于缺血性中风的治疗窗口狭窄和死亡率高,因此研究其诊断和治疗方法具有重要意义。我们采用加权基因共表达网络分析(WGCNA)确定与中风相关的基因模块,并使用 maSigPro R 软件包寻找中风进展过程中的时间依赖性基因。进一步采用三种机器学习算法识别中风的特征基因。构建列线图模型并应用于评估中风患者。我们分析了缺血性中风的单细胞 RNA 测序(scRNA-seq)数据,以辨别小胶质细胞亚群。进行 RNA 速度、伪时间和基因集富集分析(GSEA),以研究小胶质细胞亚群之间的关系。连接图谱(CMap)分析和分子对接用于筛选中风的治疗剂。基于特征基因的列线图模型显示出临床净效益,可以准确评估中风患者。RNA 速度和伪时间分析表明,小胶质细胞亚群 0 会在中风发作后 24 小时内向亚群 2 发展。GSEA 表明小胶质细胞亚群 0 的功能与亚群 2 相反。从 CMap 分析中筛选出的 AZ_628 与小胶质细胞亚群 2 中的 Mmp12、Lgals3、Fam20c、Capg、Pkm2、Sdc4 和 Itga5 的结合能较低,可能是中风后小胶质细胞亚群 2 发育不良的治疗剂。我们的研究提出了中风诊断的列线图模型,并为中风治疗提供了潜在的分子治疗剂。