Li Hao, Sun Yong, Chen Rong
Department of Pediatrics, Hejiang People's Hospital, Sichuan, China.
3 Biotech. 2021 Mar;11(3):127. doi: 10.1007/s13205-021-02675-1. Epub 2021 Feb 16.
The purpose of this study was to identify biomarkers and construct a diagnostic prediction model for multiple sclerosis (MS). Microarray datasets in the Gene Expression Omnibus (GEO) were downloaded. Weighted gene coexpression analysis (WGCNA) was used to search for hub modules and biomarkers related to MS. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were used to roughly define their biological functions and pathways. Least absolute shrinkage and selection operator (LASSO) regression and multivariate logistic regression analysis were used to identify the diagnostic biomarkers and construct a nomogram. The calibration curve and receiver operating characteristic (ROC) curve were used to judge the diagnostic predictive ability. In addition, cell-type identification by estimating relative subsets of RNA transcripts (CIBERSORT) algorithm was used to calculate the proportion of 22 kinds of immune cells. GSE41850 was used as the training set, and GSE17048 was used as the test set. WGCNA revealed one hub module containing 165 hub genes. Most of their biological functions and pathways are related to cell metabolism and immune cell activation. The diagnostic nomogram contained , , , , and . The ROC curve and the calibration curve of the training set and test set confirmed that the nomogram had great prediction ability. In addition, monocytes and M0 macrophages were significantly different between MS patients and healthy people. The expression of and is correlated with M0 macrophages. The nomogram provides new insights and contributes to the accurate diagnosis of MS.
The online version contains supplementary material available at 10.1007/s13205-021-02675-1.
本研究的目的是识别生物标志物并构建多发性硬化症(MS)的诊断预测模型。从基因表达综合数据库(GEO)下载微阵列数据集。使用加权基因共表达分析(WGCNA)来搜索与MS相关的枢纽模块和生物标志物。基因本体(GO)和京都基因与基因组百科全书(KEGG)分析用于大致确定它们的生物学功能和途径。使用最小绝对收缩和选择算子(LASSO)回归及多变量逻辑回归分析来识别诊断生物标志物并构建列线图。校准曲线和受试者工作特征(ROC)曲线用于判断诊断预测能力。此外,通过估计RNA转录本相对子集进行细胞类型鉴定(CIBERSORT)算法用于计算22种免疫细胞的比例。GSE41850用作训练集,GSE17048用作测试集。WGCNA揭示了一个包含165个枢纽基因的枢纽模块。它们的大多数生物学功能和途径与细胞代谢和免疫细胞激活有关。诊断列线图包含 、 、 、 、 和 。训练集和测试集的ROC曲线及校准曲线证实列线图具有很强的预测能力。此外,MS患者和健康人之间的单核细胞和M0巨噬细胞存在显著差异。 和 的表达与M0巨噬细胞相关。该列线图提供了新的见解,有助于MS的准确诊断。
在线版本包含可在10.1007/s13205-021-02675-1获取的补充材料。