Lin Jing, Meng Yu, Song Meng-Fan, Gu Wei
The International Peace Maternity and Child Health Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China.
Shanghai Key Laboratory of Embryo Original Diseases, Shanghai, China.
Front Mol Biosci. 2022 Jun 16;9:757203. doi: 10.3389/fmolb.2022.757203. eCollection 2022.
WGCNA is a potent systems biology approach that explains the connection of gene expression based on a microarray database, which facilitates the discovery of disease therapy targets or potential biomarkers. Preeclampsia is a kind of pregnancy-induced hypertension caused by complex factors. The disease's pathophysiology, however, remains unknown. The focus of this research is to utilize WGCNA to identify susceptible modules and genes in the peripheral blood of preeclampsia patients. Obtain the whole gene expression data of GSE48424 preeclampsia patients and normal pregnant women from NCBI's GEO database. WGCNA is used to construct a gene co-expression network by calculating correlation coefficients between modules and phenotypic traits, screening important modules, and filtering central genes. To identify hub genes, we performed functional enrichment analysis, pathway analysis, and protein-protein interaction (PPI) network construction on key genes in critical modules. Then, the genetic data file GSE149437 and clinical peripheral blood samples were used as a validation cohort to determine the diagnostic value of these key genes. Nine gene co-expression modules were constructed through WGCNA analysis. Among them, the blue module is significantly related to preeclampsia and is related to its clinical severity. Thirty genes have been discovered by using the intersection of the genes in the blue module and the DEGs genes as the hub genes. It was found that HDC, MS4A2, and SLC18A2 scored higher in the PPI network and were identified as hub genes. These three genes were also differentially expressed in peripheral blood validation samples. Based on the above three genes, we established the prediction model of peripheral blood markers of preeclampsia and drew the nomogram and calibration curve. The ROC curves were used in the training cohort GSE48424 and the validation cohort GSE149437 to verify the predictive value of the above model. Finally, it was confirmed in the collected clinical peripheral blood samples that MS4A2 was differentially expressed in the peripheral blood of early-onset and late-onset preeclampsia, which is of great significance. This study provides a new biomarker and prediction model for preeclampsia.
加权基因共表达网络分析(WGCNA)是一种强大的系统生物学方法,它基于微阵列数据库解释基因表达的联系,有助于发现疾病治疗靶点或潜在生物标志物。子痫前期是一种由复杂因素引起的妊娠高血压疾病。然而,该疾病的病理生理学仍不清楚。本研究的重点是利用WGCNA来识别子痫前期患者外周血中的易感模块和基因。从NCBI的基因表达综合数据库(GEO)中获取GSE48424子痫前期患者和正常孕妇的全基因表达数据。通过计算模块与表型性状之间的相关系数,利用WGCNA构建基因共表达网络,筛选重要模块并过滤核心基因。为了识别枢纽基因,我们对关键模块中的关键基因进行了功能富集分析、通路分析和蛋白质-蛋白质相互作用(PPI)网络构建。然后,将基因数据文件GSE149437和临床外周血样本用作验证队列,以确定这些关键基因的诊断价值。通过WGCNA分析构建了9个基因共表达模块。其中,蓝色模块与子痫前期显著相关,并与其临床严重程度相关。通过将蓝色模块中的基因与差异表达基因(DEGs)的交集作为枢纽基因,发现了30个基因。结果发现,组氨酸脱羧酶(HDC)、膜表面免疫球蛋白A2(MS4A2)和囊泡单胺转运体2(SLC18A2)在PPI网络中得分较高,并被确定为枢纽基因。这三个基因在周围血验证样本中也存在差异表达。基于上述三个基因,我们建立了子痫前期外周血标志物的预测模型,并绘制了列线图和校准曲线。在训练队列GSE48424和验证队列GSE149437中使用ROC曲线来验证上述模型的预测价值。最后,在收集的临床外周血样本中证实,MS4A2在早发型和晚发型子痫前期患者外周血中存在差异表达,具有重要意义。本研究为子痫前期提供了新的生物标志物和预测模型。