Qin Dongxu, Zheng Yongquan, Wang Libo, Lin Zhenyi, Yao Yao, Fei Weidong, Zheng Caihong
Department of Pharmacy, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China.
Department of pharmacy, Affiliated Xianju's Hospital, XianJu People's Hospital, Zhejiang Southeast Campus of Zhejiang Provincial People's Hospital, Hangzhou Medical College, Xianju, 317300, Zhejiang, China.
Sci Rep. 2025 Mar 17;15(1):9110. doi: 10.1038/s41598-025-93146-7.
Endometriosis and Recurrent Implantation Failure (RIF) are both pivotal clinical issues within the realm of reproductive medicine, sharing significant overlap in their pathophysiological mechanisms. However, research exploring the commonalities between these two conditions remains relatively scarce, and reliable shared diagnostic biomarkers have yet to be identified. In this study, we integrated transcriptomic and single-cell sequencing data from the Gene Expression Omnibus (GEO) database to identify shared diagnostic genes and alterations in the cellular microenvironment between EMs and RIF. Differential expression analysis and weighted gene co-expression network analysis (WGCNA) were employed to identify key genes. Machine learning algorithms, including Random Forest (RF) and XGBoost, were utilized to screen for shared diagnostic genes, which were subsequently validated through receiver operating characteristic (ROC) analysis and clinical prediction models. Single-cell analysis was conducted to investigate the expression patterns of these diagnostic genes across various cellular subpopulations. Additionally, gene set enrichment analysis (GSEA) and competing endogenous RNA (ceRNA) network analysis were employed to further elucidate the biological functions and regulatory mechanisms of these genes. A total of 16 key genes were identified, which were predominantly expressed in fibroblasts. Through machine learning, the optimal model combining RF and XGBoost was selected to identify the shared diagnostic genes PDIA4 and PGBD5. Single-cell analysis revealed significant differences in the expression of these diagnostic genes in fibroblasts between normal and disease states. ROC analysis showed that the Area Under the Curve (AUC) values for individual genes in disease diagnosis were all above 0.7. The constructed clinical prediction model demonstrated robust predictive capacity for the disease. Immune infiltration analysis indicated that M2 macrophages and γδ T cells play important roles in the pathogenesis of EMs and RIF. GSEA revealed that these genes are involved in immune responses, vascular function, and hormone regulation, and are regulated by miR-3121-3p. This study provides comprehensive insights into the shared cellular microenvironmental alterations and molecular mechanisms underlying EMs and RIF. The identification of PDIA4 and PGBD5 as shared diagnostic biomarkers offers new avenues for early diagnosis and targeted treatment of EMs-related RIF. Future work will focus on validating these findings in larger cohorts and exploring their therapeutic potential.
子宫内膜异位症和反复种植失败(RIF)都是生殖医学领域的关键临床问题,其病理生理机制存在显著重叠。然而,探索这两种疾病共性的研究相对较少,尚未确定可靠的共享诊断生物标志物。在本研究中,我们整合了来自基因表达综合数据库(GEO)的转录组和单细胞测序数据,以确定子宫内膜异位症(EMs)和反复种植失败(RIF)之间的共享诊断基因以及细胞微环境的变化。采用差异表达分析和加权基因共表达网络分析(WGCNA)来识别关键基因。利用包括随机森林(RF)和XGBoost在内的机器学习算法筛选共享诊断基因,随后通过受试者工作特征(ROC)分析和临床预测模型进行验证。进行单细胞分析以研究这些诊断基因在不同细胞亚群中的表达模式。此外,采用基因集富集分析(GSEA)和竞争性内源RNA(ceRNA)网络分析进一步阐明这些基因的生物学功能和调控机制。共鉴定出16个关键基因,主要在成纤维细胞中表达。通过机器学习,选择了结合RF和XGBoost的最佳模型来识别共享诊断基因PDIA4和PGBD5。单细胞分析显示,这些诊断基因在正常和疾病状态下成纤维细胞中的表达存在显著差异。ROC分析表明,疾病诊断中单个基因的曲线下面积(AUC)值均高于0.7。构建的临床预测模型对疾病具有强大的预测能力。免疫浸润分析表明,M2巨噬细胞和γδT细胞在子宫内膜异位症和反复种植失败的发病机制中起重要作用。GSEA显示,这些基因参与免疫反应、血管功能和激素调节,并受miR - 3121 - 3p调控。本研究全面深入地了解了子宫内膜异位症和反复种植失败共同的细胞微环境变化和分子机制。将PDIA4和PGBD5鉴定为共享诊断生物标志物为子宫内膜异位症相关反复种植失败的早期诊断和靶向治疗提供了新途径。未来的工作将集中在更大队列中验证这些发现,并探索其治疗潜力。