Zhang Li, Sun ZhenPeng, Yuan Yao, Sheng Jie
Department of Epidemiology and Statistics, College of Public Health, Zhengzhou University, Zhengzhou, 450001, Henan, China.
School of Basic Medical Sciences, Chongqing Medical University, Chongqing, 400016, China.
Sci Rep. 2025 May 15;15(1):16868. doi: 10.1038/s41598-025-01628-5.
Diabetes mellitus (DM) is a chronic metabolic disorder that poses significant challenges to public health. Among its various complications, diabetic nephropathy (DN) emerges as a critical microvascular complication associated with high mortality rates. Despite the development of diverse therapeutic strategies targeting metabolic improvement, hemodynamic regulation, and fibrosis mitigation, the precise mechanisms responsible for glomerular injury in DN are not yet fully elucidated. To explore these mechanisms, public DN datasets (GSE30528, GSE104948, and GSE96804) were obtained from the GEO database. We merged the GSE30528 and GSE104948 datasets to identify differentially expressed genes (DEGs) between DN and control groups using R software. Weighted gene co-expression network analysis (WGCNA) was subsequently employed to discern genes associated with DN in key modules. We utilized Venny software to pinpoint co-expressed genes shared between DEGs and key module genes. These co-expressed genes underwent gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) enrichment analyses. Through LASSO, SVM, and RF methods, we isolated five significant genes: FN1, C1orf21, CD36, CD48, and SRPX2. These genes were further validated using a logistic model and 10-fold cross-validation. The external dataset GSE96804 served to validate the identified biomarkers, while receiver operating characteristic (ROC) curve analysis assessed their diagnostic efficacy for DN. Additionally, GSE104948 facilitated comparison of biomarker expression levels between DN and five other kidney diseases, highlighting their specificity for DN. These biomarkers also enabled the identification and validation of two molecular subtypes characterized by distinct immune profiles. The Nephroseq v5 database corroborated the correlation between biomarkers and clinical data. Furthermore, the GSigDB database was employed to predict protein-drug interactions, with molecular docking confirming the therapeutic potential of these drug targets. Finally, a diabetic mouse model (BKS-db) was constructed, and RT-qPCR experiments validated the reliability of the identified biomarkers. The study identified five biomarkers with robust diagnostic predictive power for DN. Subtype classification based on these biomarkers revealed distinct enrichment pathways and immune cell infiltration profiles, underscoring the close relationship between these genes and immune functions in DN. Drug prediction and molecular docking analyses demonstrated excellent binding affinities of candidate drugs to target proteins. Differential expression analysis between DN and five other kidney diseases indicated that all biomarkers, except C1orf21, were highly expressed in DN. Notably, as the mouse model lacks the C1orf21 gene, RT-qPCR confirmed the upregulated expression of FN1, CD36, CD48, and SRPX2. This study successfully identified five biomarkers with potential diagnostic and therapeutic value for DN. These biomarkers not only offer insights into the regulatory mechanisms underlying glomerular injury but also provide a theoretical foundation for the development of diagnostic biomarkers and therapeutic targets related to DN-associated glomerular injury.
糖尿病(DM)是一种慢性代谢紊乱疾病,对公众健康构成重大挑战。在其各种并发症中,糖尿病肾病(DN)是一种关键的微血管并发症,死亡率很高。尽管已经开发出多种针对代谢改善、血流动力学调节和纤维化减轻的治疗策略,但DN中肾小球损伤的确切机制尚未完全阐明。为了探索这些机制,我们从基因表达综合数据库(GEO)中获取了公开的DN数据集(GSE30528、GSE104948和GSE96804)。我们使用R软件合并了GSE30528和GSE104948数据集,以识别DN组和对照组之间的差异表达基因(DEG)。随后采用加权基因共表达网络分析(WGCNA)来识别关键模块中与DN相关的基因。我们使用Venny软件来确定DEG和关键模块基因之间共享的共表达基因。这些共表达基因进行了基因本体(GO)和京都基因与基因组百科全书(KEGG)富集分析。通过套索回归(LASSO)、支持向量机(SVM)和随机森林(RF)方法,我们分离出五个重要基因:纤连蛋白1(FN1)、1号染色体开放阅读框21(C1orf21)、血小板糖蛋白IV(CD36)、CD48分子(CD48)和小脑退化相关蛋白2同源物(SRPX2)。使用逻辑模型和10倍交叉验证对这些基因进行了进一步验证。外部数据集GSE96804用于验证所识别的生物标志物,同时通过受试者工作特征(ROC)曲线分析评估它们对DN的诊断效能。此外,GSE104948有助于比较DN与其他五种肾脏疾病之间生物标志物的表达水平,突出了它们对DN的特异性。这些生物标志物还能够识别和验证两种具有不同免疫特征的分子亚型。Nephroseq v5数据库证实了生物标志物与临床数据之间的相关性。此外,使用GSigDB数据库预测蛋白质-药物相互作用,分子对接证实了这些药物靶点的治疗潜力。最后,构建了糖尿病小鼠模型(BKS-db),实时定量聚合酶链反应(RT-qPCR)实验验证了所识别生物标志物的可靠性。该研究确定了五个对DN具有强大诊断预测能力的生物标志物。基于这些生物标志物的亚型分类揭示了不同的富集途径和免疫细胞浸润谱,强调了这些基因与DN中免疫功能之间的密切关系。药物预测和分子对接分析表明候选药物与靶蛋白具有良好的结合亲和力。DN与其他五种肾脏疾病之间的差异表达分析表明,除C1orf21外,所有生物标志物在DN中均高表达。值得注意的是,由于小鼠模型缺乏C1orf21基因,RT-qPCR证实了FN1、CD36、CD48和SRPX2的表达上调。本研究成功识别出五个对DN具有潜在诊断和治疗价值的生物标志物。这些生物标志物不仅为肾小球损伤的调控机制提供了见解,也为与DN相关肾小球损伤的诊断生物标志物和治疗靶点的开发提供了理论基础。