Wu Sixuan, Tang Yuanbin, Pan Qihong, Zheng Yaqin, Tan Yeru, Pan Junfan, Li Yuehua
Department of Oncology, The First Affiliated Hospital, Hengyang Medical School, University of South China, No. 69 Chuanshan Road, Hengyang, 421001, Hunan, China.
Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, 350014, China.
Sci Rep. 2025 Jul 2;15(1):22893. doi: 10.1038/s41598-025-05227-2.
Lung adenocarcinoma (LUAD) is a major challenge in oncology due to its complex molecular structure and generally poor prognosis. The aim of this study was to find diagnostic markers and therapeutic targets for LUAD by integrating differential gene expression analysis, weighted gene co-expression network analysis (WGCNA), and machine learning methods. Differentially expressed genes (DEGs) were identified through the analysis of gene expression data from the Gene Expression Omnibus (GEO) database. Ten of the gene co-expression modules constructed by WGCNA were identified, with the red module having the most significant correlation with clinical features. In addition, a machine learning model constructed based on Stepglm[backward] with the random forest algorithm achieved the highest C-index (0.999) and screened eight core genes, among which ST14 was noted for its excellent predictive ability. Single-cell RNA sequencing of the LUAD samples further analyzed the expression patterns of these genes in 29 cellular subtypes, revealing their significant association with immune cell infiltration. Of particular note, the association of ST14 with clinical prognosis, drug responsiveness, and immune infiltration was validated, while enrichment analysis further clarified its role in key biological pathways. Ultimately, the expression of the core genes was validated experimentally. This study provides new insights into the pathogenesis of LUAD, clarifies potential diagnostic markers and therapeutic targets, and provides an important basis for future clinical interventions.
肺腺癌(LUAD)因其复杂的分子结构和普遍较差的预后,成为肿瘤学领域的一项重大挑战。本研究的目的是通过整合差异基因表达分析、加权基因共表达网络分析(WGCNA)和机器学习方法,寻找肺腺癌的诊断标志物和治疗靶点。通过对来自基因表达综合数据库(GEO)的基因表达数据进行分析,鉴定出差异表达基因(DEG)。WGCNA构建的基因共表达模块中,鉴定出10个模块,其中红色模块与临床特征的相关性最为显著。此外,基于带有随机森林算法的Stepglm[向后]构建的机器学习模型获得了最高的C指数(0.999),并筛选出8个核心基因,其中ST14因其出色的预测能力而受到关注。对肺腺癌样本进行单细胞RNA测序,进一步分析了这些基因在29种细胞亚型中的表达模式,揭示了它们与免疫细胞浸润的显著关联。特别值得注意的是,验证了ST14与临床预后、药物反应性和免疫浸润的关联,而富集分析进一步阐明了其在关键生物学途径中的作用。最终,通过实验验证了核心基因的表达。本研究为肺腺癌的发病机制提供了新的见解,阐明了潜在的诊断标志物和治疗靶点,并为未来的临床干预提供了重要依据。
Nucleic Acids Res. 2025-1-6
BMC Cancer. 2024-3-11