Liu Yi, Bian Bingxian, Chen Shiyu, Zhou Bingqian, Zhang Peng, Shen Lisong, Chen Hui
Department of Clinical Laboratory, Xinhua Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Institute of Artificial Intelligence Medicine, Shanghai Academy of Experimental Medicine, Shanghai, China.
Cancer Med. 2025 Mar;14(6):e70659. doi: 10.1002/cam4.70659.
Gastric cancer (GC) is considered a highly heterogeneous disease, and currently, a comprehensive approach encompassing molecular data from various biological levels is lacking.
This study conducted different analyses, including the identification of differentially expressed genes (DEGs), weighted correlation networks (WGCNA), single-cell RNA sequencing (scRNA-seq), mRNA expression-based stemness index (mRNAsi), and multiCox analysis, utilizing data from Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) databases. Subsequently, the machine learning algorithms including least absolute shrinkage and selection operator (LASSO) regression and random forest (RF), combined with multiCox analysis were exploited to identify hub genes. These findings were then validated through the receiver operating characteristic (ROC) curve and Kaplan-Meier analysis, and were experimentally confirmed in GC samples by reverse transcription-polymerase chain reaction (RT-PCR) and enzyme-linked immunosorbent assay (ELISA).
Integrated analysis of TCGA and GEO databases, coupled with LASSO regression and RF algorithms, allowed us to identify 18 hub genes encoding differentially expressed secreted proteins in GC. The results of RT-PCR and bioinformatics analysis revealed four promising biomarkers with optimal diagnostic and prognostic potential. ROC analysis and Kaplan-Meier curves highlighted CHI3L1, FCGBP, VSIG2, and TFF2 as promising biomarkers for GC, offering superior modeling accuracy. These findings were further confirmed by RT-PCR and ELISA, affirming the clinical utility of these four biomarkers. Additionally, CIBERSORT analysis indicated a potential correlation between the four biomarkers and the infiltration of B memory cells and Treg cells.
This study unveiled four promising biomarkers present in the serum of patients with GC, which could serve as powerful indicators of GC and provide valuable insights for further research into GC pathogenesis.
胃癌(GC)被认为是一种高度异质性疾病,目前缺乏一种涵盖来自各种生物学水平分子数据的综合方法。
本研究利用基因表达综合数据库(GEO)和癌症基因组图谱(TCGA)数据库的数据,进行了不同的分析,包括差异表达基因(DEG)的鉴定、加权相关网络分析(WGCNA)、单细胞RNA测序(scRNA-seq)、基于mRNA表达的干性指数(mRNAsi)分析和多因素Cox分析。随后,利用包括最小绝对收缩和选择算子(LASSO)回归和随机森林(RF)在内的机器学习算法,并结合多因素Cox分析来鉴定枢纽基因。然后通过受试者工作特征(ROC)曲线和Kaplan-Meier分析对这些发现进行验证,并通过逆转录-聚合酶链反应(RT-PCR)和酶联免疫吸附测定(ELISA)在GC样本中进行实验确认。
对TCGA和GEO数据库的综合分析,结合LASSO回归和RF算法,使我们能够鉴定出18个在GC中编码差异表达分泌蛋白的枢纽基因。RT-PCR和生物信息学分析结果揭示了4个具有最佳诊断和预后潜力的有前景的生物标志物。ROC分析和Kaplan-Meier曲线突出显示CHI3L1、FCGBP、VSIG2和TFF2是GC有前景的生物标志物,具有卓越的建模准确性。RT-PCR和ELISA进一步证实了这些发现,肯定了这4种生物标志物的临床实用性。此外,CIBERSORT分析表明这4种生物标志物与B记忆细胞和调节性T细胞(Treg细胞)的浸润之间存在潜在关联。
本研究揭示了GC患者血清中存在的4种有前景的生物标志物,它们可作为GC的有力指标,并为进一步研究GC发病机制提供有价值的见解。