Jia Yiwen, Feng Guangming, Chen Siyuan, Li Wenhao, Jia Zeguo, Wang Jian, Li Hongxia, Hong Shaocheng, Dai Fu
Department of Gastroenterology, The Third Affiliated Hospital of Anhui Medical University (Hefei first people's Hospital), Hefei, China.
Department of Pulmonology, The First Affiliated Hospital of Anhui Medical University, Hefei, 230032, China.
J Cancer. 2024 Jun 3;15(13):4175-4196. doi: 10.7150/jca.94630. eCollection 2024.
Metabolic reprogramming plays a crucial role in the development of colorectal cancer (CRC), influencing tumor heterogeneity, the tumor microenvironment, and metastasis. While the interaction between metabolism and CRC is critical for developing personalized treatments, gaps remain in understanding how tumor cell metabolism affects prognosis. Our study introduces novel insights by integrating single-cell and bulk transcriptome analyses to explore the metabolic landscape within CRC cells and its mechanisms influencing disease progression. This approach allows us to uncover metabolic heterogeneity and identify specific metabolic genes impacting metastasis, which have not been thoroughly examined in previous studies. We sourced microarray and single-cell RNA sequencing datasets from the Gene Expression Omnibus (GEO) and bulk sequencing data for CRC from The Cancer Genome Atlas (TCGA). We employed Gene Set Variation Analysis (GSVA) to assess metabolic pathway activity, consensus clustering to identify CRC-specific transcriptome subtypes in bulkseq, and rigorous quality controls, including the exclusion of cells with high mitochondrial gene expression in scRNA seq. Advanced analyses such as AUCcell, infercnvCNV, Non-negative Matrix Factorization (NMF), and CytoTRACE were utilized to dissect the cellular landscape and evaluate pathway activities and tumor cell stemness. The hdWGCNA algorithm helped identify prognosis-related hub genes, integrating these findings using a random forest machine learning model. Kaplan-Meier survival curves identified 21 significant metabolic pathways linked to prognosis, with consensus clustering defining three CRC subtypes (C3, C2, C1) based on metabolic activity, which correlated with distinct clinical outcomes. The metabolic activity of the 13 cell subpopulations, particularly the epithelial cell subpopulation with active metabolic levels, was evaluated using AUCcell in scRNA seq. To further analyze tumor cells using infercnv, NMF disaggregated these cells into 10 cellular subpopulations. Among these, the C2 subpopulation exhibited higher stemness and tended to have a poorer prognosis compared to C6 and C0. Conversely, the C8, C3, and C1 subpopulations demonstrated a higher level of the five metabolic pathways, and the C3 and C8 subpopulations tended to have a more favorable prognosis. hdWGCNA identified 20 modules, from which we selected modules primarily expressed in high metabolic tumor subgroups and highly correlated with clinical information, including blue and cyan. By applying variable downscaling of RF to a total of 50 hub genes, seven gene signatures were obtained. Furthermore, molecules that were validated to be protective in GEO were screened alongside related molecules, resulting in the identification of prognostically relevant molecules such as UQCRFS1 and GRSF1. Additionally, the expression of GRSF1 was examined in colon cancer cell lines using qPCR and phenotypically verified by experiments. Our findings emphasize that high activity in specific metabolic pathways, including pyruvate metabolism and the tricarboxylic acid cycle, correlates with improved colon cancer outcomes, presenting new avenues for metabolic-based therapies. The identification of hub genes like GRSF1 and UQCRFS1 and their link to favorable metabolic profiles offers novel insights into tumor neovascularization and metastasis, with significant clinical implications for targeting metabolic pathways in CRC therapy.
代谢重编程在结直肠癌(CRC)的发展中起着至关重要的作用,影响肿瘤异质性、肿瘤微环境和转移。虽然代谢与CRC之间的相互作用对于开发个性化治疗至关重要,但在理解肿瘤细胞代谢如何影响预后方面仍存在差距。我们的研究通过整合单细胞和批量转录组分析引入了新的见解,以探索CRC细胞内的代谢格局及其影响疾病进展的机制。这种方法使我们能够揭示代谢异质性,并识别影响转移的特定代谢基因,这些基因在以前的研究中尚未得到充分研究。我们从基因表达综合数据库(GEO)获取了微阵列和单细胞RNA测序数据集,并从癌症基因组图谱(TCGA)获取了CRC的批量测序数据。我们采用基因集变异分析(GSVA)来评估代谢途径活性,采用共识聚类来识别批量测序中的CRC特异性转录组亚型,并进行严格的质量控制,包括在scRNA测序中排除线粒体基因表达高的细胞。利用AUCcell、infercnvCNV、非负矩阵分解(NMF)和CytoTRACE等高级分析来剖析细胞格局,并评估途径活性和肿瘤细胞干性。hdWGCNA算法有助于识别与预后相关的枢纽基因,并使用随机森林机器学习模型整合这些发现。Kaplan-Meier生存曲线确定了21条与预后相关的重要代谢途径,共识聚类根据代谢活性定义了三种CRC亚型(C3、C2、C1),这与不同的临床结果相关。在scRNA测序中使用AUCcell评估了13个细胞亚群的代谢活性,特别是代谢水平活跃的上皮细胞亚群。为了使用infercnv进一步分析肿瘤细胞,NMF将这些细胞分解为10个细胞亚群。其中,C2亚群表现出更高的干性,与C6和C0相比,预后往往更差。相反,C8、C3和C1亚群表现出五种代谢途径的较高水平,C3和C8亚群的预后往往更有利。hdWGCNA识别出20个模块,我们从中选择了主要在高代谢肿瘤亚组中表达且与临床信息高度相关的模块,包括蓝色和青色模块。通过对总共50个枢纽基因应用可变降尺度的随机森林,获得了7个基因特征。此外,筛选了在GEO中被验证具有保护作用的分子以及相关分子,从而鉴定出与预后相关的分子,如UQCRFS1和GRSF1。此外,使用qPCR检测了GRSF1在结肠癌细胞系中的表达,并通过实验进行了表型验证。我们的研究结果强调,包括丙酮酸代谢和三羧酸循环在内的特定代谢途径的高活性与改善的结肠癌结果相关,为基于代谢的治疗提供了新途径。识别出像GRSF1和UQCRFS1这样的枢纽基因及其与有利代谢谱的联系,为肿瘤新生血管形成和转移提供了新的见解,对CRC治疗中靶向代谢途径具有重要的临床意义。