Zhang Cuizhen, Niu Wanjie, Zhang Jiangtao, Zheng Yingyi, Chen Zhiru, Zhang Fali, Qiu Xiaoyan
Department of Pharmacy, Huashan Hospital, Fudan University, 12 Middle Urumqi Road, Shanghai, 200040, China.
Key Laboratory of Big Data Intelligent Computing, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China.
Int J Med Sci. 2025 Aug 11;22(14):3749-3762. doi: 10.7150/ijms.114833. eCollection 2025.
There is significant individual variation in the efficacy of cetuximab for the treatment of colorectal cancer (CRC). However, effective models to predict treatment outcomes are still lacking in clinical practice. Datasets (GSE106582 and GSE83889) were used to identify differentially expressed genes (DEGs) in CRC by the 'Limma' package in R software. Hypoxia-related genes were retrieved from the Molecular Signatures Database and cross-referenced with CRC DEGs. Protein expression levels were verified using immunohistochemistry (IHC) data from the Human Protein Atlas (HPA), and prognostic significance was assessed through the Kaplan-Meier plotter platform. Additionally, pathway and immune infiltration analyses were performed using the GSCA platform. We also successfully constructed a prediction model for cetuximab treatment response using the K-nearest neighbors (KNN) algorithm in GSE108277 dataset, in which the feature selection was performed through the permutation importance method. Analysis of GSE106582 and GSE83889 identified 417 overlapping DEGs by comparing cancer tissues with normal controls, including 16 hypoxia-related genes. 6 genes (, , , , , and ) were upregulated, and 10 genes (, , , , , , , , , and ) were downregulated in CRC. Survival analysis revealed that the 16 hypoxia-related DEGs were linked to the survival outcomes of CRC patients. Pathway analysis indicated that these genes were almost involved in EMT, cell cycle, and RTK pathways. Furthermore, these genes play a role in the infiltration of immune cells and may regulate the immune microenvironment. A prediction model for cetuximab response was developed, based on 10 key genes (, , , , , , , , , and ) and dataset from GSE108277. The model demonstrated robust performance with an accuracy of 0.9500, precision of 0.8378, recall of 1.0000, F1-score of 0.9118, and a receiver operating characteristic-area under the curve (ROC-AUC) of 0.9663. Our study identifies 10 hypoxia-related DEGs as key players in CRC progression and cetuximab response. And we successfully developed a predictive model to forecast the response of CRC patients to cetuximab treatment. This study will provide valuable biomarkers for CRC prognosis and help guide more effective therapeutic strategies.
西妥昔单抗治疗结直肠癌(CRC)的疗效存在显著个体差异。然而,临床实践中仍缺乏有效的模型来预测治疗结果。利用数据集(GSE106582和GSE83889),通过R软件中的“Limma”软件包来识别CRC中差异表达基因(DEGs)。从分子特征数据库中检索缺氧相关基因,并与CRC DEGs进行交叉引用。使用来自人类蛋白质图谱(HPA)的免疫组织化学(IHC)数据验证蛋白质表达水平,并通过Kaplan-Meier绘图平台评估预后意义。此外,使用GSCA平台进行通路和免疫浸润分析。我们还在GSE108277数据集中使用K近邻(KNN)算法成功构建了西妥昔单抗治疗反应的预测模型,其中通过排列重要性方法进行特征选择。通过比较癌组织与正常对照,对GSE106582和GSE83889的分析确定了417个重叠的DEGs,包括16个缺氧相关基因。在CRC中,6个基因(, , , , ,和 )上调,10个基因(, , , , , , , ,和 )下调。生存分析显示,这16个缺氧相关DEGs与CRC患者的生存结果相关。通路分析表明,这些基因几乎参与上皮-间质转化(EMT)、细胞周期和受体酪氨酸激酶(RTK)通路。此外,这些基因在免疫细胞浸润中起作用,并可能调节免疫微环境。基于10个关键基因(, , , , , , , ,和 )和GSE108277数据集,开发了西妥昔单抗反应预测模型。该模型表现出强大的性能,准确率为0.9500,精确率为0.8378,召回率为1.0000,F1值为0.9118,曲线下面积(ROC-AUC)为0.9663。我们的研究确定了10个缺氧相关DEGs是CRC进展和西妥昔单抗反应的关键因素。并且我们成功开发了一个预测模型来预测CRC患者对西妥昔单抗治疗的反应。本研究将为CRC预后提供有价值的生物标志物,并有助于指导更有效的治疗策略。