Zhou Leqi, Yu Yue, Wen Rongbo, Zheng Kuo, Jiang Siyuan, Zhu Xiaoming, Sui Jinke, Gong Haifeng, Lou Zheng, Hao Liqiang, Yu Guanyu, Zhang Wei
Department of Colorectal Surgery, Changhai Hospital, Shanghai, China.
Front Oncol. 2022 May 10;12:863094. doi: 10.3389/fonc.2022.863094. eCollection 2022.
Most prognostic signatures for colorectal cancer (CRC) are developed to predict overall survival (OS). Gene signatures predicting recurrence-free survival (RFS) are rarely reported, and postoperative recurrence results in a poor outcome. Thus, we aim to construct a robust, individualized gene signature that can predict both OS and RFS of CRC patients.
Prognostic genes that were significantly associated with both OS and RFS in GSE39582 and TCGA cohorts were screened univariate Cox regression analysis and Venn diagram. These genes were then submitted to least absolute shrinkage and selection operator (LASSO) regression analysis and followed by multivariate Cox regression analysis to obtain an optimal gene signature. Kaplan-Meier (K-M), calibration curves and receiver operating characteristic (ROC) curves were used to evaluate the predictive performance of this signature. A nomogram integrating prognostic factors was constructed to predict 1-, 3-, and 5-year survival probabilities. Function annotation and pathway enrichment analyses were used to elucidate the biological implications of this model.
A total of 186 genes significantly associated with both OS and RFS were identified. Based on these genes, LASSO and multivariate Cox regression analyses determined an 8-gene signature that contained ATOH1, CACNB1, CEBPA, EPPHB2, HIST1H2BJ, INHBB, LYPD6, and ZBED3. Signature high-risk cases had worse OS in the GSE39582 training cohort (hazard ratio [HR] = 1.54, 95% confidence interval [CI] = 1.42 to 1.67) and the TCGA validation cohort (HR = 1.39, 95% CI = 1.24 to 1.56) and worse RFS in both cohorts (GSE39582: HR = 1.49, 95% CI = 1.35 to 1.64; TCGA: HR = 1.39, 95% CI = 1.25 to 1.56). The area under the curves (AUCs) of this model in the training and validation cohorts were all around 0.7, which were higher or no less than several previous models, suggesting that this signature could improve OS and RFS prediction of CRC patients. The risk score was related to multiple oncological pathways. CACNB1, HIST1H2BJ, and INHBB were significantly upregulated in CRC tissues.
A credible OS and RFS prediction signature with multi-cohort and cross-platform compatibility was constructed in CRC. This signature might facilitate personalized treatment and improve the survival of CRC patients.
大多数结直肠癌(CRC)的预后特征是为预测总生存期(OS)而开发的。预测无复发生存期(RFS)的基因特征鲜有报道,而术后复发会导致不良预后。因此,我们旨在构建一个强大的、个体化的基因特征,以预测CRC患者的OS和RFS。
通过单变量Cox回归分析和维恩图筛选出在GSE39582和TCGA队列中与OS和RFS均显著相关的预后基因。然后将这些基因进行最小绝对收缩和选择算子(LASSO)回归分析,随后进行多变量Cox回归分析,以获得最佳基因特征。采用Kaplan-Meier(K-M)法、校准曲线和受试者工作特征(ROC)曲线来评估该特征的预测性能。构建了一个整合预后因素的列线图,以预测1年、3年和5年生存概率。使用功能注释和通路富集分析来阐明该模型的生物学意义。
共鉴定出186个与OS和RFS均显著相关的基因。基于这些基因,LASSO和多变量Cox回归分析确定了一个包含ATOH1、CACNB1、CEBPA、EPPHB2、HIST1H2BJ、INHBB、LYPD6和ZBED3的8基因特征。在GSE39582训练队列(风险比[HR]=1.54,95%置信区间[CI]=1.42至1.67)和TCGA验证队列(HR=1.39,95%CI=1.24至1.56)中,特征高风险病例的OS较差,且在两个队列中的RFS也较差(GSE39582:HR=1.49,95%CI=1.35至1.64;TCGA:HR=1.39,95%CI=1.25至1.56)。该模型在训练和验证队列中的曲线下面积(AUC)均在0.7左右,高于或不低于之前的几个模型,表明该特征可以改善CRC患者的OS和RFS预测。风险评分与多种肿瘤学通路相关。CACNB1、HIST1H2BJ和INHBB在CRC组织中显著上调。
在CRC中构建了一个具有多队列和跨平台兼容性的可靠的OS和RFS预测特征。该特征可能有助于个性化治疗并提高CRC患者的生存率。