Johnson Heather, El-Schich Zahra, Ali Amjad, Zhang Xuhui, Simoulis Athanasios, Wingren Anette Gjörloff, Persson Jenny L
Olympia Diagnostics, Sunnyvale, CA 94086, USA.
Department of Biomedical Sciences, Malmö University, SE-206 06 Malmö, Sweden.
Cancers (Basel). 2022 Apr 18;14(8):2045. doi: 10.3390/cancers14082045.
Purpose: Despite the high mortality of metastatic colorectal cancer (mCRC), no new biomarker tools are available for predicting treatment response. We developed gene-mutation-based algorithms as a biomarker classifier to predict treatment response with better precision than the current predictive factors. Methods: Random forest machine learning (ML) was applied to identify the candidate algorithms using the MSK Cohort (n = 471) as a training set and validated in the TCGA Cohort (n = 221). Logistic regression, progression-free survival (PFS), and univariate/multivariate Cox proportional hazard analyses were performed and the performance of the candidate algorithms was compared with the established risk parameters. Results: A novel 7-Gene Algorithm based on mutation profiles of seven KRAS-associated genes was identified. The algorithm was able to distinguish non-progressed (responder) vs. progressed (non-responder) patients with AUC of 0.97 and had predictive power for PFS with a hazard ratio (HR) of 16.9 (p < 0.001) in the MSK cohort. The predictive power of this algorithm for PFS was more pronounced in mCRC (HR = 16.9, p < 0.001, n = 388). Similarly, in the TCGA validation cohort, the algorithm had AUC of 0.98 and a significant predictive power for PFS (p < 0.001). Conclusion: The novel 7-Gene Algorithm can be further developed as a biomarker model for prediction of treatment response in mCRC patients to improve personalized therapies.
尽管转移性结直肠癌(mCRC)死亡率很高,但尚无新的生物标志物工具可用于预测治疗反应。我们开发了基于基因突变的算法作为生物标志物分类器,以比当前预测因素更精确地预测治疗反应。方法:应用随机森林机器学习(ML),以MSK队列(n = 471)作为训练集来识别候选算法,并在TCGA队列(n = 221)中进行验证。进行逻辑回归、无进展生存期(PFS)以及单变量/多变量Cox比例风险分析,并将候选算法的性能与既定风险参数进行比较。结果:基于7个KRAS相关基因的突变谱,确定了一种新型的7基因算法。该算法能够区分病情未进展(反应者)与病情进展(无反应者)的患者,在MSK队列中AUC为0.97,对PFS具有预测能力,风险比(HR)为16.9(p < 0.001)。该算法对mCRC患者PFS的预测能力更为显著(HR = 16.9,p < 0.001,n = 388)。同样,在TCGA验证队列中,该算法的AUC为0.98,对PFS具有显著预测能力(p < 0.001)。结论:新型7基因算法可进一步开发为预测mCRC患者治疗反应的生物标志物模型,以改善个性化治疗。