Cai Xiaoshu, Chen Yang, Zheng Chunlei, Xu Rong
Department of Electrical Engineering and Computer Science, School of Engineering, Case Western Reserve University, Cleveland, Ohio, USA.
Department of Epidemiology & Biostatistics, School of Medicine, Case Western Reserve University, Cleveland, Ohio, USA.
AMIA Jt Summits Transl Sci Proc. 2017 Jul 26;2017:227-236. eCollection 2017.
Colorectal cancer is the second leading cancer-related death worldwide and a majority of patients die from metastasis. Chronic intestinal inflammation plays an important role in tumor progression of colorectal cancer. However, few study works on systematically predicting colorectal cancer metastasis using inflammatory cytokine genes. We developed a supervised machine learning approach to predict colorectal cancer tumor progression using patient level genomic features. To better understand the role of cytokines, we integrated the metastatic-related genes from mouse phenotypic data. In addition, pathway analysis and network visualization were also applied to top significant genes ranked by feature weights of the final prediction model. The combined model of cytokines and mouse phenotypes achieved a predictive accuracy of 75.54%, higher than the model based on mouse phenotypes independently (70.42%, p-value<0.05). In additional, the combined model outperformed the model based on the existing metastatic-related epithelial-to-mesenchymal transition (EMT) genes (75.54% vs. 71.61%, p-value<0.05). We also observed that the most important cytokine gene features of the our model interact with the cancer driver genes and are highly associated with the colorectal cancer metastasis signaling pathway. We developed a combined model using both cytokine and mouse phenotype information to predict colorectal cancer metastasis. The results suggested that the inflammatory cytokines increase the power of predicting metastasis. We also systematically demonstrated the critical role of cytokines in progression of colorectal tumor.
结直肠癌是全球第二大致癌相关死亡原因,大多数患者死于转移。慢性肠道炎症在结直肠癌的肿瘤进展中起重要作用。然而,很少有研究致力于使用炎性细胞因子基因系统地预测结直肠癌转移。我们开发了一种监督机器学习方法,利用患者水平的基因组特征来预测结直肠癌的肿瘤进展。为了更好地理解细胞因子的作用,我们整合了来自小鼠表型数据的转移相关基因。此外,通路分析和网络可视化也应用于根据最终预测模型的特征权重排名的最显著基因。细胞因子和小鼠表型的组合模型实现了75.54%的预测准确率,高于独立基于小鼠表型的模型(70.42%,p值<0.05)。此外,组合模型优于基于现有转移相关上皮-间质转化(EMT)基因的模型(75.54%对71.61%,p值<0.05)。我们还观察到我们模型中最重要的细胞因子基因特征与癌症驱动基因相互作用,并且与结直肠癌转移信号通路高度相关。我们开发了一个使用细胞因子和小鼠表型信息的组合模型来预测结直肠癌转移。结果表明炎性细胞因子提高了预测转移的能力。我们还系统地证明了细胞因子在结直肠癌进展中的关键作用。