Lin S M, Wang X J, Huang S H, Xu Z B, Huang Y, Lu X R, Xu D B, Chi P
Department of Gastrointestinal Surgery, Fujian Medical University Longyan First Hospital, Longyan 364000, China.
Department of Colorectal Surgery, Fujian Medical University Union Hospital, Fuzhou 350000, China.
Zhonghua Zhong Liu Za Zhi. 2021 Feb 23;43(2):202-206. doi: 10.3760/cma.j.cn112152-20200419-00355.
To explore and establish an artificial neural network (ANN) model for predicting the efficacy of first-line FOLFOX chemotherapy for metastatic colorectal cancer. A set of FOLFOX chemotherapy data from a group of patients with metastatic colorectal cancer (mCRC) (GSE104645) was downloaded from the GEO database as a training set. According to the FOLFOX protocol, the efficacy was divided into two groups: the chemo-sensitive group (including complete response and partial response) and the chemo-resistant group (including stable disease and progressive disease), including 31 cases in the sensitive group and 23 in the resistant group. Then, chip data (accessible number: GSE69657) from Fujian Medical University Union Hospital were chosen as a test set. A total of 30 patients were enrolled in the study, including 13 in the sensitive group and 17 in the resistant group. The batch effect correction was performed on the expression values of the two sets of matrices using the R 3.5.1 software Combat package. The gene expression difference of sensitive and resistant group in GSE104645 was analyzed by the GEO2R platform. <0.05 and the absolute value of log(2)FC>0.33 (FC abbreviation of fold change) were used as the threshold value to screen the drug resistance and sensitive genes of the FOLFOX regimen. An ANN was constructed using the multi-layer perceptron (MLP) to perform the FOLFOX regimen on the GSE104645 dataset. The GSE69657 expression matrix and clinical efficacy parameters were then used for retrospective verification. Receiver operating characteristic(ROC) curves were used to evaluate the test results and predictive power. A total of 2, 076 differentially expressed genes in GSE104645 were selected, of which 822 genes were up-regulated and 1, 254 genes were down-regulated in the chemo-resistance group. The down-regulated genes were sensitive genes. GO analysis of the biological processes in which the differentially expressed genes were involved, revealed that they were mainly involved in the regulation of substance metabolism. A total of 39 genes were included in the final model construction. This was a neural network model with two hidden layers. The accuracy of predicting training samples and test samples was 75.7% and 76.5%, respectively, and the area under the ROC curve was 0.875. The chip data set of our department (GSE69657) was set as the test set, and the area under the ROC curve was 0.778. In this study, an artificial neural network model is successfully constructed to predict the efficacy of first-line FOLFOX regimen for metastatic colorectal cancer based on the microarray, and an independent external verification is also conducted. The model has good stability and well prediction efficiency. Besides, the results of this study suggest that the gene functions related to oxaliplatin resistance are mainly enriched in the regulation process of substance metabolism.
探索并建立一种用于预测一线FOLFOX化疗对转移性结直肠癌疗效的人工神经网络(ANN)模型。从基因表达综合数据库(GEO数据库)下载一组转移性结直肠癌(mCRC)患者的FOLFOX化疗数据(GSE104645)作为训练集。根据FOLFOX方案,将疗效分为两组:化疗敏感组(包括完全缓解和部分缓解)和化疗耐药组(包括病情稳定和疾病进展),敏感组31例,耐药组23例。然后,选取福建医科大学附属协和医院的芯片数据(可获取编号:GSE69657)作为测试集。该研究共纳入30例患者,其中敏感组13例,耐药组17例。使用R 3.5.1软件的Combat程序包对两组矩阵的表达值进行批次效应校正。通过GEO2R平台分析GSE104645中敏感组和耐药组的基因表达差异。以<0.05且log(2)FC的绝对值>0.33(FC为变化倍数的缩写)作为阈值筛选FOLFOX方案的耐药和敏感基因。使用多层感知器(MLP)构建一个ANN,对GSE104645数据集进行FOLFOX方案分析。然后将GSE69657表达矩阵和临床疗效参数用于回顾性验证。采用受试者工作特征(ROC)曲线评估测试结果和预测能力。在GSE104645中总共筛选出2076个差异表达基因,其中822个基因在化疗耐药组中上调,1254个基因下调。下调的基因是敏感基因。对差异表达基因所参与的生物学过程进行基因本体(GO)分析,结果显示它们主要参与物质代谢的调节。最终模型构建共纳入39个基因。这是一个具有两个隐藏层的神经网络模型。预测训练样本和测试样本的准确率分别为75.7%和76.5%,ROC曲线下面积为0.875。将本科室的芯片数据集(GSE69657)设为测试集时,ROC曲线下面积为0.778。在本研究中,成功构建了一种基于芯片预测一线FOLFOX方案对转移性结直肠癌疗效的人工神经网络模型,并进行了独立的外部验证。该模型具有良好的稳定性和预测效率。此外,本研究结果表明与奥沙利铂耐药相关的基因功能主要富集在物质代谢的调节过程中。