Hsieh Meng-Hsuen, Sun Li-Min, Lin Cheng-Li, Hsieh Meng-Ju, Sun Kyle, Hsu Chung-Y, Chou An-Kuo, Kao Chia-Hung
Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720, USA.
Department of Radiation Oncology, Zuoying Branch of Kaohsiung Armed Forces General Hospital, Kaohsiung 81342, Taiwan.
J Clin Med. 2018 Sep 12;7(9):277. doi: 10.3390/jcm7090277.
Observational studies suggested that patients with type 2 diabetes mellitus (T2DM) presented a higher risk of developing colorectal cancer (CRC). The current study aims to create a deep neural network (DNN) to predict the onset of CRC for patients with T2DM.
We employed the national health insurance database of Taiwan to create predictive models for detecting an increased risk of subsequent CRC development in T2DM patients in Taiwan. We identified a total of 1,349,640 patients between 2000 and 2012 with newly diagnosed T2DM. All the available possible risk factors for CRC were also included in the analyses. The data were split into training and test sets with 97.5% of the patients in the training set and 2.5% of the patients in the test set. The deep neural network (DNN) model was optimized using Adam with Nesterov's accelerated gradient descent. The recall, precision, F₁ values, and the area under the receiver operating characteristic (ROC) curve were used to evaluate predictor performance.
The F₁, precision, and recall values of the DNN model across all data were 0.931, 0.982, and 0.889, respectively. The area under the ROC curve of the DNN model across all data was 0.738, compared to the ideal value of 1. The metrics indicate that the DNN model appropriately predicted CRC. In contrast, a single variable predictor using adapted the Diabetes Complication Severity Index showed poorer performance compared to the DNN model.
Our results indicated that the DNN model is an appropriate tool to predict CRC risk in patients with T2DM in Taiwan.
观察性研究表明,2型糖尿病(T2DM)患者患结直肠癌(CRC)的风险更高。本研究旨在创建一个深度神经网络(DNN)来预测T2DM患者患CRC的风险。
我们利用台湾国民健康保险数据库,为检测台湾T2DM患者后续发生CRC风险增加创建预测模型。我们确定了2000年至2012年间1349640例新诊断为T2DM的患者。分析中还纳入了所有可能的CRC风险因素。数据被分为训练集和测试集,训练集包含97.5%的患者,测试集包含2.5%的患者。深度神经网络(DNN)模型使用带有Nesterov加速梯度下降的Adam进行优化。召回率、精确率、F₁值和受试者工作特征(ROC)曲线下面积用于评估预测指标的性能。
DNN模型在所有数据上的F₁值、精确率和召回率分别为0.931、0.982和0.889。DNN模型在所有数据上的ROC曲线下面积为0.738,而理想值为1。这些指标表明DNN模型能适当地预测CRC。相比之下,使用改良糖尿病并发症严重程度指数的单变量预测指标与DNN模型相比性能较差。
我们的结果表明,DNN模型是预测台湾T2DM患者CRC风险的合适工具。