Shin Bonggun, Park Sungsoo, Hong Ji Hyung, An Ho Jung, Chun Sang Hoon, Kang Kilsoo, Ahn Young-Ho, Ko Yoon Ho, Kang Keunsoo
Department of Computer Science, Emory University, Atlanta, GA, United States.
Deargen, Inc., Daejeon, South Korea.
Front Genet. 2019 Jul 19;10:662. doi: 10.3389/fgene.2019.00662. eCollection 2019.
Artificial neural network-based analysis has recently been used to predict clinical outcomes in patients with solid cancers, including lung cancer. However, the majority of algorithms were not originally developed to identify genes associated with patients' prognoses. To address this issue, we developed a novel prognosis-related feature selection framework called Cascaded Wx (CWx). The CWx framework ranks features according to the survival of a given cohort by training neural networks with three different high- and low-risk groups in a cascaded fashion. We showed that this approach accurately identified features that best identify the patients' prognoses, compared to other feature selection algorithms, including the Cox proportional hazards and Coxnet models, when applied to The Cancer Genome Atlas lung adenocarcinoma (LUAD) transcriptome data. The prognostic potential of the top 100 genes identified by CWx outperformed or was comparable to those identified by the other methods as assessed by the concordance index (-index). In addition, the top 100 genes identified by CWx were found to be associated with the Wnt signaling pathway, providing biologically relevant evidence for the value of these genes in predicting the prognosis of patients with LUAD. Further analyses of other cancer types showed that the genes identified by CWx had the highest prognostic values according to the -index. Collectively, the CWx framework will potentially be of great use to prognosis-related biomarker discoveries in a variety of diseases.
基于人工神经网络的分析方法最近已被用于预测实体癌患者的临床结局,包括肺癌。然而,大多数算法最初并非用于识别与患者预后相关的基因。为了解决这一问题,我们开发了一种名为级联Wx(CWx)的新型预后相关特征选择框架。CWx框架通过以级联方式训练具有三个不同高风险和低风险组的神经网络,根据给定队列的生存率对特征进行排名。我们表明,当应用于癌症基因组图谱肺腺癌(LUAD)转录组数据时,与其他特征选择算法(包括Cox比例风险模型和Coxnet模型)相比,这种方法能够准确识别最能确定患者预后的特征。通过一致性指数(-指数)评估,CWx识别出的前100个基因的预后潜力优于或与其他方法识别出的基因相当。此外,发现CWx识别出的前100个基因与Wnt信号通路相关,为这些基因在预测LUAD患者预后方面的价值提供了生物学相关证据。对其他癌症类型的进一步分析表明,根据-指数,CWx识别出的基因具有最高的预后价值。总体而言,CWx框架可能对多种疾病中与预后相关的生物标志物发现有很大帮助。