Huang Xiaoqing, Huang Kun, Johnson Travis, Radovich Milan, Zhang Jie, Ma Jianzhu, Wang Yijie
Department of Biostatistics and Health Data Science, Indiana University School of Medicine, Indianapolis, IN 46202, USA.
Division of General Surgery, Department of Surgery, Indiana University School of Medicine, Indianapolis, IN 46202, USA.
NAR Genom Bioinform. 2021 Oct 27;3(4):lqab097. doi: 10.1093/nargab/lqab097. eCollection 2021 Dec.
Prediction of cancer-specific drug responses as well as identification of the corresponding drug-sensitive genes and pathways remains a major biological and clinical challenge. Deep learning models hold immense promise for better drug response predictions, but most of them cannot provide biological and clinical interpretability. Visible neural network (VNN) models have emerged to solve the problem by giving neurons biological meanings and directly casting biological networks into the models. However, the biological networks used in VNNs are often redundant and contain components that are irrelevant to the downstream predictions. Therefore, the VNNs using these redundant biological networks are overparameterized, which significantly limits VNNs' predictive and explanatory power. To overcome the problem, we treat the edges and nodes in biological networks used in VNNs as features and develop a sparse learning framework ParsVNN to learn parsimony VNNs with only edges and nodes that contribute the most to the prediction task. We applied ParsVNN to build cancer-specific VNN models to predict drug response for five different cancer types. We demonstrated that the parsimony VNNs built by ParsVNN are superior to other state-of-the-art methods in terms of prediction performance and identification of cancer driver genes. Furthermore, we found that the pathways selected by ParsVNN have great potential to predict clinical outcomes as well as recommend synergistic drug combinations.
预测癌症特异性药物反应以及识别相应的药物敏感基因和通路仍然是一项重大的生物学和临床挑战。深度学习模型在更好地预测药物反应方面具有巨大潜力,但其中大多数无法提供生物学和临床可解释性。可见神经网络(VNN)模型通过赋予神经元生物学意义并将生物网络直接融入模型来解决这一问题。然而,VNN中使用的生物网络往往是冗余的,并且包含与下游预测无关的成分。因此,使用这些冗余生物网络的VNN参数过多,这显著限制了VNN的预测和解释能力。为了克服这个问题,我们将VNN中使用的生物网络中的边和节点视为特征,并开发了一个稀疏学习框架ParsVNN,以学习仅包含对预测任务贡献最大的边和节点的简约VNN。我们应用ParsVNN构建癌症特异性VNN模型,以预测五种不同癌症类型的药物反应。我们证明,由ParsVNN构建的简约VNN在预测性能和识别癌症驱动基因方面优于其他现有方法。此外,我们发现ParsVNN选择的通路在预测临床结果以及推荐协同药物组合方面具有很大潜力。