Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, P.R. China.
Department of Computing Sciences, The College at Brockport, State University of New York, Brockport, NY 14422, USA.
Bioinformatics. 2022 Sep 30;38(19):4546-4553. doi: 10.1093/bioinformatics/btac574.
Due to cancer heterogeneity, the therapeutic effect may not be the same when a cohort of patients of the same cancer type receive the same treatment. The anticancer drug response prediction may help develop personalized therapy regimens to increase survival and reduce patients' expenses. Recently, graph neural network-based methods have aroused widespread interest and achieved impressive results on the drug response prediction task. However, most of them apply graph convolution to process cell line-drug bipartite graphs while ignoring the intrinsic differences between cell lines and drug nodes. Moreover, most of these methods aggregate node-wise neighbor features but fail to consider the element-wise interaction between cell lines and drugs.
This work proposes a neighborhood interaction (NI)-based heterogeneous graph convolution network method, namely NIHGCN, for anticancer drug response prediction in an end-to-end way. Firstly, it constructs a heterogeneous network consisting of drugs, cell lines and the known drug response information. Cell line gene expression and drug molecular fingerprints are linearly transformed and input as node attributes into an interaction model. The interaction module consists of a parallel graph convolution network layer and a NI layer, which aggregates node-level features from their neighbors through graph convolution operation and considers the element-level of interactions with their neighbors in the NI layer. Finally, the drug response predictions are made by calculating the linear correlation coefficients of feature representations of cell lines and drugs. We have conducted extensive experiments to assess the effectiveness of our model on Cancer Drug Sensitivity Data (GDSC) and Cancer Cell Line Encyclopedia (CCLE) datasets. It has achieved the best performance compared with the state-of-the-art algorithms, especially in predicting drug responses for new cell lines, new drugs and targeted drugs. Furthermore, our model that was well trained on the GDSC dataset can be successfully applied to predict samples of PDX and TCGA, which verified the transferability of our model from cell line in vitro to the datasets in vivo.
The source code can be obtained from https://github.com/weiba/NIHGCN.
Supplementary data are available at Bioinformatics online.
由于癌症异质性,同一癌症类型的患者接受相同治疗时,疗效可能并不相同。抗癌药物反应预测可能有助于制定个性化治疗方案,以提高生存率并降低患者费用。最近,基于图神经网络的方法在药物反应预测任务中引起了广泛关注,并取得了令人印象深刻的结果。然而,它们大多数应用图卷积处理细胞系-药物二分图,而忽略了细胞系和药物节点之间的内在差异。此外,它们大多数聚合节点级邻居特征,但未能考虑细胞系和药物之间的元素级交互。
本研究提出了一种基于邻域交互(NI)的异质图卷积网络方法,即 NIHGCN,用于端到端的抗癌药物反应预测。首先,它构建了一个由药物、细胞系和已知药物反应信息组成的异质网络。细胞系基因表达和药物分子指纹通过线性变换并作为节点属性输入到交互模型中。交互模块由一个并行图卷积网络层和一个 NI 层组成,通过图卷积操作从邻居节点聚合节点级特征,并在 NI 层中考虑与邻居的元素级交互。最后,通过计算细胞系和药物特征表示的线性相关系数来进行药物反应预测。我们在 Cancer Drug Sensitivity Data (GDSC) 和 Cancer Cell Line Encyclopedia (CCLE) 数据集上进行了广泛的实验,以评估我们模型的有效性。与最先进的算法相比,它的性能最佳,特别是在预测新细胞系、新药物和靶向药物的药物反应方面。此外,我们在 GDSC 数据集上训练良好的模型可以成功应用于预测 PDX 和 TCGA 的样本,这验证了我们的模型从体外细胞系到体内数据集的可转移性。
源代码可从 https://github.com/weiba/NIHGCN 获得。
补充数据可在 Bioinformatics 在线获取。