IEEE/ACM Trans Comput Biol Bioinform. 2022 Mar-Apr;19(2):710-717. doi: 10.1109/TCBB.2021.3096960. Epub 2022 Apr 1.
Previous studies have either learned drug's features from their string or numeric representations, which are not natural forms of drugs, or only used genomic data of cell lines for the drug response prediction problem. Here, we proposed a deep learning model, GraOmicDRP, to learn drug's features from their graph representation and integrate multiple -omic data of cell lines. In GraOmicDRP, drugs are represented as graphs of bindings among atoms; meanwhile, cell lines are depicted by not only genomic but also transcriptomic and epigenomic data. Graph convolutional and convolutional neural networks were used to learn the representation of drugs and cell lines, respectively. A combination of the two representations was then used to be representative of each pair of drug-cell line. Finally, the response value of each pair was predicted by a fully connected network. Experimental results indicate that transcriptomic data shows the best among single -omic data; meanwhile, the combinations of transcriptomic and other -omic data achieved the best performance overall in terms of both Root Mean Square Error and Pearson correlation coefficient. In addition, we also show that GraOmicDRP outperforms some state-of-the-art methods, including ones integrating -omic data with drug information such as GraphDRP, and ones using -omic data without drug information such as DeepDR and MOLI.
先前的研究要么从药物的字符串或数字表示中学习药物的特征,而这些表示并不是药物的自然形式,要么仅将细胞系的基因组数据用于药物反应预测问题。在这里,我们提出了一个深度学习模型 GraOmicDRP,用于从药物的图表示中学习药物的特征,并整合细胞系的多种组学数据。在 GraOmicDRP 中,药物被表示为原子之间结合的图;同时,细胞系不仅由基因组数据表示,还由转录组和表观基因组数据表示。图卷积和卷积神经网络分别用于学习药物和细胞系的表示。然后,将这两种表示的组合用于代表每对药物-细胞系。最后,通过全连接网络预测每对的响应值。实验结果表明,在单一组学数据中,转录组数据表现最佳;同时,转录组和其他组学数据的组合在均方根误差和皮尔逊相关系数方面的整体性能最佳。此外,我们还表明 GraOmicDRP 优于一些最先进的方法,包括将组学数据与药物信息(如 GraphDRP)集成的方法,以及使用组学数据而不使用药物信息(如 DeepDR 和 MOLI)的方法。