Department of Computer Science, Yonsei University, Seoul, South Korea.
Department of Computer Science & Engineering, Incheon National University, Incheon, South Korea.
Sci Rep. 2020 Feb 5;10(1):1861. doi: 10.1038/s41598-020-58821-x.
Cancer is one of the most difficult diseases to treat owing to the drug resistance of tumour cells. Recent studies have revealed that drug responses are closely associated with genomic alterations in cancer cells. Numerous state-of-the-art machine learning models have been developed for prediction of drug responses using various genomic data and diverse drug molecular information, but those methods are ineffective to predict drug response to untrained drugs and gene expression patterns, which is known as the cold-start problem. In this study, we present a novel deep neural network model, termed RefDNN, for improved prediction of drug resistance and identification of biomarkers related to drug response. RefDNN exploits a collection of drugs, called reference drugs, to learn representations for a high-dimensional gene expression vector and a molecular structure vector of a drug and predicts drug response labels using the reference drug-based representations. These calculations come from the observation that similar chemicals have similar effects. The proposed model not only outperformed existing computational prediction models in most comparative experiments, but also showed more robust prediction for untrained drugs and cancer types than traditional machine learning models. RefDNN exploits the ElasticNet regularization to deal with high-dimensional gene expression data, which allows identification of gene markers associated with drug resistance. Lastly, we described an application of RefDNN in exploring a new candidate drug for liver cancer. As the proposed model can guarantee good prediction of drug responses to untrained drugs for given gene expression patterns, it may be of potential benefit in drug repositioning and personalized medicine.
由于肿瘤细胞的耐药性,癌症是最难治疗的疾病之一。最近的研究表明,药物反应与癌细胞中的基因组改变密切相关。已经开发了许多最先进的机器学习模型,用于使用各种基因组数据和不同的药物分子信息预测药物反应,但这些方法对于预测未经训练的药物和基因表达模式的药物反应是无效的,这被称为冷启动问题。在这项研究中,我们提出了一种新的深度神经网络模型,称为 RefDNN,用于改善药物耐药性的预测和识别与药物反应相关的生物标志物。RefDNN 利用一组药物(称为参考药物)来学习高维基因表达向量和药物分子结构向量的表示,并使用基于参考药物的表示来预测药物反应标签。这些计算来自于这样一种观察,即类似的化学物质具有相似的作用。所提出的模型不仅在大多数比较实验中优于现有的计算预测模型,而且与传统的机器学习模型相比,对未经训练的药物和癌症类型表现出更稳健的预测。RefDNN 利用 ElasticNet 正则化来处理高维基因表达数据,这允许识别与耐药性相关的基因标记。最后,我们描述了 RefDNN 在探索肝癌新候选药物中的应用。由于所提出的模型可以保证对给定基因表达模式的未经训练的药物的药物反应的良好预测,因此它可能在药物重定位和个性化医疗方面具有潜在的益处。