Suppr超能文献

通过在半监督学习模型中整合已知疾病-基因和药物-靶点关联进行药物重新定位

Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model.

作者信息

Le Duc-Hau, Nguyen-Ngoc Doanh

机构信息

School of Computer Science and Engineering, Thuyloi University, 175 Tay Son, Dong Da, Hanoi, Vietnam.

Sorbonne Université, IRD, JEAI WARM, Unité de Modélisation Mathématiques et Informatique des Systèmes Complexes, UMMISCO, 93143, Bondy, France.

出版信息

Acta Biotheor. 2018 Dec;66(4):315-331. doi: 10.1007/s10441-018-9325-z. Epub 2018 Apr 26.

Abstract

Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.

摘要

计算药物重新定位已被证明是一种从现有药物中发现新用途的有前途且高效的策略。为实现这一目标,人们提出了许多计算方法,这些方法基于药物和疾病的不同数据源。这些方法使用基于机器学习或基于网络的模型来解决该问题,其假设是相似的药物可用于相似的疾病以识别药物的新适应症。因此,药物之间以及疾病之间的相似性通常用作输入。此外,这些方法还需要已知的药物 - 疾病关联作为先验信息。应当指出的是,由于许多已上市药物已被撤回,这些关联仍未完全确立,这可能会影响这些方法的结果。在本研究中,我们提出了一种名为RLSDR(用于药物重新定位的正则化最小二乘法)的新方法来寻找药物的新用途。更具体地说,它依赖于一种半监督学习模型,即正则化最小二乘法,因此它不需要像先前提出的基于机器学习的方法那样定义非药物 - 疾病关联。此外,通过药物化合物的化学结构测量的药物之间的相似性以及共享表型的疾病之间的相似性可以以相似性网络或相似性矩阵的形式表示,作为该方法的输入。此外,我们不是使用一组已知药物 - 疾病关联的金标准集,而是基于已知的疾病 - 基因和药物 - 靶点关联构建一组人工关联。实验结果表明,就接收器操作特征(ROC)曲线下面积(AUC)而言,RLSDR在人工药物 - 疾病关联集上的预测性能优于在金标准关联集上的预测性能。此外,无论药物 - 疾病关联的先验信息如何,它都优于两种代表性的基于网络的方法。还通过来自不同数据资源的证据识别并验证了多种药物的新适应症。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验