药物重定位的生物医学数据和计算模型：全面综述。

Biomedical data and computational models for drug repositioning: a comprehensive review.

机构信息

School of Computer Science and Engineering at Central South University.

College of Engineering and the Department of Computer Science at University of Saskatchewan, Saskatoon, Canada.

出版信息

Brief Bioinform. 2021 Mar 22;22(2):1604-1619. doi: 10.1093/bib/bbz176.

DOI:10.1093/bib/bbz176

PMID:32043521

Abstract

Drug repositioning can drastically decrease the cost and duration taken by traditional drug research and development while avoiding the occurrence of unforeseen adverse events. With the rapid advancement of high-throughput technologies and the explosion of various biological data and medical data, computational drug repositioning methods have been appealing and powerful techniques to systematically identify potential drug-target interactions and drug-disease interactions. In this review, we first summarize the available biomedical data and public databases related to drugs, diseases and targets. Then, we discuss existing drug repositioning approaches and group them based on their underlying computational models consisting of classical machine learning, network propagation, matrix factorization and completion, and deep learning based models. We also comprehensively analyze common standard data sets and evaluation metrics used in drug repositioning, and give a brief comparison of various prediction methods on the gold standard data sets. Finally, we conclude our review with a brief discussion on challenges in computational drug repositioning, which includes the problem of reducing the noise and incompleteness of biomedical data, the ensemble of various computation drug repositioning methods, the importance of designing reliable negative samples selection methods, new techniques dealing with the data sparseness problem, the construction of large-scale and comprehensive benchmark data sets and the analysis and explanation of the underlying mechanisms of predicted interactions.

摘要

药物重定位可以大大降低传统药物研发的成本和时间，同时避免意外不良事件的发生。随着高通量技术的快速发展和各种生物医学数据和医疗数据的爆炸式增长，计算药物重定位方法已经成为一种吸引人且强大的技术，可以系统地识别潜在的药物-靶标相互作用和药物-疾病相互作用。在这篇综述中，我们首先总结了与药物、疾病和靶点相关的可用生物医学数据和公共数据库。然后，我们讨论了现有的药物重定位方法，并根据其基于经典机器学习、网络传播、矩阵分解和完成以及基于深度学习的模型的底层计算模型对其进行分组。我们还全面分析了药物重定位中常用的标准数据集和评估指标，并对黄金标准数据集上的各种预测方法进行了简要比较。最后，我们简要讨论了计算药物重定位中的挑战，包括减少生物医学数据的噪声和不完整性、各种计算药物重定位方法的集成、设计可靠的负样本选择方法的重要性、处理数据稀疏问题的新技术、构建大规模和全面的基准数据集以及分析和解释预测相互作用的潜在机制。