Department of Computer Science, Hunter College, The City University of New York, New York, New York 10065, United States.
Department of Computer Science, University of Northern Iowa, Cedar Falls, Iowa 50614, United States.
Sci Rep. 2016 Dec 13;6:38860. doi: 10.1038/srep38860.
Conventional one-drug-one-gene approach has been of limited success in modern drug discovery. Polypharmacology, which focuses on searching for multi-targeted drugs to perturb disease-causing networks instead of designing selective ligands to target individual proteins, has emerged as a new drug discovery paradigm. Although many methods for single-target virtual screening have been developed to improve the efficiency of drug discovery, few of these algorithms are designed for polypharmacology. Here, we present a novel theoretical framework and a corresponding algorithm for genome-scale multi-target virtual screening based on the one-class collaborative filtering technique. Our method overcomes the sparseness of the protein-chemical interaction data by means of interaction matrix weighting and dual regularization from both chemicals and proteins. While the statistical foundation behind our method is general enough to encompass genome-wide drug off-target prediction, the program is specifically tailored to find protein targets for new chemicals with little to no available interaction data. We extensively evaluate our method using a number of the most widely accepted gene-specific and cross-gene family benchmarks and demonstrate that our method outperforms other state-of-the-art algorithms for predicting the interaction of new chemicals with multiple proteins. Thus, the proposed algorithm may provide a powerful tool for multi-target drug design.
传统的“一个药物对应一个靶点”的方法在现代药物发现中取得的成功有限。多药理学专注于寻找多靶点药物来干扰致病网络,而不是设计选择性配体来针对单个蛋白质,它已成为一种新的药物发现范例。尽管已经开发了许多用于单靶点虚拟筛选的方法来提高药物发现的效率,但这些算法中很少有针对多药理学的。在这里,我们提出了一种基于一类协同过滤技术的基因组规模多靶点虚拟筛选的新理论框架和相应算法。我们的方法通过化学物质和蛋白质的交互矩阵加权和双重正则化克服了蛋白质-化学相互作用数据的稀疏性。虽然我们方法背后的统计学基础足够广泛,可以涵盖全基因组范围内的药物脱靶预测,但该程序专门用于为新的化学物质找到蛋白质靶点,这些化学物质几乎没有可用的相互作用数据。我们使用许多最广泛接受的基因特异性和跨基因家族基准广泛评估了我们的方法,并证明我们的方法在预测新化学物质与多个蛋白质的相互作用方面优于其他最先进的算法。因此,所提出的算法可能为多靶点药物设计提供了一种强大的工具。