Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, State University of New York, Buffalo, NY 14203, USA.
Molecules. 2019 Jan 4;24(1):167. doi: 10.3390/molecules24010167.
Drug repurposing is a valuable tool for combating the slowing rates of novel therapeutic discovery. The Computational Analysis of Novel Drug Opportunities (CANDO) platform performs shotgun repurposing of 2030 indications/diseases using 3733 drugs/compounds to predict interactions with 46,784 proteins and relating them via proteomic interaction signatures. The accuracy is calculated by comparing interaction similarities of drugs approved for the same indications. We performed a unique subset analysis by breaking down the full protein library into smaller subsets and then recombining the best performing subsets into larger supersets. Up to 14% improvement in accuracy is seen upon benchmarking the supersets, representing a 100⁻1000-fold reduction in the number of proteins considered relative to the full library. Further analysis revealed that libraries comprised of proteins with more equitably diverse ligand interactions are important for describing compound behavior. Using one of these libraries to generate putative drug candidates against malaria, tuberculosis, and large cell carcinoma results in more drugs that could be validated in the biomedical literature compared to using those suggested by the full protein library. Our work elucidates the role of particular protein subsets and corresponding ligand interactions that play a role in drug repurposing, with implications for drug design and machine learning approaches to improve the CANDO platform.
药物再利用是应对新型治疗方法发现速度放缓的一种有价值的工具。计算新型药物机会分析(CANDO)平台使用 3733 种药物/化合物对 2030 种适应症/疾病进行了大规模的重新定位,以预测与 46784 种蛋白质的相互作用,并通过蛋白质组相互作用特征将其联系起来。通过比较相同适应症批准的药物的相互作用相似性来计算准确性。我们通过将完整的蛋白质库分解成更小的子集,然后将表现最佳的子集重新组合成更大的超集来进行独特的子集分析。在基准测试超集时,准确性提高了高达 14%,这代表与完整库相比,考虑的蛋白质数量减少了 100-1000 倍。进一步的分析表明,由具有更均衡多样配体相互作用的蛋白质组成的库对于描述化合物行为很重要。使用其中一个库来生成针对疟疾、结核病和大细胞癌的潜在药物候选物,与使用完整蛋白质库建议的药物相比,在生物医学文献中可以验证更多的药物。我们的工作阐明了在药物再利用中起作用的特定蛋白质子集和相应的配体相互作用的作用,这对药物设计和机器学习方法改进 CANDO 平台具有重要意义。