Department of Pharmacological Sciences; Mount Sinai Center for Bioinformatics; Big Data to Knowledge, Library of Integrated Network-Based Cellular Signatures, Data Coordination and Integration Center (BD2K-LINCS DCIC); Knowledge Management Center for Illuminating the Druggable Genome (KMC-IDG); Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Place, Box 1603, New York, NY 10029, USA.
Database (Oxford). 2021 Mar 31;2021. doi: 10.1093/database/baab017.
Understanding the underlying molecular and structural similarities between seemingly heterogeneous sets of drugs can aid in identifying drug repurposing opportunities and assist in the discovery of novel properties of preclinical small molecules. A wealth of information about drug and small molecule structure, targets, indications and side effects; induced gene expression signatures; and other attributes are publicly available through web-based tools, databases and repositories. By processing, abstracting and aggregating information from these resources into drug set libraries, knowledge about novel properties of drugs and small molecules can be systematically imputed with machine learning. In addition, drug set libraries can be used as the underlying database for drug set enrichment analysis. Here, we present Drugmonizome, a database with a search engine for querying annotated sets of drugs and small molecules for performing drug set enrichment analysis. Utilizing the data within Drugmonizome, we also developed Drugmonizome-ML. Drugmonizome-ML enables users to construct customized machine learning pipelines using the drug set libraries from Drugmonizome. To demonstrate the utility of Drugmonizome, drug sets from 12 independent SARS-CoV-2 in vitro screens were subjected to consensus enrichment analysis. Despite the low overlap among these 12 independent in vitro screens, we identified common biological processes critical for blocking viral replication. To demonstrate Drugmonizome-ML, we constructed a machine learning pipeline to predict whether approved and preclinical drugs may induce peripheral neuropathy as a potential side effect. Overall, the Drugmonizome and Drugmonizome-ML resources provide rich and diverse knowledge about drugs and small molecules for direct systems pharmacology applications. Database URL: https://maayanlab.cloud/drugmonizome/.
理解看似不同的药物集合之间潜在的分子和结构相似性,可以帮助识别药物再利用的机会,并有助于发现临床前小分子的新特性。通过基于网络的工具、数据库和存储库,大量关于药物和小分子结构、靶点、适应症和副作用、诱导的基因表达特征以及其他属性的信息是公开可用的。通过对这些资源中的信息进行处理、抽象和汇总,将药物集库中的药物和小分子的新特性知识可以通过机器学习系统地推断出来。此外,药物集库可作为药物集富集分析的基础数据库。在这里,我们介绍了 Drugmonizome,这是一个带有搜索引擎的数据库,用于查询注释的药物和小分子集合,以进行药物集富集分析。利用 Drugmonizome 中的数据,我们还开发了 Drugmonizome-ML。Drugmonizome-ML 允许用户使用来自 Drugmonizome 的药物集库构建定制的机器学习管道。为了展示 Drugmonizome 的实用性,我们对来自 12 个独立的 SARS-CoV-2 体外筛选的药物集进行了共识富集分析。尽管这 12 个独立的体外筛选重叠较少,但我们确定了对阻止病毒复制至关重要的共同生物学过程。为了展示 Drugmonizome-ML,我们构建了一个机器学习管道来预测已批准和临床前药物是否可能引起周围神经病变作为潜在的副作用。总体而言,Drugmonizome 和 Drugmonizome-ML 资源为直接的系统药理学应用提供了丰富多样的药物和小分子知识。数据库网址:https://maayanlab.cloud/drugmonizome/。