Tuerkova Alzbeta, Zdrazil Barbara
Department of Pharmaceutical Chemistry, Division of Drug Design and Medicinal Chemistry, University of Vienna, Althanstraße 14, 1090 Vienna, Austria.
J Cheminform. 2020 Nov 25;12:71. doi: 10.1186/s13321-020-00474-z. eCollection 2020.
Biomedical information mining is increasingly recognized as a promising technique to accelerate drug discovery and development. Especially, integrative approaches which mine data from several (open) data sources have become more attractive with the increasing possibilities to programmatically access data through Application Programming Interfaces (APIs). The use of open data in conjunction with free, platform-independent analytic tools provides the additional advantage of flexibility, re-usability, and transparency. Here, we present a strategy for performing ligand-based in silico drug repurposing with the analytics platform KNIME. We demonstrate the usefulness of the developed workflow on the basis of two different use cases: a rare disease (here: Glucose Transporter Type 1 (GLUT-1) deficiency), and a new disease (here: COVID 19). The workflow includes a targeted download of data through web services, data curation, detection of enriched structural patterns, as well as substructure searches in DrugBank and a recently deposited data set of antiviral drugs provided by Chemical Abstracts Service. Developed workflows, tutorials with detailed step-by-step instructions, and the information gained by the analysis of data for GLUT-1 deficiency syndrome and COVID-19 are made freely available to the scientific community. The provided framework can be reused by researchers for other in silico drug repurposing projects, and it should serve as a valuable teaching resource for conveying integrative data mining strategies.
生物医学信息挖掘日益被视为一种加速药物发现和开发的有前景的技术。特别是,从多个(开放)数据源挖掘数据的整合方法,随着通过应用程序编程接口(API)以编程方式访问数据的可能性不断增加,变得更具吸引力。将开放数据与免费的、与平台无关的分析工具结合使用,具有灵活性、可重复使用性和透明度等额外优势。在这里,我们展示了一种使用分析平台KNIME进行基于配体的计算机辅助药物重新利用的策略。我们基于两个不同的用例证明了所开发工作流程的实用性:一种罕见疾病(这里指1型葡萄糖转运体(GLUT-1)缺乏症)和一种新出现的疾病(这里指2019冠状病毒病)。该工作流程包括通过网络服务有针对性地下载数据、数据整理、检测富集的结构模式,以及在DrugBank和化学文摘社最近存入的抗病毒药物数据集中进行子结构搜索。所开发的工作流程、带有详细分步说明的教程,以及通过对GLUT-1缺乏综合征和2019冠状病毒病数据的分析所获得的信息,都将免费提供给科学界。所提供的框架可供研究人员用于其他计算机辅助药物重新利用项目,并且它应该作为传达整合数据挖掘策略的宝贵教学资源。