Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (Fraunhofer SCAI), 53757, Sankt Augustin, Germany.
Bonn-Aachen International Center for IT, Rheinische Friedrich-Wilhelms-Universität Bonn, 53117, Bonn, Germany.
BMC Bioinformatics. 2020 Jun 5;21(1):231. doi: 10.1186/s12859-020-03568-5.
During the last decade, there has been a surge towards computational drug repositioning owing to constantly increasing -omics data in the biomedical research field. While numerous existing methods focus on the integration of heterogeneous data to propose candidate drugs, it is still challenging to substantiate their results with mechanistic insights of these candidate drugs. Therefore, there is a need for more innovative and efficient methods which can enable better integration of data and knowledge for drug repositioning.
Here, we present a customizable workflow (PS4DR) which not only integrates high-throughput data such as genome-wide association study (GWAS) data and gene expression signatures from disease and drug perturbations but also takes pathway knowledge into consideration to predict drug candidates for repositioning. We have collected and integrated publicly available GWAS data and gene expression signatures for several diseases and hundreds of FDA-approved drugs or those under clinical trial in this study. Additionally, different pathway databases were used for mechanistic knowledge integration in the workflow. Using this systematic consolidation of data and knowledge, the workflow computes pathway signatures that assist in the prediction of new indications for approved and investigational drugs.
We showcase PS4DR with applications demonstrating how this tool can be used for repositioning and identifying new drugs as well as proposing drugs that can simulate disease dysregulations. We were able to validate our workflow by demonstrating its capability to predict FDA-approved drugs for their known indications for several diseases. Further, PS4DR returned many potential drug candidates for repositioning that were backed up by epidemiological evidence extracted from scientific literature. Source code is freely available at https://github.com/ps4dr/ps4dr.
由于生物医学研究领域中不断增加的组学数据,过去十年中,计算药物再定位的研究呈指数级增长。虽然许多现有的方法侧重于整合异构数据来提出候选药物,但仍然难以用这些候选药物的机制见解来证实其结果。因此,需要更具创新性和高效的方法来更好地整合数据和知识以进行药物再定位。
在这里,我们提出了一个可定制的工作流程(PS4DR),它不仅可以整合高通量数据,如全基因组关联研究(GWAS)数据和疾病及药物扰动的基因表达特征,还可以考虑途径知识来预测候选药物的再定位。在这项研究中,我们收集并整合了几种疾病的公开 GWAS 数据和基因表达特征以及数百种已批准的 FDA 药物或临床试验中的药物。此外,不同的途径数据库用于工作流程中的机制知识整合。通过这种数据和知识的系统整合,该工作流程计算途径特征,以辅助预测已批准和试验中的药物的新适应症。
我们通过应用示例展示了 PS4DR,展示了如何使用该工具进行药物再定位和鉴定新药,并提出可以模拟疾病失调的药物。我们通过证明其能够预测 FDA 批准的药物在几种疾病中的已知适应症,验证了我们的工作流程的能力。此外,PS4DR 还返回了许多有潜力的药物再定位候选药物,这些候选药物得到了从科学文献中提取的流行病学证据的支持。源代码可在 https://github.com/ps4dr/ps4dr 上免费获取。