Huck Institutes of Life Sciences, Pennsylvania State University, State College, PA 16801, USA.
Department of Computer Science, Northwestern University, Evanston, IL 60208, USA.
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad057. Epub 2023 Aug 21.
Computational drug repurposing is a cost- and time-efficient approach that aims to identify new therapeutic targets or diseases (indications) of existing drugs/compounds. It is especially critical for emerging and/or orphan diseases due to its cheaper investment and shorter research cycle compared with traditional wet-lab drug discovery approaches. However, the underlying mechanisms of action (MOAs) between repurposed drugs and their target diseases remain largely unknown, which is still a main obstacle for computational drug repurposing methods to be widely adopted in clinical settings.
In this work, we propose KGML-xDTD: a Knowledge Graph-based Machine Learning framework for explainably predicting Drugs Treating Diseases. It is a 2-module framework that not only predicts the treatment probabilities between drugs/compounds and diseases but also biologically explains them via knowledge graph (KG) path-based, testable MOAs. We leverage knowledge-and-publication-based information to extract biologically meaningful "demonstration paths" as the intermediate guidance in the Graph-based Reinforcement Learning (GRL) path-finding process. Comprehensive experiments and case study analyses show that the proposed framework can achieve state-of-the-art performance in both predictions of drug repurposing and recapitulation of human-curated drug MOA paths.
KGML-xDTD is the first model framework that can offer KG path explanations for drug repurposing predictions by leveraging the combination of prediction outcomes and existing biological knowledge and publications. We believe it can effectively reduce "black-box" concerns and increase prediction confidence for drug repurposing based on predicted path-based explanations and further accelerate the process of drug discovery for emerging diseases.
计算药物再利用是一种成本和时间效率高的方法,旨在确定现有药物/化合物的新治疗靶点或疾病(适应证)。由于与传统的湿实验室药物发现方法相比,它的投资成本更低,研究周期更短,因此对于新兴和/或孤儿疾病来说尤为重要。然而,再利用药物与其目标疾病之间的作用机制(MOA)在很大程度上仍然未知,这仍然是计算药物再利用方法在临床环境中广泛采用的主要障碍。
在这项工作中,我们提出了 KGML-xDTD:一种基于知识图的机器学习框架,用于可解释地预测药物治疗疾病。它是一个 2 个模块的框架,不仅可以预测药物/化合物与疾病之间的治疗概率,还可以通过基于知识图(KG)路径的、可测试的 MOA 对其进行生物学解释。我们利用知识和出版的信息来提取有生物学意义的“示范路径”,作为基于图的强化学习(GRL)路径查找过程中的中间指导。全面的实验和案例研究分析表明,所提出的框架可以在药物再利用的预测和人类编辑的药物 MOA 路径的再现方面达到最先进的性能。
KGML-xDTD 是第一个可以通过利用预测结果和现有生物学知识和出版物的组合为药物再利用预测提供 KG 路径解释的模型框架。我们相信,它可以有效地减少“黑盒”的担忧,并提高对基于预测路径的解释的药物再利用的预测信心,并进一步加速新兴疾病的药物发现过程。