Gu Yaowen, Zheng Si, Zhang Bowen, Kang Hongyu, Jiang Rui, Li Jiao
Institute of Medical Information, Chinese Academy of Medical Sciences and Peking Union Medical College (CAMS&PUMC), Beijing, 100020, China; Department of Chemistry, New York University, NY, 10027, USA.
Institute of Medical Information, Chinese Academy of Medical Sciences and Peking Union Medical College (CAMS&PUMC), Beijing, 100020, China; Institute for Artificial Intelligence, Department of Computer Science and Technology, BNRist, Tsinghua University, Beijing, 100084, China.
Comput Biol Med. 2025 Jan;184:109403. doi: 10.1016/j.compbiomed.2024.109403. Epub 2024 Nov 21.
Drug repositioning offers promising prospects for accelerating drug discovery by identifying potential drug-disease associations (DDAs) for existing drugs and diseases. Previous methods have generated meta-path-augmented node or graph embeddings for DDA prediction in drug-disease heterogeneous networks. However, these approaches rarely develop end-to-end frameworks for path instance-level representation learning as well as the further feature selection and aggregation. By leveraging the abundant topological information in path instances, more fine-grained and interpretable predictions can be achieved. To this end, we introduce deep multiple instance learning into drug repositioning by proposing a novel method called MilGNet. MilGNet employs a heterogeneous graph neural network (HGNN)-based encoder to learn drug and disease node embeddings. Treating each drug-disease pair as a bag, we designed a special quadruplet meta-path form and implemented a pseudo meta-path generator in MilGNet to obtain multiple meta-path instances based on network topology. Additionally, a bidirectional instance encoder enhances the representation of meta-path instances. Finally, MilGNet utilizes a multi-scale interpretable predictor to aggregate bag embeddings with an attention mechanism, providing predictions at both the bag and instance levels for accurate and explainable predictions. Comprehensive experiments on five benchmarks demonstrate that MilGNet significantly outperforms ten advanced methods. Notably, three case studies on one drug (Methotrexate) and two diseases (Renal Failure and Mismatch Repair Cancer Syndrome) highlight MilGNet's potential for discovering new indications, therapies, and generating rational meta-path instances to investigate possible treatment mechanisms. The source code is available at https://github.com/gu-yaowen/MilGNet.
药物重新定位通过识别现有药物和疾病之间潜在的药物-疾病关联(DDA),为加速药物发现提供了广阔前景。先前的方法已经生成了元路径增强的节点或图嵌入,用于药物-疾病异质网络中的DDA预测。然而,这些方法很少开发用于路径实例级表示学习以及进一步的特征选择和聚合的端到端框架。通过利用路径实例中丰富的拓扑信息,可以实现更细粒度和可解释的预测。为此,我们通过提出一种名为MilGNet的新方法,将深度多实例学习引入药物重新定位。MilGNet采用基于异质图神经网络(HGNN)的编码器来学习药物和疾病节点嵌入。将每个药物-疾病对视为一个包,我们设计了一种特殊的四元组元路径形式,并在MilGNet中实现了一个伪元路径生成器,以基于网络拓扑获得多个元路径实例。此外,双向实例编码器增强了元路径实例 的表示。最后,MilGNet利用多尺度可解释预测器通过注意力机制聚合包嵌入,在包和实例级别都提供预测,以实现准确和可解释的预测。在五个基准上进行的综合实验表明,MilGNet显著优于十种先进方法。值得注意的是,对一种药物(甲氨蝶呤)和两种疾病(肾衰竭和错配修复癌症综合征)的三个案例研究突出了MilGNet在发现新适应症、疗法以及生成合理的元路径实例以研究可能的治疗机制方面的潜力。源代码可在https://github.com/gu-yaowen/MilGNet获取。