Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, Jilin, China.
Zhuhai Laboratory of the Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education, Zhuhai College of Science and Technology, Zhuhai 519041, China.
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae238.
Identifying drug-target interactions (DTIs) holds significant importance in drug discovery and development, playing a crucial role in various areas such as virtual screening, drug repurposing and identification of potential drug side effects. However, existing methods commonly exploit only a single type of feature from drugs and targets, suffering from miscellaneous challenges such as high sparsity and cold-start problems. We propose a novel framework called MSI-DTI (Multi-Source Information-based Drug-Target Interaction Prediction) to enhance prediction performance, which obtains feature representations from different views by integrating biometric features and knowledge graph representations from multi-source information. Our approach involves constructing a Drug-Target Knowledge Graph (DTKG), obtaining multiple feature representations from diverse information sources for SMILES sequences and amino acid sequences, incorporating network features from DTKG and performing an effective multi-source information fusion. Subsequently, we employ a multi-head self-attention mechanism coupled with residual connections to capture higher-order interaction information between sparse features while preserving lower-order information. Experimental results on DTKG and two benchmark datasets demonstrate that our MSI-DTI outperforms several state-of-the-art DTIs prediction methods, yielding more accurate and robust predictions. The source codes and datasets are publicly accessible at https://github.com/KEAML-JLU/MSI-DTI.
识别药物-靶点相互作用(DTIs)在药物发现和开发中具有重要意义,在虚拟筛选、药物再利用和识别潜在药物副作用等各个领域发挥着关键作用。然而,现有的方法通常仅利用药物和靶点的单一类型的特征,存在着各种挑战,如高度稀疏和冷启动问题。我们提出了一种名为 MSI-DTI(基于多源信息的药物-靶点相互作用预测)的新框架,通过整合来自多源信息的生物特征和知识图谱表示,从不同的角度获取特征表示,以提高预测性能。我们的方法涉及构建药物-靶点知识图谱(DTKG),从不同的信息源获取 SMILES 序列和氨基酸序列的多个特征表示,整合来自 DTKG 的网络特征,并进行有效的多源信息融合。然后,我们采用多头自注意力机制结合残差连接,在保留低阶信息的同时,捕获稀疏特征之间的高阶交互信息。在 DTKG 和两个基准数据集上的实验结果表明,我们的 MSI-DTI 优于几种最先进的 DTIs 预测方法,产生更准确和稳健的预测。源代码和数据集可在 https://github.com/KEAML-JLU/MSI-DTI 上公开获取。