Huang Yixian, Huang Hsi-Yuan, Chen Yigang, Lin Yang-Chi-Dung, Yao Lantian, Lin Tianxiu, Leng Junlin, Chang Yuan, Zhang Yuntian, Zhu Zihao, Ma Kun, Cheng Yeong-Nan, Lee Tzong-Yi, Huang Hsien-Da
School of Medicine, The Chinese University of Hong Kong, Shenzhen, Longgang District, Shenzhen 518172, China.
Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, Longgang District, Shenzhen 518172, China.
Int J Mol Sci. 2023 Sep 14;24(18):14061. doi: 10.3390/ijms241814061.
Drug-target interactions (DTIs) are considered a crucial component of drug design and drug discovery. To date, many computational methods were developed for drug-target interactions, but they are insufficiently informative for accurately predicting DTIs due to the lack of experimentally verified negative datasets, inaccurate molecular feature representation, and ineffective DTI classifiers. Therefore, we address the limitations of randomly selecting negative DTI data from unknown drug-target pairs by establishing two experimentally validated datasets and propose a capsule network-based framework called CapBM-DTI to capture hierarchical relationships of drugs and targets, which adopts pre-trained bidirectional encoder representations from transformers (BERT) for contextual sequence feature extraction from target proteins through transfer learning and the message-passing neural network (MPNN) for the 2-D graph feature extraction of compounds to accurately and robustly identify drug-target interactions. We compared the performance of CapBM-DTI with state-of-the-art methods using four experimentally validated DTI datasets of different sizes, including human () and worm () species datasets, as well as three subsets (new compounds, new proteins, and new pairs). Our results demonstrate that the proposed model achieved robust performance and powerful generalization ability in all experiments. The case study on treating COVID-19 demonstrates the applicability of the model in virtual screening.
药物-靶点相互作用(DTIs)被认为是药物设计和药物发现的关键组成部分。迄今为止,已经开发了许多用于药物-靶点相互作用的计算方法,但由于缺乏经过实验验证的阴性数据集、不准确的分子特征表示和无效的DTI分类器,它们在准确预测DTIs方面的信息不足。因此,我们通过建立两个经过实验验证的数据集来解决从未知药物-靶点对中随机选择阴性DTI数据的局限性,并提出了一个基于胶囊网络的框架CapBM-DTI来捕捉药物和靶点的层次关系,该框架采用预训练的双向变换器表征(BERT)通过迁移学习从靶蛋白中提取上下文序列特征,并采用消息传递神经网络(MPNN)对化合物进行二维图特征提取,以准确、稳健地识别药物-靶点相互作用。我们使用四个不同大小的经过实验验证的DTI数据集(包括人类()和蠕虫()物种数据集)以及三个子集(新化合物、新蛋白质和新对),将CapBM-DTI的性能与现有方法进行了比较。我们的结果表明,所提出的模型在所有实验中都取得了稳健的性能和强大的泛化能力。治疗新冠肺炎的案例研究证明了该模型在虚拟筛选中的适用性。