DLM-DTI: a dual language model for the prediction of drug-target interaction with hint-based learning.

Author information

Lee Jonghyun, Jun Dae Won, Song Ildae, Kim Yun

Affiliations

Department of Medical and Digital Engineering, Hanyang University College of Engineering, 222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea.

Department of Internal Medicine, Hanyang University College of Medicine, 222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea.

Publication information

J Cheminform. 2024 Feb 1;16(1):14. doi: 10.1186/s13321-024-00808-1.

DOI:10.1186/s13321-024-00808-1

PMID:38297330

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10832108/

Abstract

The drug discovery process is demanding and time-consuming, and machine learning-based approaches are increasingly proposed to make it more efficient. A significant challenge in this field is predicting whether a drug molecule will interact with a target protein. A recent study addressed this challenge with an encoder that leverages prior knowledge of molecular and protein structures, notably improving prediction performance on the drug-target interaction (DTI) task. However, the computational complexity of the target encoders used in previous studies grows quadratically with input length, which limits their practical utility. To overcome this limitation, we adopt a hint-based learning strategy to develop a compact and efficient target encoder. Through an adaptation parameter, our model blends general knowledge and target-oriented knowledge to build features of protein sequences. This approach yielded considerable gains in performance and learning efficiency on three benchmark datasets: BIOSNAP, DAVIS, and BindingDB. Moreover, our method requires only 7.7 GB of video RAM (VRAM) during training (16.24% of that required by the previous state-of-the-art model), which makes training and inference feasible even with constrained computational resources.
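The abstract does not give the formula behind the adaptation parameter, so the following is only a minimal sketch of one common way to blend two feature sources: a convex combination whose weight comes from a sigmoid-squashed scalar. The names `blend_features`, `general`, `target_oriented`, and `raw_lambda` are hypothetical, and the sigmoid weighting is an assumption rather than the authors' confirmed method.

```python
import math


def sigmoid(x: float) -> float:
    """Squash an unconstrained value into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def blend_features(general, target_oriented, raw_lambda):
    """Convex combination of two equal-length feature vectors.

    `general` stands in for features carrying general protein knowledge
    (the "hint"), `target_oriented` for features from a compact encoder
    trained on the task. `raw_lambda` is a hypothetical unconstrained
    scalar; in a trainable model it would be a learnable parameter.
    """
    if len(general) != len(target_oriented):
        raise ValueError("feature vectors must have the same length")
    weight = sigmoid(raw_lambda)  # share given to the general features
    return [weight * g + (1.0 - weight) * t
            for g, t in zip(general, target_oriented)]


if __name__ == "__main__":
    general = [1.0, 0.0, 2.0]
    target_oriented = [0.0, 1.0, 4.0]
    # raw_lambda = 0 gives sigmoid(0) = 0.5, i.e. an equal mix.
    print(blend_features(general, target_oriented, 0.0))  # [0.5, 0.5, 3.0]
```

Under this assumption, a large positive `raw_lambda` pushes the result toward the general features and a large negative one toward the target-oriented features, so a single scalar controls the balance between the two knowledge sources.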

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c245/10832108/4696cacff2a3/13321_2024_808_Fig1_HTML.jpg
