Suppr超能文献

GramSeq-DTA:一种融合基因表达信息的基于语法的药物-靶点亲和力预测方法。

GramSeq-DTA: A Grammar-Based Drug-Target Affinity Prediction Approach Fusing Gene Expression Information.

作者信息

Debnath Kusal, Rana Pratip, Ghosh Preetam

机构信息

Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA.

Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA.

出版信息

Biomolecules. 2025 Mar 12;15(3):405. doi: 10.3390/biom15030405.

Abstract

Drug-target affinity (DTA) prediction is a critical aspect of drug discovery. The meaningful representation of drugs and targets is crucial for accurate prediction. Using 1D string-based representations for drugs and targets is a common approach that has demonstrated good results in drug-target affinity prediction. However, these approach lacks information on the relative position of the atoms and bonds. To address this limitation, graph-based representations have been used to some extent. However, solely considering the structural aspect of drugs and targets may be insufficient for accurate DTA prediction. Integrating the functional aspect of these drugs at the genetic level can enhance the prediction capability of the models. To fill this gap, we propose GramSeq-DTA, which integrates chemical perturbation information with the structural information of drugs and targets. We applied a Grammar Variational Autoencoder (GVAE) for drug feature extraction and utilized two different approaches for protein feature extraction as follows: a Convolutional Neural Network (CNN) and a Recurrent Neural Network (RNN). The chemical perturbation data are obtained from the L1000 project, which provides information on the up-regulation and down-regulation of genes caused by selected drugs. This chemical perturbation information is processed, and a compact dataset is prepared, serving as the functional feature set of the drugs. By integrating the drug, gene, and target features in the model, our approach outperforms the current state-of-the-art DTA prediction models when validated on widely used DTA datasets (BindingDB, Davis, and KIBA). This work provides a novel and practical approach to DTA prediction by merging the structural and functional aspects of biological entities, and it encourages further research in multi-modal DTA prediction.

摘要

药物-靶点亲和力(DTA)预测是药物发现的一个关键方面。药物和靶点的有意义表示对于准确预测至关重要。使用基于一维字符串的药物和靶点表示是一种常见方法,已在药物-靶点亲和力预测中取得了良好结果。然而,这些方法缺乏原子和键的相对位置信息。为了解决这一局限性,基于图的表示已在一定程度上得到应用。然而,仅考虑药物和靶点的结构方面对于准确的DTA预测可能是不够的。在基因水平整合这些药物的功能方面可以提高模型的预测能力。为了填补这一空白,我们提出了GramSeq-DTA,它将化学扰动信息与药物和靶点的结构信息相结合。我们应用语法变分自动编码器(GVAE)进行药物特征提取,并采用两种不同方法进行蛋白质特征提取,如下所示:卷积神经网络(CNN)和循环神经网络(RNN)。化学扰动数据来自L1000项目,该项目提供了所选药物引起的基因上调和下调信息。对这些化学扰动信息进行处理,并准备一个紧凑的数据集,作为药物的功能特征集。通过在模型中整合药物、基因和靶点特征,我们的方法在广泛使用的DTA数据集(BindingDB、Davis和KIBA)上进行验证时,优于当前最先进的DTA预测模型。这项工作通过融合生物实体的结构和功能方面,为DTA预测提供了一种新颖且实用的方法,并鼓励在多模态DTA预测方面进行进一步研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5097/11940521/f8bac52662fa/biomolecules-15-00405-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验