• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过整合药效团和预训练模型来提高药物-靶标相互作用识别的泛化能力和性能。

Enhancing generalizability and performance in drug-target interaction identification by integrating pharmacophore and pre-trained models.

机构信息

School of Software, Henan University, Kaifeng, Henan Province 475000, China.

Henan International Joint Laboratory of Intelligent Network Theory and Key Technology, Henan University, Kaifeng, Henan Province 475000, China.

出版信息

Bioinformatics. 2024 Jun 28;40(Suppl 1):i539-i547. doi: 10.1093/bioinformatics/btae240.

DOI:10.1093/bioinformatics/btae240
PMID:38940179
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11211825/
Abstract

MOTIVATION

In drug discovery, it is crucial to assess the drug-target binding affinity (DTA). Although molecular docking is widely used, computational efficiency limits its application in large-scale virtual screening. Deep learning-based methods learn virtual scoring functions from labeled datasets and can quickly predict affinity. However, there are three limitations. First, existing methods only consider the atom-bond graph or one-dimensional sequence representations of compounds, ignoring the information about functional groups (pharmacophores) with specific biological activities. Second, relying on limited labeled datasets fails to learn comprehensive embedding representations of compounds and proteins, resulting in poor generalization performance in complex scenarios. Third, existing feature fusion methods cannot adequately capture contextual interaction information.

RESULTS

Therefore, we propose a novel DTA prediction method named HeteroDTA. Specifically, a multi-view compound feature extraction module is constructed to model the atom-bond graph and pharmacophore graph. The residue concat graph and protein sequence are also utilized to model protein structure and function. Moreover, to enhance the generalization capability and reduce the dependence on task-specific labeled data, pre-trained models are utilized to initialize the atomic features of the compounds and the embedding representations of the protein sequence. A context-aware nonlinear feature fusion method is also proposed to learn interaction patterns between compounds and proteins. Experimental results on public benchmark datasets show that HeteroDTA significantly outperforms existing methods. In addition, HeteroDTA shows excellent generalization performance in cold-start experiments and superiority in the representation learning ability of drug-target pairs. Finally, the effectiveness of HeteroDTA is demonstrated in a real-world drug discovery study.

AVAILABILITY AND IMPLEMENTATION

The source code and data are available at https://github.com/daydayupzzl/HeteroDTA.

摘要

动机

在药物发现中,评估药物-靶标结合亲和力(DTA)至关重要。尽管分子对接被广泛应用,但计算效率限制了其在大规模虚拟筛选中的应用。基于深度学习的方法从标记数据集学习虚拟评分函数,并可以快速预测亲和力。然而,存在三个限制。首先,现有方法仅考虑化合物的原子-键图或一维序列表示,忽略了具有特定生物活性的功能基团(药效团)的信息。其次,依赖有限的标记数据集无法学习到化合物和蛋白质的全面嵌入表示,导致在复杂场景下的泛化性能较差。第三,现有的特征融合方法无法充分捕获上下文交互信息。

结果

因此,我们提出了一种名为 HeteroDTA 的新型 DTA 预测方法。具体来说,构建了一个多视图化合物特征提取模块,用于对原子-键图和药效团图进行建模。还利用残基连接图和蛋白质序列对蛋白质结构和功能进行建模。此外,为了增强泛化能力并减少对特定于任务的标记数据的依赖,利用预训练模型初始化化合物的原子特征和蛋白质序列的嵌入表示。还提出了一种上下文感知的非线性特征融合方法,以学习化合物和蛋白质之间的相互作用模式。在公共基准数据集上的实验结果表明,HeteroDTA 显著优于现有方法。此外,HeteroDTA 在冷启动实验中表现出出色的泛化性能,并且在药物-靶标对的表示学习能力方面具有优势。最后,在实际的药物发现研究中证明了 HeteroDTA 的有效性。

可用性和实现

源代码和数据可在 https://github.com/daydayupzzl/HeteroDTA 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ceb/11211825/dc7591a2247a/btae240f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ceb/11211825/8cad7a05a74c/btae240f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ceb/11211825/f467a7d48381/btae240f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ceb/11211825/b085b6530c15/btae240f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ceb/11211825/dc7591a2247a/btae240f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ceb/11211825/8cad7a05a74c/btae240f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ceb/11211825/f467a7d48381/btae240f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ceb/11211825/b085b6530c15/btae240f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ceb/11211825/dc7591a2247a/btae240f4.jpg

相似文献

1
Enhancing generalizability and performance in drug-target interaction identification by integrating pharmacophore and pre-trained models.通过整合药效团和预训练模型来提高药物-靶标相互作用识别的泛化能力和性能。
Bioinformatics. 2024 Jun 28;40(Suppl 1):i539-i547. doi: 10.1093/bioinformatics/btae240.
2
G-K BertDTA: A graph representation learning and semantic embedding-based framework for drug-target affinity prediction.G-K BertDTA:一种基于图表示学习和语义嵌入的药物-靶标亲和力预测框架。
Comput Biol Med. 2024 May;173:108376. doi: 10.1016/j.compbiomed.2024.108376. Epub 2024 Mar 25.
3
Drug-target affinity prediction with extended graph learning-convolutional networks.基于扩展图学习卷积网络的药物-靶标亲和力预测。
BMC Bioinformatics. 2024 Feb 16;25(1):75. doi: 10.1186/s12859-024-05698-6.
4
GeneralizedDTA: combining pre-training and multi-task learning to predict drug-target binding affinity for unknown drug discovery.通用 DTA:结合预训练和多任务学习,预测未知药物发现的药物-靶标结合亲和力。
BMC Bioinformatics. 2022 Sep 7;23(1):367. doi: 10.1186/s12859-022-04905-6.
5
Prediction of drug-target binding affinity based on multi-scale feature fusion.基于多尺度特征融合的药物-靶标结合亲和力预测。
Comput Biol Med. 2024 Aug;178:108699. doi: 10.1016/j.compbiomed.2024.108699. Epub 2024 Jun 4.
6
TC-DTA: Predicting Drug-Target Binding Affinity With Transformer and Convolutional Neural Networks.TC-DTA:基于 Transformer 和卷积神经网络的药物-靶标结合亲和力预测。
IEEE Trans Nanobioscience. 2024 Oct;23(4):572-578. doi: 10.1109/TNB.2024.3441590. Epub 2024 Oct 15.
7
Drug-target affinity prediction method based on multi-scale information interaction and graph optimization.基于多尺度信息交互和图优化的药物-靶标亲和力预测方法。
Comput Biol Med. 2023 Dec;167:107621. doi: 10.1016/j.compbiomed.2023.107621. Epub 2023 Oct 29.
8
FMGNN: A Method to Predict Compound-Protein Interaction With Pharmacophore Features and Physicochemical Properties of Amino Acids.FMGNN:一种利用药效团特征和氨基酸理化性质预测化合物-蛋白质相互作用的方法。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1030-1040. doi: 10.1109/TCBB.2022.3172340. Epub 2023 Apr 3.
9
GPCNDTA: Prediction of drug-target binding affinity through cross-attention networks augmented with graph features and pharmacophores.基于图特征和药效团增强的交叉注意力网络预测药物-靶标结合亲和力
Comput Biol Med. 2023 Nov;166:107512. doi: 10.1016/j.compbiomed.2023.107512. Epub 2023 Sep 20.
10
MDF-DTA: A Multi-Dimensional Fusion Approach for Drug-Target Binding Affinity Prediction.MDF-DTA:一种用于药物-靶标结合亲和力预测的多维融合方法。
J Chem Inf Model. 2024 Jul 8;64(13):4980-4990. doi: 10.1021/acs.jcim.4c00310. Epub 2024 Jun 18.

引用本文的文献

1
Harnessing artificial intelligence to identify Bufalin as a molecular glue degrader of estrogen receptor alpha.利用人工智能鉴定蟾毒灵为雌激素受体α的分子胶降解剂。
Nat Commun. 2025 Aug 22;16(1):7854. doi: 10.1038/s41467-025-62288-7.

本文引用的文献

1
FDA-approved heterocyclic molecules for cancer treatment: Synthesis, dosage, mechanism of action and their adverse effect.美国食品药品监督管理局(FDA)批准的用于癌症治疗的杂环分子:合成、剂量、作用机制及其不良反应。
Heliyon. 2023 Dec 5;10(1):e23172. doi: 10.1016/j.heliyon.2023.e23172. eCollection 2024 Jan 15.
2
GSAML-DTA: An interpretable drug-target binding affinity prediction model based on graph neural networks with self-attention mechanism and mutual information.GSAML-DTA:一种基于图神经网络和自注意力机制以及互信息的可解释药物-靶标结合亲和力预测模型。
Comput Biol Med. 2022 Nov;150:106145. doi: 10.1016/j.compbiomed.2022.106145. Epub 2022 Oct 4.
3
Computational approaches streamlining drug discovery.
计算方法简化药物发现。
Nature. 2023 Apr;616(7958):673-685. doi: 10.1038/s41586-023-05905-z. Epub 2023 Apr 26.
4
Sequence-based drug-target affinity prediction using weighted graph neural networks.基于序列的药物-靶标亲和力预测的加权图神经网络方法。
BMC Genomics. 2022 Jun 17;23(1):449. doi: 10.1186/s12864-022-08648-9.
5
Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences.生物结构和功能源于将无监督学习扩展到 2.5 亿个蛋白质序列。
Proc Natl Acad Sci U S A. 2021 Apr 13;118(15). doi: 10.1073/pnas.2016239118.
6
Computational molecular docking and virtual screening revealed promising SARS-CoV-2 drugs.计算分子对接和虚拟筛选揭示了有前景的抗SARS-CoV-2药物。
Precis Clin Med. 2021 Jan 18;4(1):1-16. doi: 10.1093/pcmedi/pbab001. eCollection 2021 Mar.
7
GraphDTA: predicting drug-target binding affinity with graph neural networks.GraphDTA:基于图神经网络的药物-靶标结合亲和力预测。
Bioinformatics. 2021 May 23;37(8):1140-1147. doi: 10.1093/bioinformatics/btaa921.
8
DeepCDA: deep cross-domain compound-protein affinity prediction through LSTM and convolutional neural networks.DeepCDA:通过 LSTM 和卷积神经网络进行深度跨域化合物-蛋白质亲和力预测。
Bioinformatics. 2020 Nov 1;36(17):4633-4642. doi: 10.1093/bioinformatics/btaa544.
9
DeepDTA: deep drug-target binding affinity prediction.深度 DTA:深度药物-靶标结合亲和力预测。
Bioinformatics. 2018 Sep 1;34(17):i821-i829. doi: 10.1093/bioinformatics/bty593.
10
SimBoost: a read-across approach for predicting drug-target binding affinities using gradient boosting machines.SimBoost:一种使用梯度提升机器预测药物-靶点结合亲和力的类推方法。
J Cheminform. 2017 Apr 18;9(1):24. doi: 10.1186/s13321-017-0209-z.