• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过保持嵌入之间的原始拓扑关系来准确预测药物-蛋白质相互作用。

Accurate prediction of drug-protein interactions by maintaining the original topological relationships among embeddings.

作者信息

Li Yanfei, Chen Xiran, Wang Shuqin, Wei Jinmao

机构信息

College of Computer Science, Nankai University, 300071, Tianjin, China.

National Heart & Lung Institute, Imperial College London, SW3 6LY, London, UK.

出版信息

BMC Biol. 2025 Aug 5;23(1):243. doi: 10.1186/s12915-025-02338-0.

DOI:10.1186/s12915-025-02338-0
PMID:40764993
Abstract

BACKGROUND

Learning-based methods have recently demonstrated strong potential in predicting drug-protein interactions (DPIs). However, existing approaches often fail to achieve accurate predictions on real-world imbalanced datasets while maintaining high generalizability and scalability, limiting their practical applicability.

RESULTS

This study proposes a highly generalized model, GLDPI, aimed at improving prediction accuracy in imbalanced scenarios by preserving the topological relationships among initial molecular representations in the embedding space. Specifically, GLDPI employs dedicated encoders to transform one-dimensional sequence information of drugs and proteins into embedding representations and efficiently calculates the likelihood of DPIs using cosine similarity. Additionally, we introduce a prior loss function based on the guilt-by-association principle to ensure that the topology of the embedding space aligns with the structure of the initial drug-protein network. This design enables GLDPI to effectively capture network relationships and key features of molecular interactions, thereby significantly enhancing predictive performance.

CONCLUSIONS

Experimental results highlight GLDPI's superior performance on multiple highly imbalanced benchmark datasets, achieving over a 100% improvement in the AUPR metric compared to state-of-the-art methods. Additionally, GLDPI demonstrates exceptional generalization capabilities in cold-start experiments, excelling in predicting novel drug-protein interactions. Furthermore, the model exhibits remarkable scalability, efficiently inferring approximately drug-protein pairs in less than 10 h.

摘要

背景

基于学习的方法最近在预测药物-蛋白质相互作用(DPI)方面显示出强大的潜力。然而,现有的方法在处理现实世界中的不平衡数据集时,往往难以在保持高泛化性和可扩展性的同时实现准确的预测,这限制了它们的实际应用。

结果

本研究提出了一种高度泛化的模型GLDPI,旨在通过保留嵌入空间中初始分子表示之间的拓扑关系来提高不平衡场景下的预测准确性。具体而言,GLDPI采用专用编码器将药物和蛋白质的一维序列信息转换为嵌入表示,并使用余弦相似度有效地计算DPI的可能性。此外,我们引入了一种基于关联有罪原则的先验损失函数,以确保嵌入空间的拓扑结构与初始药物-蛋白质网络的结构一致。这种设计使GLDPI能够有效地捕捉网络关系和分子相互作用的关键特征,从而显著提高预测性能。

结论

实验结果突出了GLDPI在多个高度不平衡基准数据集上的卓越性能,与现有方法相比,AUPR指标提高了100%以上。此外,GLDPI在冷启动实验中表现出出色的泛化能力,在预测新型药物-蛋白质相互作用方面表现优异。此外,该模型具有显著的可扩展性,能够在不到10小时的时间内高效推断出约 药物-蛋白质对。

相似文献

1
Accurate prediction of drug-protein interactions by maintaining the original topological relationships among embeddings.通过保持嵌入之间的原始拓扑关系来准确预测药物-蛋白质相互作用。
BMC Biol. 2025 Aug 5;23(1):243. doi: 10.1186/s12915-025-02338-0.
2
MixingDTA: improved drug-target affinity prediction by extending mixup with guilt-by-association.MixingDTA:通过关联负罪感扩展混合增强来改进药物-靶点亲和力预测
Bioinformatics. 2025 Jul 1;41(Supplement_1):i105-i114. doi: 10.1093/bioinformatics/btaf238.
3
EPI-DynFusion: enhancer-promoter interaction prediction model based on sequence features and dynamic fusion mechanisms.EPI-DynFusion:基于序列特征和动态融合机制的增强子-启动子相互作用预测模型。
Front Genet. 2025 Jul 23;16:1614222. doi: 10.3389/fgene.2025.1614222. eCollection 2025.
4
Advancing edge-based clustering and graph embedding for biological network analysis: a case study in RASopathies.用于生物网络分析的基于前沿的聚类和图嵌入:以RASopathies为例的研究
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf320.
5
Predicting Drug-Side Effect Relationships From Parametric Knowledge Embedded in Biomedical BERT Models: Methodological Study With a Natural Language Processing Approach.从生物医学BERT模型中嵌入的参数知识预测药物副作用关系:一种自然语言处理方法的方法学研究
JMIR Med Inform. 2025 Jul 10;13:e67513. doi: 10.2196/67513.
6
Enhancing intelligent transportation systems with a more efficient model for long-term traffic predictions based on an attention mechanism and a residual temporal convolutional network.基于注意力机制和残差时间卷积网络,通过一种更高效的长期交通预测模型来增强智能交通系统。
Neural Netw. 2025 Jul 23;192:107897. doi: 10.1016/j.neunet.2025.107897.
7
A 4D tensor-enhanced multi-dimensional convolutional neural network for accurate prediction of protein-ligand binding affinity.一种用于准确预测蛋白质-配体结合亲和力的4D张量增强多维卷积神经网络。
Mol Divers. 2024 Dec 23. doi: 10.1007/s11030-024-11044-y.
8
ResNet-SE-CBAM Siamese Networks for Few-Shot and Imbalanced PCB Defect Classification.用于少样本和不平衡印刷电路板缺陷分类的残差网络-挤压与激励-卷积块注意力模块连体网络
Sensors (Basel). 2025 Jul 7;25(13):4233. doi: 10.3390/s25134233.
9
DDintensity: Addressing imbalanced drug-drug interaction risk levels using pre-trained deep learning model embeddings.DD强度:使用预训练的深度学习模型嵌入来处理不均衡的药物-药物相互作用风险水平。
Artif Intell Med. 2025 Oct;168:103202. doi: 10.1016/j.artmed.2025.103202. Epub 2025 Jul 1.
10
Sparse-view spectral CT reconstruction via a coupled subspace representation and score-based generative model.基于耦合子空间表示和基于分数的生成模型的稀疏视图光谱CT重建
Quant Imaging Med Surg. 2025 Jun 6;15(6):5474-5495. doi: 10.21037/qims-24-2226. Epub 2025 May 28.

引用本文的文献

1
AMCF-RDP: a self-attention-based multi-source and cascade framework for the identification of drug-protein relationships.AMCF-RDP:一种基于自注意力机制的多源级联框架,用于识别药物-蛋白质关系。
Mol Divers. 2025 Aug 27. doi: 10.1007/s11030-025-11337-w.

本文引用的文献

1
Drug-target interaction prediction with collaborative contrastive learning and adaptive self-paced sampling strategy.基于协同对比学习和自适应自步采样策略的药物-靶标相互作用预测。
BMC Biol. 2024 Sep 27;22(1):216. doi: 10.1186/s12915-024-02012-x.
2
MvGraphDTA: multi-view-based graph deep model for drug-target affinity prediction by introducing the graphs and line graphs.MvGraphDTA:基于多视图的图深度学习模型,通过引入图和折线图来预测药物-靶标亲和力。
BMC Biol. 2024 Aug 26;22(1):182. doi: 10.1186/s12915-024-01981-3.
3
Predicting drug-target binding affinity with cross-scale graph contrastive learning.
基于跨尺度图对比学习的药物-靶标结合亲和力预测。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad516.
4
DeepCompoundNet: enhancing compound-protein interaction prediction with multimodal convolutional neural networks.深度化合物网络:利用多模态卷积神经网络增强化合物-蛋白质相互作用预测
J Biomol Struct Dyn. 2025 Feb;43(3):1414-1423. doi: 10.1080/07391102.2023.2291829. Epub 2023 Dec 12.
5
Predicting Drug-Protein Interactions by Self-Adaptively Adjusting the Topological Structure of the Heterogeneous Network.通过自适应调整异质网络的拓扑结构来预测药物-蛋白质相互作用。
IEEE J Biomed Health Inform. 2023 Nov;27(11):5675-5684. doi: 10.1109/JBHI.2023.3312374. Epub 2023 Nov 7.
6
Contrastive learning in protein language space predicts interactions between drugs and protein targets.蛋白质语言空间中的对比学习可预测药物与蛋白质靶标之间的相互作用。
Proc Natl Acad Sci U S A. 2023 Jun 13;120(24):e2220778120. doi: 10.1073/pnas.2220778120. Epub 2023 Jun 8.
7
A Deep Neural Network-Based Co-Coding Method to Predict Drug-Protein Interactions by Analyzing the Feature Consistency Between Drugs and Proteins.基于深度神经网络的药物-蛋白质相互作用预测的共编码方法,通过分析药物和蛋白质之间的特征一致性。
IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):2200-2209. doi: 10.1109/TCBB.2023.3237863. Epub 2023 Jun 5.
8
MCANet: shared-weight-based MultiheadCrossAttention network for drug-target interaction prediction.MCANet:用于药物-靶点相互作用预测的基于共享权重的多头交叉注意力网络。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad082.
9
Drug-Protein interaction prediction by correcting the effect of incomplete information in heterogeneous information.通过纠正异质信息中不完整信息的影响来预测药物-蛋白质相互作用。
Bioinformatics. 2022 Nov 15;38(22):5073-5080. doi: 10.1093/bioinformatics/btac629.
10
IIFDTI: predicting drug-target interactions through interactive and independent features based on attention mechanism.IIFDTI:基于注意力机制的交互独立特征预测药物-靶标相互作用。
Bioinformatics. 2022 Sep 2;38(17):4153-4161. doi: 10.1093/bioinformatics/btac485.