• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GraphsformerCPI:用于化合物-蛋白质相互作用预测的图Transformer。

GraphsformerCPI: Graph Transformer for Compound-Protein Interaction Prediction.

机构信息

School of Information Science and Engineering, Lanzhou University, Lanzhou, 730000, China.

School of Information Engineering, Lanzhou University of Finance and Economics, Lanzhou, 730020, China.

出版信息

Interdiscip Sci. 2024 Jun;16(2):361-377. doi: 10.1007/s12539-024-00609-y. Epub 2024 Mar 8.

DOI:10.1007/s12539-024-00609-y
PMID:38457109
Abstract

Accurately predicting compound-protein interactions (CPI) is a critical task in computer-aided drug design. In recent years, the exponential growth of compound activity and biomedical data has highlighted the need for efficient and interpretable prediction approaches. In this study, we propose GraphsformerCPI, an end-to-end deep learning framework that improves prediction performance and interpretability. GraphsformerCPI treats compounds and proteins as sequences of nodes with spatial structures, and leverages novel structure-enhanced self-attention mechanisms to integrate semantic and graph structural features within molecules for deep molecule representations. To capture the vital association between compound atoms and protein residues, we devise a dual-attention mechanism to effectively extract relational features through .cross-mapping. By extending the powerful learning capabilities of Transformers to spatial structures and extensively utilizing attention mechanisms, our model offers strong interpretability, a significant advantage over most black-box deep learning methods. To evaluate GraphsformerCPI, extensive experiments were conducted on benchmark datasets including human, C. elegans, Davis and KIBA datasets. We explored the impact of model depth and dropout rate on performance and compared our model against state-of-the-art baseline models. Our results demonstrate that GraphsformerCPI outperforms baseline models in classification datasets and achieves competitive performance in regression datasets. Specifically, on the human dataset, GraphsformerCPI achieves an average improvement of 1.6% in AUC, 0.5% in precision, and 5.3% in recall. On the KIBA dataset, the average improvement in Concordance index (CI) and mean squared error (MSE) is 3.3% and 7.2%, respectively. Molecular docking shows that our model provides novel insights into the intrinsic interactions and binding mechanisms. Our research holds practical significance in effectively predicting CPIs and binding affinities, identifying key atoms and residues, enhancing model interpretability.

摘要

准确预测化合物-蛋白质相互作用(CPI)是计算机辅助药物设计中的一项关键任务。近年来,化合物活性和生物医学数据的指数级增长突出了需要高效和可解释的预测方法。在这项研究中,我们提出了 GraphsformerCPI,这是一种端到端的深度学习框架,可提高预测性能和可解释性。GraphsformerCPI 将化合物和蛋白质视为具有空间结构的节点序列,并利用新颖的结构增强型自注意力机制,在分子内整合语义和图结构特征,以实现深分子表示。为了捕捉化合物原子与蛋白质残基之间的重要关联,我们设计了一种双重注意力机制,通过交叉映射有效地提取关系特征。通过将 Transformer 的强大学习能力扩展到空间结构,并广泛利用注意力机制,我们的模型提供了强大的可解释性,这是大多数黑盒深度学习方法的显著优势。为了评估 GraphsformerCPI,我们在包括人类、秀丽隐杆线虫、戴维斯和 KIBA 数据集在内的基准数据集上进行了广泛的实验。我们探讨了模型深度和辍学率对性能的影响,并将我们的模型与最先进的基线模型进行了比较。我们的结果表明,GraphsformerCPI 在分类数据集上优于基线模型,并在回归数据集上取得了有竞争力的性能。具体来说,在人类数据集上,GraphsformerCPI 在 AUC 中平均提高了 1.6%,在精度上提高了 0.5%,在召回率上提高了 5.3%。在 KIBA 数据集上,一致性指数(CI)和均方误差(MSE)的平均提高分别为 3.3%和 7.2%。分子对接表明,我们的模型提供了对内在相互作用和结合机制的新见解。我们的研究在有效预测 CPI 和结合亲和力、识别关键原子和残基、增强模型可解释性方面具有实际意义。

相似文献

1
GraphsformerCPI: Graph Transformer for Compound-Protein Interaction Prediction.GraphsformerCPI:用于化合物-蛋白质相互作用预测的图Transformer。
Interdiscip Sci. 2024 Jun;16(2):361-377. doi: 10.1007/s12539-024-00609-y. Epub 2024 Mar 8.
2
A deep learning method for predicting molecular properties and compound-protein interactions.一种用于预测分子性质和化合物-蛋白质相互作用的深度学习方法。
J Mol Graph Model. 2022 Dec;117:108283. doi: 10.1016/j.jmgm.2022.108283. Epub 2022 Aug 17.
3
TC-DTA: Predicting Drug-Target Binding Affinity With Transformer and Convolutional Neural Networks.TC-DTA:基于 Transformer 和卷积神经网络的药物-靶标结合亲和力预测。
IEEE Trans Nanobioscience. 2024 Oct;23(4):572-578. doi: 10.1109/TNB.2024.3441590. Epub 2024 Oct 15.
4
AttentionDTA: Drug-Target Binding Affinity Prediction by Sequence-Based Deep Learning With Attention Mechanism.AttentionDTA:基于序列的深度学习与注意力机制预测药物-靶点结合亲和力
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):852-863. doi: 10.1109/TCBB.2022.3170365. Epub 2023 Apr 3.
5
Compound-protein interaction prediction with end-to-end learning of neural networks for graphs and sequences.基于图和序列神经网络端到端学习的化合物-蛋白质相互作用预测。
Bioinformatics. 2019 Jan 15;35(2):309-318. doi: 10.1093/bioinformatics/bty535.
6
BACPI: a bi-directional attention neural network for compound-protein interaction and binding affinity prediction.BACPI:一种用于化合物-蛋白质相互作用和结合亲和力预测的双向注意力神经网络。
Bioinformatics. 2022 Mar 28;38(7):1995-2002. doi: 10.1093/bioinformatics/btac035.
7
CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions.CAT-CPI:结合卷积神经网络(CNN)和变压器(Transformer)学习化合物图像特征以预测化合物-蛋白质相互作用
Front Mol Biosci. 2022 Sep 15;9:963912. doi: 10.3389/fmolb.2022.963912. eCollection 2022.
8
A bidirectional interpretable compound-protein interaction prediction framework based on cross attention.基于交叉注意力的双向可解释化合物-蛋白质相互作用预测框架。
Comput Biol Med. 2024 Apr;172:108239. doi: 10.1016/j.compbiomed.2024.108239. Epub 2024 Mar 2.
9
MDL-CPI: Multi-view deep learning model for compound-protein interaction prediction.MDL-CPI:用于化合物-蛋白质相互作用预测的多视图深度学习模型。
Methods. 2022 Aug;204:418-427. doi: 10.1016/j.ymeth.2022.01.008. Epub 2022 Jan 31.
10
BindingSite-AugmentedDTA: enabling a next-generation pipeline for interpretable prediction models in drug repurposing.BindingSite-AugmentedDTA:在药物重定位中为可解释预测模型提供下一代管道。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad136.

引用本文的文献

1
Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.蛋白质序列分析全景:任务类型、数据库、数据集、词嵌入方法和语言模型的系统综述
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
2
DNA sequence analysis landscape: a comprehensive review of DNA sequence analysis task types, databases, datasets, word embedding methods, and language models.DNA序列分析全景:对DNA序列分析任务类型、数据库、数据集、词嵌入方法和语言模型的全面综述。
Front Med (Lausanne). 2025 Apr 8;12:1503229. doi: 10.3389/fmed.2025.1503229. eCollection 2025.
3

本文引用的文献

1
CoaDTI: multi-modal co-attention based framework for drug-target interaction annotation.CoaDTI:基于多模态协同注意力的药物-靶标相互作用标注框架。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac446.
2
Drug-target affinity prediction using graph neural network and contact maps.使用图神经网络和接触图进行药物-靶点亲和力预测。
RSC Adv. 2020 Jun 1;10(35):20701-20712. doi: 10.1039/d0ra02297g. eCollection 2020 May 27.
3
Compound-protein interaction prediction by deep learning: Databases, descriptors and models.基于深度学习的复合蛋白相互作用预测:数据库、描述符与模型。
GraphBAN: An inductive graph-based approach for enhanced prediction of compound-protein interactions.
GraphBAN:一种基于归纳图的方法,用于增强对化合物-蛋白质相互作用的预测。
Nat Commun. 2025 Mar 18;16(1):2541. doi: 10.1038/s41467-025-57536-9.
4
SELFprot: Effective and Efficient Multitask Finetuning Methods for Protein Parameter Prediction.SELFprot:用于蛋白质参数预测的高效多任务微调方法
J Chem Inf Model. 2025 Apr 14;65(7):3226-3238. doi: 10.1021/acs.jcim.4c02230. Epub 2025 Mar 17.
5
Transitioning from wet lab to artificial intelligence: a systematic review of AI predictors in CRISPR.从湿实验室到人工智能的转变:对CRISPR中人工智能预测因子的系统综述
J Transl Med. 2025 Feb 4;23(1):153. doi: 10.1186/s12967-024-06013-w.
6
RNA sequence analysis landscape: A comprehensive review of task types, databases, datasets, word embedding methods, and language models.RNA序列分析全景:任务类型、数据库、数据集、词嵌入方法及语言模型的全面综述
Heliyon. 2025 Jan 6;11(2):e41488. doi: 10.1016/j.heliyon.2024.e41488. eCollection 2025 Jan 30.
Drug Discov Today. 2022 May;27(5):1350-1366. doi: 10.1016/j.drudis.2022.02.023. Epub 2022 Mar 3.
4
Synthon-based ligand discovery in virtual libraries of over 11 billion compounds.基于合成子的配体发现虚拟库超过 110 亿化合物。
Nature. 2022 Jan;601(7893):452-459. doi: 10.1038/s41586-021-04220-9. Epub 2021 Dec 15.
5
A virtual drug-screening approach to conquer huge chemical libraries.一种攻克海量化学文库的虚拟药物筛选方法。
Nature. 2022 Jan;601(7893):322-323. doi: 10.1038/d41586-021-03682-1.
6
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
7
Deep Learning in Drug Target Interaction Prediction: Current and Future Perspectives.深度学习在药物靶点相互作用预测中的应用:现状与未来展望。
Curr Med Chem. 2021;28(11):2100-2113. doi: 10.2174/0929867327666200907141016.
8
TransformerCPI: improving compound-protein interaction prediction by sequence-based deep learning with self-attention mechanism and label reversal experiments.TransformerCPI:通过基于序列的深度学习、自注意力机制和标签反转实验提高化合物-蛋白质相互作用预测。
Bioinformatics. 2020 Aug 15;36(16):4406-4414. doi: 10.1093/bioinformatics/btaa524.
9
A Graph Convolutional Network-Based Method for Chemical-Protein Interaction Extraction: Algorithm Development.一种基于图卷积网络的化学-蛋白质相互作用提取方法:算法开发
JMIR Med Inform. 2020 May 19;8(5):e17643. doi: 10.2196/17643.
10
Machine learning models for drug-target interactions: current knowledge and future directions.用于药物-靶点相互作用的机器学习模型:当前认知与未来方向。
Drug Discov Today. 2020 Apr;25(4):748-756. doi: 10.1016/j.drudis.2020.03.003. Epub 2020 Mar 12.