• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepRank-GNN-esm:一种使用蛋白质语言模型对蛋白质-蛋白质模型进行评分的图神经网络。

DeepRank-GNN-esm: a graph neural network for scoring protein-protein models using protein language model.

作者信息

Xu Xiaotong, Bonvin Alexandre M J J

机构信息

Department of Chemistry, Faculty of Science, Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Utrecht 3584 CS, The Netherlands.

出版信息

Bioinform Adv. 2024 Jan 5;4(1):vbad191. doi: 10.1093/bioadv/vbad191. eCollection 2024.

DOI:10.1093/bioadv/vbad191
PMID:38213822
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10782804/
Abstract

MOTIVATION

Protein-Protein interactions (PPIs) play critical roles in numerous cellular processes. By modelling the 3D structures of the correspond protein complexes valuable insights can be obtained, providing, e.g. starting points for drug and protein design. One challenge in the modelling process is however the identification of near-native models from the large pool of generated models. To this end we have previously developed DeepRank-GNN, a graph neural network that integrates structural and sequence information to enable effective pattern learning at PPI interfaces. Its main features are related to the Position Specific Scoring Matrices (PSSMs), which are computationally expensive to generate, significantly limits the algorithm's usability.

RESULTS

We introduce here DeepRank-GNN-esm that includes as additional features protein language model embeddings from the ESM-2 model. We show that the ESM-2 embeddings can actually replace the PSSM features at no cost in-, or even better performance on two PPI-related tasks: scoring docking poses and detecting crystal artifacts. This new DeepRank version bypasses thus the need of generating PSSM, greatly improving the usability of the software and opening new application opportunities for systems for which PSSM profiles cannot be obtained or are irrelevant (e.g. antibody-antigen complexes).

AVAILABILITY AND IMPLEMENTATION

DeepRank-GNN-esm is freely available from https://github.com/DeepRank/DeepRank-GNN-esm.

摘要

动机

蛋白质-蛋白质相互作用(PPI)在众多细胞过程中发挥着关键作用。通过对相应蛋白质复合物的三维结构进行建模,可以获得有价值的见解,例如为药物和蛋白质设计提供起点。然而,建模过程中的一个挑战是从大量生成的模型中识别接近天然的模型。为此,我们之前开发了DeepRank-GNN,这是一种图神经网络,它整合了结构和序列信息,以便在PPI界面进行有效的模式学习。其主要特征与位置特异性评分矩阵(PSSM)相关,而生成PSSM的计算成本很高,这显著限制了该算法的可用性。

结果

我们在此引入DeepRank-GNN-esm,它包含来自ESM-2模型的蛋白质语言模型嵌入作为附加特征。我们表明,在两个与PPI相关的任务中,即对接姿势评分和晶体伪像检测,ESM-2嵌入实际上可以免费替代PSSM特征,甚至性能更好。因此,这个新的DeepRank版本无需生成PSSM,大大提高了软件的可用性,并为无法获得或不相关PSSM概况的系统(例如抗体-抗原复合物)开辟了新的应用机会。

可用性和实现

DeepRank-GNN-esm可从https://github.com/DeepRank/DeepRank-GNN-esm免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93d5/10782804/478cb6d0b474/vbad191f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93d5/10782804/eb73d8729ca3/vbad191f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93d5/10782804/478cb6d0b474/vbad191f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93d5/10782804/eb73d8729ca3/vbad191f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93d5/10782804/478cb6d0b474/vbad191f2.jpg

相似文献

1
DeepRank-GNN-esm: a graph neural network for scoring protein-protein models using protein language model.DeepRank-GNN-esm:一种使用蛋白质语言模型对蛋白质-蛋白质模型进行评分的图神经网络。
Bioinform Adv. 2024 Jan 5;4(1):vbad191. doi: 10.1093/bioadv/vbad191. eCollection 2024.
2
DeepRank-GNN: a graph neural network framework to learn patterns in protein-protein interfaces.DeepRank-GNN:一种图神经网络框架,用于学习蛋白质-蛋白质界面中的模式。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac759.
3
iScore: a novel graph kernel-based function for scoring protein-protein docking models.iScore:一种用于评估蛋白质-蛋白质对接模型的基于新型图核的函数。
Bioinformatics. 2020 Jan 1;36(1):112-121. doi: 10.1093/bioinformatics/btz496.
4
DeepRank: a deep learning framework for data mining 3D protein-protein interfaces.DeepRank:用于挖掘 3D 蛋白质-蛋白质界面的深度学习框架。
Nat Commun. 2021 Dec 3;12(1):7068. doi: 10.1038/s41467-021-27396-0.
5
G-RANK: an equivariant graph neural network for the scoring of protein-protein docking models.G-RANK:一种用于蛋白质-蛋白质对接模型评分的等变图神经网络。
Bioinform Adv. 2023 Feb 3;3(1):vbad011. doi: 10.1093/bioadv/vbad011. eCollection 2023.
6
Multiphysical graph neural network (MP-GNN) for COVID-19 drug design.多物理图神经网络(MP-GNN)在 COVID-19 药物设计中的应用。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac231.
7
Protein Docking Model Evaluation by Graph Neural Networks.基于图神经网络的蛋白质对接模型评估
Front Mol Biosci. 2021 May 25;8:647915. doi: 10.3389/fmolb.2021.647915. eCollection 2021.
8
Comprehensive Research on Druggable Proteins: From PSSM to Pre-Trained Language Models.可成药蛋白的综合研究:从位置特异性得分矩阵到预训练语言模型
Int J Mol Sci. 2024 Apr 19;25(8):4507. doi: 10.3390/ijms25084507.
9
SP-GNN: Learning structure and position information from graphs.SP-GNN:从图中学习结构和位置信息。
Neural Netw. 2023 Apr;161:505-514. doi: 10.1016/j.neunet.2023.01.051. Epub 2023 Feb 4.
10
Combining protein sequences and structures with transformers and equivariant graph neural networks to predict protein function.将蛋白质序列和结构与转换器和等变图神经网络相结合,以预测蛋白质功能。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i318-i325. doi: 10.1093/bioinformatics/btad208.

引用本文的文献

1
Large Context, Deeper Insights: Harnessing Large Language Models for Advancing Protein-Protein Interaction Analysis.大背景,更深刻的见解:利用大语言模型推动蛋白质-蛋白质相互作用分析
Methods Mol Biol. 2025;2941:243-267. doi: 10.1007/978-1-0716-4623-6_15.
2
OPUS-BFactor: Predicting Protein B-Factor with Sequence and Structure Information.OPUS-B因子:利用序列和结构信息预测蛋白质B因子
Molecules. 2025 Jun 12;30(12):2570. doi: 10.3390/molecules30122570.
3
Recent advances in deep learning for protein-protein interaction: a review.深度学习在蛋白质-蛋白质相互作用研究中的最新进展:综述

本文引用的文献

1
Evaluation of AlphaFold-Multimer prediction on multi-chain protein complexes.评估 AlphaFold-Multimer 在多链蛋白质复合物上的预测。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad424.
2
Discriminating physiological from non-physiological interfaces in structures of protein complexes: A community-wide study.区分蛋白质复合物结构中的生理和非生理界面:一项全社区研究。
Proteomics. 2023 Sep;23(17):e2200323. doi: 10.1002/pmic.202200323. Epub 2023 Jun 27.
3
Evolutionary-scale prediction of atomic-level protein structure with a language model.
BioData Min. 2025 Jun 16;18(1):43. doi: 10.1186/s13040-025-00457-6.
4
Deep learning in GPCR drug discovery: benchmarking the path to accurate peptide binding.G蛋白偶联受体药物发现中的深度学习:评估准确肽结合的途径
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf186.
5
Compact Assessment of Molecular Surface Complementarities Enhances Neural Network-Aided Prediction of Key Binding Residues.分子表面互补性的紧凑评估增强了神经网络辅助的关键结合残基预测。
J Chem Inf Model. 2025 Mar 10;65(5):2695-2709. doi: 10.1021/acs.jcim.4c02286. Epub 2025 Feb 21.
6
Protein A-like Peptide Design Based on Diffusion and ESM2 Models.基于扩散和 ESM2 模型的蛋白 A 样肽设计。
Molecules. 2024 Oct 21;29(20):4965. doi: 10.3390/molecules29204965.
7
EuDockScore: Euclidean graph neural networks for scoring protein-protein interfaces.EuDockScore:用于打分蛋白质-蛋白质界面的欧几里得图神经网络。
Bioinformatics. 2024 Nov 1;40(11). doi: 10.1093/bioinformatics/btae636.
用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
4
MetaScore: A Novel Machine-Learning-Based Approach to Improve Traditional Scoring Functions for Scoring Protein-Protein Docking Conformations.MetaScore:一种改进基于传统打分函数的蛋白质-蛋白质对接构象打分方法的新型机器学习方法。
Biomolecules. 2023 Jan 6;13(1):121. doi: 10.3390/biom13010121.
5
Ranking Peptide Binders by Affinity with AlphaFold.利用AlphaFold按亲和力对肽结合物进行排名。
Angew Chem Int Ed Engl. 2023 Feb 6;62(7):e202213362. doi: 10.1002/anie.202213362. Epub 2023 Jan 12.
6
DeepRank-GNN: a graph neural network framework to learn patterns in protein-protein interfaces.DeepRank-GNN:一种图神经网络框架,用于学习蛋白质-蛋白质界面中的模式。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac759.
7
Improving peptide-protein docking with AlphaFold-Multimer using forced sampling.利用强制采样改进使用AlphaFold-Multimer的肽-蛋白质对接。
Front Bioinform. 2022 Sep 26;2:959160. doi: 10.3389/fbinf.2022.959160. eCollection 2022.
8
AF2Complex predicts direct physical interactions in multimeric proteins with deep learning.AF2Complex 利用深度学习预测多聚体蛋白质中的直接物理相互作用。
Nat Commun. 2022 Apr 1;13(1):1744. doi: 10.1038/s41467-022-29394-2.
9
From systems to structure - using genetic data to model protein structures.从系统到结构——利用遗传数据来构建蛋白质结构模型。
Nat Rev Genet. 2022 Jun;23(6):342-354. doi: 10.1038/s41576-021-00441-w. Epub 2022 Jan 10.
10
Harnessing protein folding neural networks for peptide-protein docking.利用蛋白质折叠神经网络进行肽-蛋白对接。
Nat Commun. 2022 Jan 10;13(1):176. doi: 10.1038/s41467-021-27838-9.