• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于蛋白质语言模型的深度图耦合网络评估蛋白质模型质量。

Assessing protein model quality based on deep graph coupled networks using protein language model.

机构信息

College of Information Engineering, Zhejiang University of Technology.

researcher of AI in the BioMap.

出版信息

Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad420.

DOI:10.1093/bib/bbad420
PMID:38018909
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10685403/
Abstract

Model quality evaluation is a crucial part of protein structural biology. How to distinguish high-quality models from low-quality models, and to assess which high-quality models have relatively incorrect regions for improvement, are remain a challenge. More importantly, the quality assessment of multimer models is a hot topic for structure prediction. In this study, we propose GraphCPLMQA, a novel approach for evaluating residue-level model quality that combines graph coupled networks and embeddings from protein language models. The GraphCPLMQA consists of a graph encoding module and a transform-based convolutional decoding module. In encoding module, the underlying relational representations of sequence and high-dimensional geometry structure are extracted by protein language models with Evolutionary Scale Modeling. In decoding module, the mapping connection between structure and quality is inferred by the representations and low-dimensional features. Specifically, the triangular location and residue level contact order features are designed to enhance the association between the local structure and the overall topology. Experimental results demonstrate that GraphCPLMQA using single-sequence embedding achieves the best performance compared with the CASP15 residue-level interface evaluation methods among 9108 models in the local residue interface test set of CASP15 multimers. In CAMEO blind test (20 May 2022 to 13 August 2022), GraphCPLMQA ranked first compared with other servers (https://www.cameo3d.org/quality-estimation). GraphCPLMQA also outperforms state-of-the-art methods on 19, 035 models in CASP13 and CASP14 monomer test set.

摘要

模型质量评估是蛋白质结构生物学的一个重要组成部分。如何区分高质量模型和低质量模型,以及评估哪些高质量模型具有需要改进的相对错误区域,仍然是一个挑战。更重要的是,多聚体模型的质量评估是结构预测的一个热门话题。在这项研究中,我们提出了 GraphCPLMQA,这是一种结合了图耦合网络和蛋白质语言模型的嵌入的残基水平模型质量评估的新方法。GraphCPLMQA 由图编码模块和基于变换的卷积解码模块组成。在编码模块中,通过进化尺度建模的蛋白质语言模型提取序列和高维几何结构的基本关系表示。在解码模块中,通过表示和低维特征推断结构和质量之间的映射关系。具体来说,设计了三角形位置和残基级接触顺序特征来增强局部结构与整体拓扑之间的关联。实验结果表明,与 CASP15 多聚体局部残基界面测试集中的 9108 个模型中的其他 CASP15 残基界面评估方法相比,GraphCPLMQA 使用单序列嵌入实现了最佳性能。在 CAMEO 盲测(2022 年 5 月 20 日至 2022 年 8 月 13 日)中,GraphCPLMQA 与其他服务器(https://www.cameo3d.org/quality-estimation)相比排名第一。GraphCPLMQA 在 CASP13 和 CASP14 单体测试集中的 19035 个模型上的表现也优于最先进的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/41395856740a/bbad420f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/e2d613793c80/bbad420f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/3dd18715bbcc/bbad420f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/0a610056ac69/bbad420f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/83dec64add60/bbad420f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/6f6dac717627/bbad420f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/41395856740a/bbad420f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/e2d613793c80/bbad420f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/3dd18715bbcc/bbad420f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/0a610056ac69/bbad420f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/83dec64add60/bbad420f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/6f6dac717627/bbad420f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68a3/10685403/41395856740a/bbad420f6.jpg

相似文献

1
Assessing protein model quality based on deep graph coupled networks using protein language model.基于蛋白质语言模型的深度图耦合网络评估蛋白质模型质量。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad420.
2
GraphGPSM: a global scoring model for protein structure using graph neural networks.GraphGPSM:一种使用图神经网络的蛋白质结构全局评分模型。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad219.
3
Improved model quality assessment using sequence and structural information by enhanced deep neural networks.通过增强深度神经网络利用序列和结构信息改进模型质量评估。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac507.
4
Predicting residue-specific qualities of individual protein models using residual neural networks and graph neural networks.使用残差神经网络和图神经网络预测个体蛋白质模型的残基特异性性质。
Proteins. 2022 Dec;90(12):2091-2102. doi: 10.1002/prot.26400. Epub 2022 Jul 30.
5
DeepUMQA: ultrafast shape recognition-based protein model quality assessment using deep learning.DeepUMQA:基于超快形状识别的深度学习蛋白质模型质量评估。
Bioinformatics. 2022 Mar 28;38(7):1895-1903. doi: 10.1093/bioinformatics/btac056.
6
EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction.EmbedDTI:通过序列嵌入和图卷积网络增强分子表示,用于药物-靶标相互作用预测。
Biomolecules. 2021 Nov 29;11(12):1783. doi: 10.3390/biom11121783.
7
Modeling aspects of the language of life through transfer-learning protein sequences.通过转移学习蛋白质序列来模拟生命语言的各个方面。
BMC Bioinformatics. 2019 Dec 17;20(1):723. doi: 10.1186/s12859-019-3220-8.
8
Estimating protein complex model accuracy based on ultrafast shape recognition and deep learning in CASP15.基于超快形状识别和深度学习的 CASP15 中蛋白质复合物模型精度估计。
Proteins. 2023 Dec;91(12):1861-1870. doi: 10.1002/prot.26564. Epub 2023 Aug 8.
9
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。
Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.
10
EGPDI: identifying protein-DNA binding sites based on multi-view graph embedding fusion.EGPDI:基于多视图图嵌入融合的蛋白质-DNA 结合位点识别。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae330.

引用本文的文献

1
Estimating protein complex model accuracy using graph transformers and pairwise similarity graphs.使用图变换器和成对相似性图估计蛋白质复合物模型的准确性。
Bioinform Adv. 2025 Jul 29;5(1):vbaf180. doi: 10.1093/bioadv/vbaf180. eCollection 2025.
2
Beyond static structures: protein dynamic conformations modeling in the post-AlphaFold era.超越静态结构:后AlphaFold时代的蛋白质动态构象建模
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf340.
3
M-DeepAssembly: enhanced DeepAssembly based on multi-objective multi-domain protein conformation sampling.

本文引用的文献

1
Estimation of model accuracy in CASP15 using the ModFOLDdock server.使用 ModFOLDdock 服务器评估 CASP15 中的模型精度。
Proteins. 2023 Dec;91(12):1871-1878. doi: 10.1002/prot.26532. Epub 2023 Jun 14.
2
Fast, accurate antibody structure prediction from deep learning on massive set of natural antibodies.基于大规模天然抗体数据集的深度学习实现快速、准确的抗体结构预测。
Nat Commun. 2023 Apr 25;14(1):2389. doi: 10.1038/s41467-023-38063-x.
3
Prediction of protein structures, functions and interactions using the IntFOLD7, MultiFOLD and ModFOLDdock servers.
M-深度组装:基于多目标多域蛋白质构象采样的增强型深度组装
BMC Bioinformatics. 2025 May 5;26(1):120. doi: 10.1186/s12859-025-06131-2.
4
DeepUMQA-X: Comprehensive and insightful estimation of model accuracy for protein single-chain and complex.DeepUMQA-X:对蛋白质单链和复合物模型准确性的全面且有深刻见解的评估。
Nucleic Acids Res. 2025 May 5. doi: 10.1093/nar/gkaf380.
5
Physical-aware model accuracy estimation for protein complex using deep learning method.基于深度学习方法的蛋白质复合物物理感知模型准确性估计
Comput Struct Biotechnol J. 2025 Jan 22;27:478-487. doi: 10.1016/j.csbj.2025.01.017. eCollection 2025.
6
EquiRank: Improved protein-protein interface quality estimation using protein language-model-informed equivariant graph neural networks.EquiRank:使用蛋白质语言模型引导的等变图神经网络改进蛋白质-蛋白质界面质量评估
Comput Struct Biotechnol J. 2024 Dec 30;27:160-170. doi: 10.1016/j.csbj.2024.12.015. eCollection 2025.
7
A Survey of Deep Learning Methods for Estimating the Accuracy of Protein Quaternary Structure Models.深度学习方法估计蛋白质四级结构模型准确性的研究综述。
Biomolecules. 2024 May 13;14(5):574. doi: 10.3390/biom14050574.
8
Recent advances and challenges in protein complex model accuracy estimation.蛋白质复合物模型准确性评估的最新进展与挑战
Comput Struct Biotechnol J. 2024 Apr 21;23:1824-1832. doi: 10.1016/j.csbj.2024.04.049. eCollection 2024 Dec.
9
Multi-domain and complex protein structure prediction using inter-domain interactions from deep learning.利用深度学习的域间相互作用进行多域和复杂蛋白质结构预测。
Commun Biol. 2023 Dec 1;6(1):1221. doi: 10.1038/s42003-023-05610-7.
使用 IntFOLD7、MultiFOLD 和 ModFOLDdock 服务器预测蛋白质结构、功能和相互作用。
Nucleic Acids Res. 2023 Jul 5;51(W1):W274-W280. doi: 10.1093/nar/gkad297.
4
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
5
Protein structure and folding pathway prediction based on remote homologs recognition using PAthreader.基于远程同源物识别的 PAthreader 进行蛋白质结构和折叠途径预测。
Commun Biol. 2023 Mar 4;6(1):243. doi: 10.1038/s42003-023-04605-8.
6
Improved model quality assessment using sequence and structural information by enhanced deep neural networks.通过增强深度神经网络利用序列和结构信息改进模型质量评估。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac507.
7
Single-sequence protein structure prediction using a language model and deep learning.基于语言模型和深度学习的单序列蛋白质结构预测。
Nat Biotechnol. 2022 Nov;40(11):1617-1623. doi: 10.1038/s41587-022-01432-w. Epub 2022 Oct 3.
8
I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction.I-TASSER-MTD:一个基于深度学习的多领域蛋白质结构和功能预测平台。
Nat Protoc. 2022 Oct;17(10):2326-2353. doi: 10.1038/s41596-022-00728-0. Epub 2022 Aug 5.
9
DeepUMQA: ultrafast shape recognition-based protein model quality assessment using deep learning.DeepUMQA:基于超快形状识别的深度学习蛋白质模型质量评估。
Bioinformatics. 2022 Mar 28;38(7):1895-1903. doi: 10.1093/bioinformatics/btac056.
10
AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.AlphaFold 蛋白质结构数据库:用高精度模型极大地扩展蛋白质序列空间的结构覆盖范围。
Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.