• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合成对结构相似性和深度学习界面接触预测来评估CASP15中蛋白质复合物模型的准确性。

Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15.

作者信息

Roy Raj S, Liu Jian, Giri Nabin, Guo Zhiye, Cheng Jianlin

机构信息

Department of Electrical Engineering and Computer Science, NextGen Precision Health, University of Missouri, Columbia, MO 65211, USA.

出版信息

bioRxiv. 2023 Mar 12:2023.03.08.531814. doi: 10.1101/2023.03.08.531814.

DOI:10.1101/2023.03.08.531814
PMID:36945536
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10028888/
Abstract

Estimating the accuracy of quaternary structural models of protein complexes and assemblies (EMA) is important for predicting quaternary structures and applying them to studying protein function and interaction. The pairwise similarity between structural models is proven useful for estimating the quality of protein tertiary structural models, but it has been rarely applied to predicting the quality of quaternary structural models. Moreover, the pairwise similarity approach often fails when many structural models are of low quality and similar to each other. To address the gap, we developed a hybrid method (MULTICOM_qa) combining a pairwise similarity score (PSS) and an interface contact probability score (ICPS) based on the deep learning inter-chain contact prediction for estimating protein complex model accuracy. It blindly participated in the 15th Critical Assessment of Techniques for Protein Structure Prediction (CASP15) in 2022 and ranked first out of 24 predictors in estimating the global accuracy of assembly models. The average per-target correlation coefficient between the model quality scores predicted by MULTICOM_qa and the true quality scores of the models of CASP15 assembly targets is 0.66. The average per-target ranking loss in using the predicted quality scores to rank the models is 0.14. It was able to select good models for most targets. Moreover, several key factors (i.e., target difficulty, model sampling difficulty, skewness of model quality, and similarity between good/bad models) for EMA are identified and analayzed. The results demonstrate that combining the multi-model method (PSS) with the complementary single-model method (ICPS) is a promising approach to EMA. The source code of MULTICOM_qa is available at https://github.com/BioinfoMachineLearning/MULTICOM_qa .

摘要

评估蛋白质复合物和组装体四级结构模型的准确性(EMA)对于预测四级结构并将其应用于研究蛋白质功能和相互作用非常重要。结构模型之间的成对相似性已被证明可用于评估蛋白质三级结构模型的质量,但很少应用于预测四级结构模型的质量。此外,当成对相似性方法应用于许多低质量且彼此相似的结构模型时,往往会失效。为了弥补这一差距,我们开发了一种混合方法(MULTICOM_qa),该方法结合了成对相似性分数(PSS)和基于深度学习链间接触预测的界面接触概率分数(ICPS),用于评估蛋白质复合物模型的准确性。它在2022年盲目参加了第15届蛋白质结构预测技术关键评估(CASP15),在评估组装模型的全局准确性方面,在24个预测器中排名第一。MULTICOM_qa预测的模型质量分数与CASP15组装目标模型的真实质量分数之间的平均每个目标相关系数为0.66。使用预测的质量分数对模型进行排名时,平均每个目标的排名损失为0.14。它能够为大多数目标选择良好的模型。此外,还识别并分析了EMA的几个关键因素(即目标难度、模型采样难度、模型质量的偏度以及好/坏模型之间的相似性)。结果表明,将多模型方法(PSS)与互补的单模型方法(ICPS)相结合是一种有前途的EMA方法。MULTICOM_qa的源代码可在https://github.com/BioinfoMachineLearning/MULTICOM_qa获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/0714364a05ec/nihpp-2023.03.08.531814v1-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/5ecd923e1d58/nihpp-2023.03.08.531814v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/6ac4f87e5d52/nihpp-2023.03.08.531814v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/fa704082ca64/nihpp-2023.03.08.531814v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/a5ee6901dc28/nihpp-2023.03.08.531814v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/b4e60629a252/nihpp-2023.03.08.531814v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/b7a89ce0a73c/nihpp-2023.03.08.531814v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/e40eb5133849/nihpp-2023.03.08.531814v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/9276fd46691b/nihpp-2023.03.08.531814v1-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/0714364a05ec/nihpp-2023.03.08.531814v1-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/5ecd923e1d58/nihpp-2023.03.08.531814v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/6ac4f87e5d52/nihpp-2023.03.08.531814v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/fa704082ca64/nihpp-2023.03.08.531814v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/a5ee6901dc28/nihpp-2023.03.08.531814v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/b4e60629a252/nihpp-2023.03.08.531814v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/b7a89ce0a73c/nihpp-2023.03.08.531814v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/e40eb5133849/nihpp-2023.03.08.531814v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/9276fd46691b/nihpp-2023.03.08.531814v1-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d6/10028888/0714364a05ec/nihpp-2023.03.08.531814v1-f0009.jpg

相似文献

1
Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15.结合成对结构相似性和深度学习界面接触预测来评估CASP15中蛋白质复合物模型的准确性。
bioRxiv. 2023 Mar 12:2023.03.08.531814. doi: 10.1101/2023.03.08.531814.
2
Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15.结合成对结构相似性和深度学习界面接触预测来估计 CASP15 中蛋白质复合物模型的准确性。
Proteins. 2023 Dec;91(12):1889-1902. doi: 10.1002/prot.26542. Epub 2023 Jun 26.
3
Enhancing AlphaFold-Multimer-based Protein Complex Structure Prediction with MULTICOM in CASP15.在蛋白质结构预测关键评估第15轮(CASP15)中利用MULTICOM增强基于AlphaFold-Multimer的蛋白质复合物结构预测
bioRxiv. 2023 May 18:2023.05.16.541055. doi: 10.1101/2023.05.16.541055.
4
Improving AlphaFold2-based protein tertiary structure prediction with MULTICOM in CASP15.在蛋白质结构预测关键评估第15轮(CASP15)中使用MULTICOM改进基于AlphaFold2的蛋白质三级结构预测
Commun Chem. 2023 Sep 7;6(1):188. doi: 10.1038/s42004-023-00991-6.
5
Enhancing alphafold-multimer-based protein complex structure prediction with MULTICOM in CASP15.利用 MULTICOM 增强基于 AlphaFold-Multimer 的蛋白质复合物结构预测在 CASP15 中的应用。
Commun Biol. 2023 Nov 10;6(1):1140. doi: 10.1038/s42003-023-05525-3.
6
Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14.通过深度学习和距离预测改进 CASP14 中的蛋白质三级结构预测。
Proteins. 2022 Jan;90(1):58-72. doi: 10.1002/prot.26186. Epub 2021 Jul 27.
7
Estimation of model accuracy in CASP15 using the ModFOLDdock server.使用 ModFOLDdock 服务器评估 CASP15 中的模型精度。
Proteins. 2023 Dec;91(12):1871-1878. doi: 10.1002/prot.26532. Epub 2023 Jun 14.
8
MULTICOM2 open-source protein structure prediction system powered by deep learning and distance prediction.基于深度学习和距离预测的 MULTICOM2 开源蛋白质结构预测系统。
Sci Rep. 2021 Jun 23;11(1):13155. doi: 10.1038/s41598-021-92395-6.
9
Designing and evaluating the MULTICOM protein local and global model quality prediction methods in the CASP10 experiment.在蛋白质结构预测关键评估第10轮(CASP10)实验中设计并评估多组件蛋白质局部和全局模型质量预测方法。
BMC Struct Biol. 2014 Apr 15;14:13. doi: 10.1186/1472-6807-14-13.
10
Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14.基于深度学习和残差距离预测的蛋白质模型准确性估计在 CASP14 中的应用。
Sci Rep. 2021 May 25;11(1):10943. doi: 10.1038/s41598-021-90303-6.

本文引用的文献

1
A gated graph transformer for protein complex structure quality assessment and its performance in CASP15.门控图转换器用于蛋白质复合物结构质量评估及其在 CASP15 中的性能。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i308-i317. doi: 10.1093/bioinformatics/btad203.
2
3D-equivariant graph neural networks for protein model quality assessment.用于蛋白质模型质量评估的 3D 等变图神经网络。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad030.
3
Improved model quality assessment using sequence and structural information by enhanced deep neural networks.
通过增强深度神经网络利用序列和结构信息改进模型质量评估。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac507.
4
Prediction of inter-chain distance maps of protein complexes with 2D attention-based deep neural networks.基于二维注意力的深度学习神经网络预测蛋白质复合物的链间距离图谱。
Nat Commun. 2022 Nov 15;13(1):6963. doi: 10.1038/s41467-022-34600-2.
5
Predicting the structure of large protein complexes using AlphaFold and Monte Carlo tree search.使用 AlphaFold 和蒙特卡罗树搜索预测大型蛋白质复合物的结构。
Nat Commun. 2022 Oct 12;13(1):6028. doi: 10.1038/s41467-022-33729-4.
6
A deep dilated convolutional residual network for predicting interchain contacts of protein homodimers.一种用于预测蛋白质同源二聚体链间接触的深度扩张卷积残差网络。
Bioinformatics. 2022 Mar 28;38(7):1904-1910. doi: 10.1093/bioinformatics/btac063.
7
Deep graph learning of inter-protein contacts.蛋白质间接触的深度图学习。
Bioinformatics. 2022 Jan 27;38(4):947-953. doi: 10.1093/bioinformatics/btab761.
8
DeepComplex: A Web Server of Predicting Protein Complex Structures by Deep Learning Inter-chain Contact Prediction and Distance-Based Modelling.深度复合物:一个通过深度学习链间接触预测和基于距离的建模来预测蛋白质复合物结构的网络服务器。
Front Mol Biosci. 2021 Aug 23;8:716973. doi: 10.3389/fmolb.2021.716973. eCollection 2021.
9
Prediction of protein assemblies, the next frontier: The CASP14-CAPRI experiment.预测蛋白质组装体,下一个前沿:CASP14-CAPRI 实验。
Proteins. 2021 Dec;89(12):1800-1823. doi: 10.1002/prot.26222. Epub 2021 Sep 13.
10
Assessment of protein model structure accuracy estimation in CASP14: Old and new challenges.评估 CASP14 中蛋白质模型结构准确性估计:新老挑战。
Proteins. 2021 Dec;89(12):1940-1948. doi: 10.1002/prot.26192. Epub 2021 Aug 5.