• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用对蛋白质样接触模式的识别改进接触预测。

Improved contact predictions using the recognition of protein like contact patterns.

作者信息

Skwark Marcin J, Raimondi Daniele, Michel Mirco, Elofsson Arne

机构信息

Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden; Science for Life Laboratory, Stockholm University, Solna, Sweden; Department of Information and Computer Science, Aalto University, Aalto, Finland.

Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden; Science for Life Laboratory, Stockholm University, Solna, Sweden; Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, La Plaine Campus, Triomflaan, Brussels, Belgium.

出版信息

PLoS Comput Biol. 2014 Nov 6;10(11):e1003889. doi: 10.1371/journal.pcbi.1003889. eCollection 2014 Nov.

DOI:10.1371/journal.pcbi.1003889
PMID:25375897
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4222596/
Abstract

Given sufficient large protein families, and using a global statistical inference approach, it is possible to obtain sufficient accuracy in protein residue contact predictions to predict the structure of many proteins. However, these approaches do not consider the fact that the contacts in a protein are neither randomly, nor independently distributed, but actually follow precise rules governed by the structure of the protein and thus are interdependent. Here, we present PconsC2, a novel method that uses a deep learning approach to identify protein-like contact patterns to improve contact predictions. A substantial enhancement can be seen for all contacts independently on the number of aligned sequences, residue separation or secondary structure type, but is largest for β-sheet containing proteins. In addition to being superior to earlier methods based on statistical inferences, in comparison to state of the art methods using machine learning, PconsC2 is superior for families with more than 100 effective sequence homologs. The improved contact prediction enables improved structure prediction.

摘要

给定足够多的大蛋白质家族,并使用全局统计推断方法,在蛋白质残基接触预测中有可能获得足够的准确性,从而预测许多蛋白质的结构。然而,这些方法没有考虑到蛋白质中的接触既不是随机分布,也不是独立分布的,而是实际上遵循由蛋白质结构所支配的精确规则,因此是相互依赖的。在此,我们提出了PconsC2,这是一种使用深度学习方法来识别类似蛋白质的接触模式以改进接触预测的新方法。对于所有接触,无论比对序列的数量、残基间距或二级结构类型如何,都能看到显著的增强,而对于含β折叠的蛋白质增强最大。除了优于基于统计推断的早期方法外,与使用机器学习的现有方法相比,PconsC2对于具有100多个有效序列同源物的家族更具优势。改进的接触预测能够实现改进的结构预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/a99005637ae6/pcbi.1003889.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/cc14fe203710/pcbi.1003889.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/fd953f0004a3/pcbi.1003889.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/ad2ed28dacfc/pcbi.1003889.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/65571eb8375c/pcbi.1003889.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/d288866648da/pcbi.1003889.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/90b8085d1e14/pcbi.1003889.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/023efe37b39c/pcbi.1003889.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/a99005637ae6/pcbi.1003889.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/cc14fe203710/pcbi.1003889.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/fd953f0004a3/pcbi.1003889.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/ad2ed28dacfc/pcbi.1003889.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/65571eb8375c/pcbi.1003889.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/d288866648da/pcbi.1003889.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/90b8085d1e14/pcbi.1003889.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/023efe37b39c/pcbi.1003889.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d0f/4222596/a99005637ae6/pcbi.1003889.g008.jpg

相似文献

1
Improved contact predictions using the recognition of protein like contact patterns.利用对蛋白质样接触模式的识别改进接触预测。
PLoS Comput Biol. 2014 Nov 6;10(11):e1003889. doi: 10.1371/journal.pcbi.1003889. eCollection 2014 Nov.
2
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
3
Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning.通过整合深度多序列比对、协同进化和机器学习进行蛋白质接触预测。
Proteins. 2018 Mar;86 Suppl 1(Suppl 1):84-96. doi: 10.1002/prot.25405. Epub 2017 Oct 31.
4
Global sequence properties for superfamily prediction: a machine learning approach.用于超家族预测的全局序列特性:一种机器学习方法。
J Integr Bioinform. 2009 Aug 23;6(1):109. doi: 10.2390/biecoll-jib-2009-109.
5
A comprehensive assessment of sequence-based and template-based methods for protein contact prediction.基于序列和基于模板的蛋白质接触预测方法的综合评估。
Bioinformatics. 2008 Apr 1;24(7):924-31. doi: 10.1093/bioinformatics/btn069. Epub 2008 Feb 22.
6
Prediction of beta-turns at over 80% accuracy based on an ensemble of predicted secondary structures and multiple alignments.基于预测的二级结构集合和多重比对,以超过80%的准确率预测β转角。
BMC Bioinformatics. 2008 Oct 10;9:430. doi: 10.1186/1471-2105-9-430.
7
Protein Residue Contacts and Prediction Methods.蛋白质残基接触与预测方法
Methods Mol Biol. 2016;1415:463-76. doi: 10.1007/978-1-4939-3572-7_24.
8
Improving accuracy of protein contact prediction using balanced network deconvolution.利用平衡网络去卷积提高蛋白质接触预测的准确性。
Proteins. 2015 Mar;83(3):485-96. doi: 10.1002/prot.24744. Epub 2015 Jan 24.
9
PrDOS: prediction of disordered protein regions from amino acid sequence.PrDOS:从氨基酸序列预测无序蛋白质区域
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W460-4. doi: 10.1093/nar/gkm363. Epub 2007 Jun 12.
10
Improved residue contact prediction using support vector machines and a large feature set.使用支持向量机和大量特征集改进残基接触预测。
BMC Bioinformatics. 2007 Apr 2;8:113. doi: 10.1186/1471-2105-8-113.

引用本文的文献

1
An outlook on structural biology after AlphaFold: tools, limits and perspectives.AlphaFold之后的结构生物学展望:工具、局限与前景
FEBS Open Bio. 2025 Feb;15(2):202-222. doi: 10.1002/2211-5463.13902. Epub 2024 Sep 23.
2
Tertiary structure assessment at CASP15.三级结构评估在 CASP15。
Proteins. 2023 Dec;91(12):1616-1635. doi: 10.1002/prot.26593. Epub 2023 Sep 25.
3
Toward Characterising the Cellular 3D-Proteome.迈向细胞三维蛋白质组的表征

本文引用的文献

1
PconsFold: improved contact predictions improve protein models.PconsFold:改进的接触预测可提升蛋白质模型。
Bioinformatics. 2014 Sep 1;30(17):i482-8. doi: 10.1093/bioinformatics/btu458.
2
Assessing the accuracy of physical models used in protein-folding simulations: quantitative evidence from long molecular dynamics simulations.评估蛋白质折叠模拟中使用的物理模型的准确性:来自长分子动力学模拟的定量证据。
Curr Opin Struct Biol. 2014 Feb;24:98-105. doi: 10.1016/j.sbi.2013.12.006. Epub 2014 Jan 24.
3
Assessment of template-free modeling in CASP10 and ROLL.
Front Bioinform. 2021 Mar 29;1:598878. doi: 10.3389/fbinf.2021.598878. eCollection 2021.
4
rrQNet: Protein contact map quality estimation by deep evolutionary reconciliation.rrQNet:通过深度进化协调进行蛋白质接触图质量估计。
Proteins. 2022 Dec;90(12):2023-2034. doi: 10.1002/prot.26394. Epub 2022 Jul 12.
5
Gene prediction of aging-related diseases based on DNN and Mashup.基于 DNN 和 Mashup 的衰老相关疾病基因预测。
BMC Bioinformatics. 2021 Dec 17;22(1):597. doi: 10.1186/s12859-021-04518-5.
6
Machine-Learned Molecular Surface and Its Application to Implicit Solvent Simulations.基于机器学习的分子表面及其在隐溶剂模拟中的应用。
J Chem Theory Comput. 2021 Oct 12;17(10):6214-6224. doi: 10.1021/acs.jctc.1c00492. Epub 2021 Sep 13.
7
Evaluation of residue-residue contact prediction methods: From retrospective to prospective.评估残基残基接触预测方法:从回顾性到前瞻性。
PLoS Comput Biol. 2021 May 24;17(5):e1009027. doi: 10.1371/journal.pcbi.1009027. eCollection 2021 May.
8
Accurate contact-based modelling of repeat proteins predicts the structure of new repeats protein families.准确的基于接触的重复蛋白建模预测了新的重复蛋白家族的结构。
PLoS Comput Biol. 2021 Apr 15;17(4):e1008798. doi: 10.1371/journal.pcbi.1008798. eCollection 2021 Apr.
9
Crystallographic molecular replacement using an in silico-generated search model of SARS-CoV-2 ORF8.基于 SARS-CoV-2 ORF8 的计算机生成搜索模型的晶体学分子置换。
Protein Sci. 2021 Apr;30(4):728-734. doi: 10.1002/pro.4050. Epub 2021 Mar 4.
10
Hybrid methods for combined experimental and computational determination of protein structure.蛋白质结构的组合实验和计算测定的混合方法。
J Chem Phys. 2020 Dec 28;153(24):240901. doi: 10.1063/5.0026025.
在蛋白质结构预测关键评估第10轮(CASP10)和蛋白质结构预测连续评估(ROLL)中对无模板建模的评估。
Proteins. 2014 Feb;82 Suppl 2(Suppl 2):57-83. doi: 10.1002/prot.24470. Epub 2013 Dec 17.
4
A study and benchmark of DNcon: a method for protein residue-residue contact prediction using deep networks.DNcon:一种使用深度网络进行蛋白质残基残基接触预测的方法的研究和基准测试。
BMC Bioinformatics. 2013;14 Suppl 14(Suppl 14):S12. doi: 10.1186/1471-2105-14-S14-S12. Epub 2013 Oct 9.
5
CASP10 results compared to those of previous CASP experiments.将半胱天冬酶10(CASP10)的结果与之前的蛋白质结构预测关键评估(CASP)实验结果进行比较。
Proteins. 2014 Feb;82 Suppl 2(0 2):164-74. doi: 10.1002/prot.24448. Epub 2013 Dec 17.
6
Assessing the utility of coevolution-based residue-residue contact predictions in a sequence- and structure-rich era.在序列和结构丰富的时代评估基于共进化的残基-残基接触预测的效用。
Proc Natl Acad Sci U S A. 2013 Sep 24;110(39):15674-9. doi: 10.1073/pnas.1314045110. Epub 2013 Sep 5.
7
Predicting protein contact map using evolutionary and physical constraints by integer programming.利用整数规划进行进化和物理约束的蛋白质接触图预测。
Bioinformatics. 2013 Jul 1;29(13):i266-73. doi: 10.1093/bioinformatics/btt211.
8
PconsC: combination of direct information methods and alignments improves contact prediction.PconsC:直接信息方法和比对的组合提高了接触预测。
Bioinformatics. 2013 Jul 15;29(14):1815-6. doi: 10.1093/bioinformatics/btt259. Epub 2013 May 8.
9
Improved contact prediction in proteins: using pseudolikelihoods to infer Potts models.蛋白质中改进的接触预测:使用伪似然性推断Potts模型。
Phys Rev E Stat Nonlin Soft Matter Phys. 2013 Jan;87(1):012707. doi: 10.1103/PhysRevE.87.012707. Epub 2013 Jan 11.
10
ASTRO-FOLD 2.0: an Enhanced Framework for Protein Structure Prediction.ASTRO-FOLD 2.0:一种用于蛋白质结构预测的增强框架。
AIChE J. 2012 May 1;58(5):1619-1637. doi: 10.1002/aic.12669. Epub 2011 May 31.