• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

接触预测对于最具信息量的接触最为困难,但通过纳入接触电势可以得到改善。

Contact prediction is hardest for the most informative contacts, but improves with the incorporation of contact potentials.

机构信息

Department of Computer Science, Dartmouth College, Hanover, NH 03755, United States of America.

Department of Biological Sciences, Dartmouth College, Hanover, NH 03755, United States of America.

出版信息

PLoS One. 2018 Jun 28;13(6):e0199585. doi: 10.1371/journal.pone.0199585. eCollection 2018.

DOI:10.1371/journal.pone.0199585
PMID:29953468
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6023208/
Abstract

Co-evolution between pairs of residues in a multiple sequence alignment (MSA) of homologous proteins has long been proposed as an indicator of structural contacts. Recently, several methods, such as direct-coupling analysis (DCA) and MetaPSICOV, have been shown to achieve impressive rates of contact prediction by taking advantage of considerable sequence data. In this paper, we show that prediction success rates are highly sensitive to the structural definition of a contact, with more permissive definitions (i.e., those classifying more pairs as true contacts) naturally leading to higher positive predictive rates, but at the expense of the amount of structural information contributed by each contact. Thus, the remaining limitations of contact prediction algorithms are most noticeable in conjunction with geometrically restrictive contacts-precisely those that contribute more information in structure prediction. We suggest that to improve prediction rates for such "informative" contacts one could combine co-evolution scores with additional indicators of contact likelihood. Specifically, we find that when a pair of co-varying positions in an MSA is occupied by residue pairs with favorable statistical contact energies, that pair is more likely to represent a true contact. We show that combining a contact potential metric with DCA or MetaPSICOV performs considerably better than DCA or MetaPSICOV alone, respectively. This is true regardless of contact definition, but especially true for stricter and more informative contact definitions. In summary, this work outlines some remaining challenges to be addressed in contact prediction and proposes and validates a promising direction towards improvement.

摘要

在同源蛋白质的多重序列比对(MSA)中,对残基对的共进化长期以来一直被认为是结构接触的指标。最近,几种方法,如直接耦合分析(DCA)和 MetaPSICOV,已经被证明通过利用大量的序列数据,可以实现令人印象深刻的接触预测率。在本文中,我们表明预测成功率对接触的结构定义非常敏感,更宽松的定义(即,将更多对归类为真实接触)自然会导致更高的阳性预测率,但代价是每个接触贡献的结构信息量。因此,接触预测算法的剩余限制在与几何限制接触(正是那些在结构预测中贡献更多信息的接触)结合时最为明显。我们建议,为了提高此类“信息丰富”接触的预测率,可以将共进化得分与接触可能性的其他指标相结合。具体来说,我们发现当 MSA 中的一对共变位置被具有有利统计接触能的残基对占据时,该对更有可能代表真实接触。我们表明,将接触势能度量与 DCA 或 MetaPSICOV 相结合的性能明显优于单独使用 DCA 或 MetaPSICOV。这无论接触定义如何都是正确的,但对于更严格和更具信息量的接触定义尤其如此。总之,这项工作概述了接触预测中仍需要解决的一些挑战,并提出并验证了一个有前途的改进方向。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a412/6023208/24a2f9093b39/pone.0199585.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a412/6023208/cd3f39966ea0/pone.0199585.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a412/6023208/af85790bb871/pone.0199585.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a412/6023208/0d07239ceb99/pone.0199585.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a412/6023208/12534766a091/pone.0199585.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a412/6023208/24a2f9093b39/pone.0199585.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a412/6023208/cd3f39966ea0/pone.0199585.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a412/6023208/af85790bb871/pone.0199585.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a412/6023208/0d07239ceb99/pone.0199585.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a412/6023208/12534766a091/pone.0199585.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a412/6023208/24a2f9093b39/pone.0199585.g005.jpg

相似文献

1
Contact prediction is hardest for the most informative contacts, but improves with the incorporation of contact potentials.接触预测对于最具信息量的接触最为困难,但通过纳入接触电势可以得到改善。
PLoS One. 2018 Jun 28;13(6):e0199585. doi: 10.1371/journal.pone.0199585. eCollection 2018.
2
KScons: a Bayesian approach for protein residue contact prediction using the knob-socket model of protein tertiary structure.KScons:一种使用蛋白质三级结构的旋钮-插座模型进行蛋白质残基接触预测的贝叶斯方法。
Bioinformatics. 2016 Dec 15;32(24):3774-3781. doi: 10.1093/bioinformatics/btw553. Epub 2016 Aug 24.
3
MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.MetaPSICOV:结合协同进化方法用于精确预测蛋白质中的接触和长程氢键
Bioinformatics. 2015 Apr 1;31(7):999-1006. doi: 10.1093/bioinformatics/btu791. Epub 2014 Nov 26.
4
Direct coupling analysis for protein contact prediction.用于蛋白质接触预测的直接耦合分析
Methods Mol Biol. 2014;1137:55-70. doi: 10.1007/978-1-4939-0366-5_5.
5
Contact pair dynamics during folding of two small proteins: chicken villin head piece and the Alzheimer protein beta-amyloid.两种小蛋白质折叠过程中的接触对动力学:鸡肌动蛋白结合蛋白头部片段和阿尔茨海默病蛋白β-淀粉样蛋白。
J Chem Phys. 2004 Jan 15;120(3):1602-12. doi: 10.1063/1.1633253.
6
CONFOLD: Residue-residue contact-guided ab initio protein folding.CONFOLD:基于残基-残基接触引导的从头算蛋白质折叠。
Proteins. 2015 Aug;83(8):1436-49. doi: 10.1002/prot.24829. Epub 2015 Jun 6.
7
De novo structure prediction of globular proteins aided by sequence variation-derived contacts.基于序列变异衍生接触辅助的球状蛋白质从头结构预测。
PLoS One. 2014 Mar 17;9(3):e92197. doi: 10.1371/journal.pone.0092197. eCollection 2014.
8
Identification of residue pairing in interacting β-strands from a predicted residue contact map.从预测的残基接触图中鉴定相互作用的β-折叠中的残基对。
BMC Bioinformatics. 2018 Apr 19;19(1):146. doi: 10.1186/s12859-018-2150-1.
9
Direct-coupling analysis of residue coevolution captures native contacts across many protein families.残基共进化的直接耦联分析捕获了许多蛋白质家族中的天然接触。
Proc Natl Acad Sci U S A. 2011 Dec 6;108(49):E1293-301. doi: 10.1073/pnas.1111471108. Epub 2011 Nov 21.
10
Prediction of contacts from correlated sequence substitutions.预测相关序列取代的接触。
Curr Opin Struct Biol. 2013 Jun;23(3):473-9. doi: 10.1016/j.sbi.2013.04.001. Epub 2013 May 14.

引用本文的文献

1
Peptides from human BNIP5 and PXT1 and non-native binders of pro-apoptotic BAK can directly activate or inhibit BAK-mediated membrane permeabilization.人源 BNIP5 和 PXT1 的肽段以及促凋亡 BAK 的非天然结合物可以直接激活或抑制 BAK 介导的膜通透性。
Structure. 2023 Mar 2;31(3):265-281.e7. doi: 10.1016/j.str.2023.01.001. Epub 2023 Jan 26.
2
Structure-conditioned amino-acid couplings: How contact geometry affects pairwise sequence preferences.结构条件化的氨基酸偶联:接触几何如何影响成对序列偏好。
Protein Sci. 2022 Apr;31(4):900-917. doi: 10.1002/pro.4280. Epub 2022 Feb 15.

本文引用的文献

1
Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks.利用深度卷积神经网络增强进化耦合。
Cell Syst. 2018 Jan 24;6(1):65-74.e3. doi: 10.1016/j.cels.2017.11.014. Epub 2017 Dec 20.
2
Improved protein contact predictions with the MetaPSICOV2 server in CASP12.在蛋白质结构预测技术关键评估第12轮(CASP12)中使用MetaPSICOV2服务器改进蛋白质接触预测。
Proteins. 2018 Mar;86 Suppl 1(Suppl Suppl 1):78-83. doi: 10.1002/prot.25379. Epub 2017 Sep 29.
3
EPSILON-CP: using deep learning to combine information from multiple sources for protein contact prediction.
EPSILON-CP:利用深度学习整合多源信息进行蛋白质接触预测。
BMC Bioinformatics. 2017 Jun 17;18(1):303. doi: 10.1186/s12859-017-1713-x.
4
NeBcon: protein contact map prediction using neural network training coupled with naïve Bayes classifiers.NeBcon:结合朴素贝叶斯分类器进行神经网络训练以预测蛋白质接触图谱。
Bioinformatics. 2017 Aug 1;33(15):2296-2306. doi: 10.1093/bioinformatics/btx164.
5
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
6
Tertiary alphabet for the observable protein structural universe.可观测蛋白质结构宇宙的三级字母表。
Proc Natl Acad Sci U S A. 2016 Nov 22;113(47):E7438-E7447. doi: 10.1073/pnas.1607178113. Epub 2016 Nov 3.
7
sDFIRE: Sequence-specific statistical energy function for protein structure prediction by decoy selections.sDFIRE:用于通过诱饵选择进行蛋白质结构预测的序列特异性统计能量函数。
J Comput Chem. 2016 May 5;37(12):1119-24. doi: 10.1002/jcc.24298. Epub 2016 Feb 5.
8
Improved de novo structure prediction in CASP11 by incorporating coevolution information into Rosetta.通过将协同进化信息整合到Rosetta中,改进了CASP11中的从头结构预测。
Proteins. 2016 Sep;84 Suppl 1(Suppl 1):67-75. doi: 10.1002/prot.24974. Epub 2016 Feb 24.
9
Dimeric interactions and complex formation using direct coevolutionary couplings.利用直接协同进化耦合进行二聚体相互作用和复合物形成。
Sci Rep. 2015 Sep 4;5:13652. doi: 10.1038/srep13652.
10
Tertiary structural propensities reveal fundamental sequence/structure relationships.三级结构倾向揭示基本的序列/结构关系。
Structure. 2015 May 5;23(5):961-971. doi: 10.1016/j.str.2015.03.015. Epub 2015 Apr 23.