• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 PconsC3 预测数千个 Pfam 结构域家族中的准确接触。

Predicting accurate contacts in thousands of Pfam domain families using PconsC3.

机构信息

Science for Life Laboratory and Department of Biochemistry and Biophysics, Stockholm University, Stockholm 10691, Sweden.

Department of Chemistry, Vanderbilt University, Nashville, TN, USA.

出版信息

Bioinformatics. 2017 Sep 15;33(18):2859-2866. doi: 10.1093/bioinformatics/btx332.

DOI:10.1093/bioinformatics/btx332
PMID:28535189
Abstract

MOTIVATION

A few years ago it was shown that by using a maximum entropy approach to describe couplings between columns in a multiple sequence alignment it is possible to significantly increase the accuracy of residue contact predictions. For very large protein families with more than 1000 effective sequences the accuracy is sufficient to produce accurate models of proteins as well as complexes. Today, for about half of all Pfam domain families no structure is known, but unfortunately most of these families have at most a few hundred members, i.e. are too small for such contact prediction methods.

RESULTS

To extend accurate contact predictions to the thousands of smaller protein families we present PconsC3, a fast and improved method for protein contact predictions that can be used for families with even 100 effective sequence members. PconsC3 outperforms direct coupling analysis (DCA) methods significantly independent on family size, secondary structure content, contact range, or the number of selected contacts.

AVAILABILITY AND IMPLEMENTATION

PconsC3 is available as a web server and downloadable version at http://c3.pcons.net . The downloadable version is free for all to use and licensed under the GNU General Public License, version 2. At this site contact predictions for most Pfam families are also available. We do estimate that more than 4000 contact maps for Pfam families of unknown structure have more than 50% of the top-ranked contacts predicted correctly.

CONTACT

arne@bioinfo.se.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

几年前,有人证明通过使用最大熵方法来描述多序列比对中列之间的耦合,可以显著提高残基接触预测的准确性。对于具有 1000 多个有效序列的非常大的蛋白质家族,其准确性足以产生蛋白质和复合物的准确模型。如今,大约一半的 Pfam 结构域家族都没有结构,但不幸的是,这些家族中的大多数只有几百个成员,即对于这种接触预测方法来说太小了。

结果

为了将准确的接触预测扩展到数千个较小的蛋白质家族,我们提出了 PconsC3,这是一种快速改进的蛋白质接触预测方法,可用于具有 100 个有效序列成员的家族。PconsC3 在独立于家族大小、二级结构含量、接触范围或所选接触数量的情况下,明显优于直接耦合分析(DCA)方法。

可用性和实现

PconsC3 可作为网络服务器和可下载版本在 http://c3.pcons.net 使用。可下载版本可供所有人免费使用,并根据 GNU 通用公共许可证第 2 版获得许可。在这个站点上,还可以为大多数 Pfam 家族提供接触预测。我们估计,对于未知结构的 Pfam 家族的 4000 多个接触图中,超过 50%的顶级接触预测是正确的。

联系

arne@bioinfo.se。

补充信息

补充数据可在在线生物信息学中获得。

相似文献

1
Predicting accurate contacts in thousands of Pfam domain families using PconsC3.使用 PconsC3 预测数千个 Pfam 结构域家族中的准确接触。
Bioinformatics. 2017 Sep 15;33(18):2859-2866. doi: 10.1093/bioinformatics/btx332.
2
Large-scale structure prediction by improved contact predictions and model quality assessment.通过改进的接触预测和模型质量评估进行大规模结构预测。
Bioinformatics. 2017 Jul 15;33(14):i23-i29. doi: 10.1093/bioinformatics/btx239.
3
MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.MetaPSICOV:结合协同进化方法用于精确预测蛋白质中的接触和长程氢键
Bioinformatics. 2015 Apr 1;31(7):999-1006. doi: 10.1093/bioinformatics/btu791. Epub 2014 Nov 26.
4
PconsC: combination of direct information methods and alignments improves contact prediction.PconsC:直接信息方法和比对的组合提高了接触预测。
Bioinformatics. 2013 Jul 15;29(14):1815-6. doi: 10.1093/bioinformatics/btt259. Epub 2013 May 8.
5
Forecasting residue-residue contact prediction accuracy.预测残差-残基接触预测精度。
Bioinformatics. 2017 Nov 1;33(21):3405-3414. doi: 10.1093/bioinformatics/btx416.
6
Protein multiple sequence alignment benchmarking through secondary structure prediction.通过二级结构预测进行蛋白质多序列比对基准测试。
Bioinformatics. 2017 May 1;33(9):1331-1337. doi: 10.1093/bioinformatics/btw840.
7
DOMAC: an accurate, hybrid protein domain prediction server.DOMAC:一个准确的混合蛋白质结构域预测服务器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W354-6. doi: 10.1093/nar/gkm390. Epub 2007 Jun 6.
8
CCMpred--fast and precise prediction of protein residue-residue contacts from correlated mutations.CCMpred--快速准确地预测蛋白质残基-残基接触的相关突变。
Bioinformatics. 2014 Nov 1;30(21):3128-30. doi: 10.1093/bioinformatics/btu500. Epub 2014 Jul 26.
9
PconsFam: An Interactive Database of Structure Predictions of Pfam Families.PconsFam:Pfam 家族结构预测的交互式数据库。
J Mol Biol. 2019 Jun 14;431(13):2442-2448. doi: 10.1016/j.jmb.2019.01.047. Epub 2019 Feb 21.
10
The evolution of contact prediction: evidence that contact selection in statistical contact prediction is changing.接触预测的演变:接触选择在统计接触预测中发生变化的证据。
Bioinformatics. 2020 Mar 1;36(6):1750-1756. doi: 10.1093/bioinformatics/btz816.

引用本文的文献

1
Boosting the Full Potential of PyMOL with Structural Biology Plugins.利用结构生物学插件充分发挥 PyMOL 的潜力。
Biomolecules. 2022 Nov 27;12(12):1764. doi: 10.3390/biom12121764.
2
Limits to detecting epistasis in the fitness landscape of HIV.检测 HIV 适应度景观中上位性的局限性。
PLoS One. 2022 Jan 18;17(1):e0262314. doi: 10.1371/journal.pone.0262314. eCollection 2022.
3
Fueling ab initio folding with marine metagenomics enables structure and function predictions of new protein families.利用海洋宏基因组学为从头折叠提供燃料,能够对新的蛋白质家族进行结构和功能预测。
Genome Biol. 2019 Nov 1;20(1):229. doi: 10.1186/s13059-019-1823-z.
4
RFQAmodel: Random Forest Quality Assessment to identify a predicted protein structure in the correct fold.RFQAmodel:随机森林质量评估,用于识别正确折叠的预测蛋白结构。
PLoS One. 2019 Oct 21;14(10):e0218149. doi: 10.1371/journal.pone.0218149. eCollection 2019.
5
Assessing the accuracy of contact predictions in CASP13.评估 CASP13 中接触预测的准确性。
Proteins. 2019 Dec;87(12):1058-1068. doi: 10.1002/prot.25819. Epub 2019 Oct 24.
6
Evolutionary couplings detect side-chain interactions.进化偶联分析可检测侧链相互作用。
PeerJ. 2019 Jul 8;7:e7280. doi: 10.7717/peerj.7280. eCollection 2019.
7
Influence of multiple-sequence-alignment depth on Potts statistical models of protein covariation.多序列比对深度对蛋白质共变的 Potts 统计模型的影响。
Phys Rev E. 2019 Mar;99(3-1):032405. doi: 10.1103/PhysRevE.99.032405.
8
Synthetic protein alignments by CCMgen quantify noise in residue-residue contact prediction.CCMgen 通过合成蛋白比对量化残基残基接触预测中的噪声。
PLoS Comput Biol. 2018 Nov 5;14(11):e1006526. doi: 10.1371/journal.pcbi.1006526. eCollection 2018 Nov.
9
High precision in protein contact prediction using fully convolutional neural networks and minimal sequence features.利用全卷积神经网络和最小序列特征进行高精度蛋白质接触预测。
Bioinformatics. 2018 Oct 1;34(19):3308-3315. doi: 10.1093/bioinformatics/bty341.
10
DNCON2: improved protein contact prediction using two-level deep convolutional neural networks.DNCON2:使用两级深度卷积神经网络改进蛋白质接触预测。
Bioinformatics. 2018 May 1;34(9):1466-1472. doi: 10.1093/bioinformatics/btx781.