• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于序列变异衍生接触辅助的球状蛋白质从头结构预测。

De novo structure prediction of globular proteins aided by sequence variation-derived contacts.

作者信息

Kosciolek Tomasz, Jones David T

机构信息

Bioinformatics Group, Department of Computer Science, University College London, London, United Kingdom; Institute of Structural and Molecular Biology, University College London, London, United Kingdom.

出版信息

PLoS One. 2014 Mar 17;9(3):e92197. doi: 10.1371/journal.pone.0092197. eCollection 2014.

DOI:10.1371/journal.pone.0092197
PMID:24637808
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3956894/
Abstract

The advent of high accuracy residue-residue intra-protein contact prediction methods enabled a significant boost in the quality of de novo structure predictions. Here, we investigate the potential benefits of combining a well-established fragment-based folding algorithm--FRAGFOLD, with PSICOV, a contact prediction method which uses sparse inverse covariance estimation to identify co-varying sites in multiple sequence alignments. Using a comprehensive set of 150 diverse globular target proteins, up to 266 amino acids in length, we are able to address the effectiveness and some limitations of such approaches to globular proteins in practice. Overall we find that using fragment assembly with both statistical potentials and predicted contacts is significantly better than either statistical potentials or contacts alone. Results show up to nearly 80% of correct predictions (TM-score ≥0.5) within analysed dataset and a mean TM-score of 0.54. Unsuccessful modelling cases emerged either from conformational sampling problems, or insufficient contact prediction accuracy. Nevertheless, a strong dependency of the quality of final models on the fraction of satisfied predicted long-range contacts was observed. This not only highlights the importance of these contacts on determining the protein fold, but also (combined with other ensemble-derived qualities) provides a powerful guide as to the choice of correct models and the global quality of the selected model. A proposed quality assessment scoring function achieves 0.93 precision and 0.77 recall for the discrimination of correct folds on our dataset of decoys. These findings suggest the approach is well-suited for blind predictions on a variety of globular proteins of unknown 3D structure, provided that enough homologous sequences are available to construct a large and accurate multiple sequence alignment for the initial contact prediction step.

摘要

高精度蛋白质内残基-残基接触预测方法的出现,使得从头结构预测的质量得到了显著提升。在此,我们研究了将一种成熟的基于片段的折叠算法——FRAGFOLD与PSICOV相结合的潜在益处,PSICOV是一种接触预测方法,它使用稀疏逆协方差估计来识别多序列比对中的共变位点。使用一组包含150种不同的球状目标蛋白(长度可达266个氨基酸)的综合数据集,我们能够在实践中探讨此类方法对球状蛋白的有效性及一些局限性。总体而言,我们发现结合使用具有统计势和预测接触的片段组装方法,比单独使用统计势或接触要好得多。结果显示,在分析的数据集中,高达近80%的预测是正确的(TM分数≥0.5),平均TM分数为0.54。建模失败的情况要么源于构象采样问题,要么源于接触预测精度不足。然而,我们观察到最终模型的质量强烈依赖于满足预测的长程接触的比例。这不仅突出了这些接触在确定蛋白质折叠方面的重要性,而且(与其他源自集成的质量相结合)为正确模型的选择和所选模型的整体质量提供了有力指导。对于我们的诱饵数据集上正确折叠的判别,一种提出的质量评估评分函数实现了0.93的精度和0.77的召回率。这些发现表明,该方法非常适合对各种未知三维结构的球状蛋白进行盲预测,前提是有足够的同源序列可用于构建用于初始接触预测步骤的大型且准确的多序列比对。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/b5e7c78b5afa/pone.0092197.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/972a2141fbfa/pone.0092197.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/82d914b18fa0/pone.0092197.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/907a1d815874/pone.0092197.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/059346b0ad28/pone.0092197.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/2c14b313cb0d/pone.0092197.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/10e7874dde11/pone.0092197.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/9f5d40fb8eb7/pone.0092197.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/13039bd17c5f/pone.0092197.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/a5223ce7a2e7/pone.0092197.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/18dad652e6e5/pone.0092197.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/c0e879ba93fb/pone.0092197.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/b5e7c78b5afa/pone.0092197.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/972a2141fbfa/pone.0092197.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/82d914b18fa0/pone.0092197.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/907a1d815874/pone.0092197.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/059346b0ad28/pone.0092197.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/2c14b313cb0d/pone.0092197.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/10e7874dde11/pone.0092197.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/9f5d40fb8eb7/pone.0092197.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/13039bd17c5f/pone.0092197.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/a5223ce7a2e7/pone.0092197.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/18dad652e6e5/pone.0092197.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/c0e879ba93fb/pone.0092197.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6ac/3956894/b5e7c78b5afa/pone.0092197.g012.jpg

相似文献

1
De novo structure prediction of globular proteins aided by sequence variation-derived contacts.基于序列变异衍生接触辅助的球状蛋白质从头结构预测。
PLoS One. 2014 Mar 17;9(3):e92197. doi: 10.1371/journal.pone.0092197. eCollection 2014.
2
MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.MetaPSICOV:结合协同进化方法用于精确预测蛋白质中的接触和长程氢键
Bioinformatics. 2015 Apr 1;31(7):999-1006. doi: 10.1093/bioinformatics/btu791. Epub 2014 Nov 26.
3
COUSCOus: improved protein contact prediction using an empirical Bayes covariance estimator.COUSCOus:使用经验贝叶斯协方差估计器改进蛋白质接触预测。
BMC Bioinformatics. 2016 Dec 15;17(1):533. doi: 10.1186/s12859-016-1400-3.
4
CONFOLD: Residue-residue contact-guided ab initio protein folding.CONFOLD:基于残基-残基接触引导的从头算蛋白质折叠。
Proteins. 2015 Aug;83(8):1436-49. doi: 10.1002/prot.24829. Epub 2015 Jun 6.
5
Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning.通过整合深度多序列比对、协同进化和机器学习进行蛋白质接触预测。
Proteins. 2018 Mar;86 Suppl 1(Suppl 1):84-96. doi: 10.1002/prot.25405. Epub 2017 Oct 31.
6
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
7
DeepCDpred: Inter-residue distance and contact prediction for improved prediction of protein structure.DeepCDpred:用于改进蛋白质结构预测的残差间距离和接触预测。
PLoS One. 2019 Jan 8;14(1):e0205214. doi: 10.1371/journal.pone.0205214. eCollection 2019.
8
Contact prediction for beta and alpha-beta proteins using integer linear optimization and its impact on the first principles 3D structure prediction method ASTRO-FOLD.使用整数线性优化进行β和α-β 蛋白的接触预测及其对第一性原理 3D 结构预测方法 ASTRO-FOLD 的影响。
Proteins. 2010 Jun;78(8):1825-46. doi: 10.1002/prot.22696.
9
DeepHelicon: Accurate prediction of inter-helical residue contacts in transmembrane proteins by residual neural networks.DeepHelicon:通过残差神经网络准确预测跨膜蛋白中螺旋间残基接触。
J Struct Biol. 2020 Oct 1;212(1):107574. doi: 10.1016/j.jsb.2020.107574. Epub 2020 Jul 11.
10
A comprehensive assessment of sequence-based and template-based methods for protein contact prediction.基于序列和基于模板的蛋白质接触预测方法的综合评估。
Bioinformatics. 2008 Apr 1;24(7):924-31. doi: 10.1093/bioinformatics/btn069. Epub 2008 Feb 22.

引用本文的文献

1
Designing and development of efficient multi-epitope-based peptide vaccine candidate against emerging avian rotavirus strains: A vaccinomic approach.针对新兴禽轮状病毒株的基于多表位的高效肽疫苗候选物的设计与开发:一种疫苗组学方法。
J Genet Eng Biotechnol. 2024 Sep;22(3):100398. doi: 10.1016/j.jgeb.2024.100398. Epub 2024 Jun 27.
2
Biophysics-based protein language models for protein engineering.用于蛋白质工程的基于生物物理学的蛋白质语言模型。
bioRxiv. 2025 Jan 14:2024.03.15.585128. doi: 10.1101/2024.03.15.585128.
3
Genetically Fusing Order-Promoting and Thermoresponsive Building Blocks to Design Hybrid Biomaterials.

本文引用的文献

1
Assessing the utility of coevolution-based residue-residue contact predictions in a sequence- and structure-rich era.在序列和结构丰富的时代评估基于共进化的残基-残基接触预测的效用。
Proc Natl Acad Sci U S A. 2013 Sep 24;110(39):15674-9. doi: 10.1073/pnas.1314045110. Epub 2013 Sep 5.
2
Emerging methods in protein co-evolution.蛋白质共进化的新兴方法。
Nat Rev Genet. 2013 Apr;14(4):249-61. doi: 10.1038/nrg3414. Epub 2013 Mar 5.
3
Improved contact prediction in proteins: using pseudolikelihoods to infer Potts models.蛋白质中改进的接触预测:使用伪似然性推断Potts模型。
基因融合促进有序和温敏性结构单元设计杂化生物材料。
Chemistry. 2024 May 28;30(30):e202400582. doi: 10.1002/chem.202400582. Epub 2024 Apr 10.
4
Exploratory Algorithm of a Multi-epitope-based Subunit Vaccine Candidate Against : Reverse Vaccinology-Based Immunoinformatic Approach.一种基于多表位的候选亚单位疫苗的探索性算法:基于反向疫苗学的免疫信息学方法。
Int J Pept Res Ther. 2022;28(5):134. doi: 10.1007/s10989-022-10438-6. Epub 2022 Jul 26.
5
Intelligent host engineering for metabolic flux optimisation in biotechnology.智能宿主工程在生物技术中代谢通量优化的应用。
Biochem J. 2021 Oct 29;478(20):3685-3721. doi: 10.1042/BCJ20210535.
6
Saturation Mutagenesis of the Transmembrane Region of HokC in Reveals Its High Tolerance to Mutations.霍克C跨膜区的饱和诱变揭示其对突变的高耐受性。
Int J Mol Sci. 2021 Sep 26;22(19):10359. doi: 10.3390/ijms221910359.
7
Capabilities of bioinformatics tools for optimizing physicochemical features of proteins used in Nano biosensors: A short overview of the tools related to bioinformatics.用于优化纳米生物传感器中蛋白质物理化学特性的生物信息学工具的能力:生物信息学相关工具概述
Biochem Biophys Rep. 2021 Aug 3;27:101094. doi: 10.1016/j.bbrep.2021.101094. eCollection 2021 Sep.
8
COMTOP: Protein Residue-Residue Contact Prediction through Mixed Integer Linear Optimization.COMTOP:通过混合整数线性优化进行蛋白质残基-残基接触预测。
Membranes (Basel). 2021 Jun 30;11(7):503. doi: 10.3390/membranes11070503.
9
Evaluation of residue-residue contact prediction methods: From retrospective to prospective.评估残基残基接触预测方法:从回顾性到前瞻性。
PLoS Comput Biol. 2021 May 24;17(5):e1009027. doi: 10.1371/journal.pcbi.1009027. eCollection 2021 May.
10
Comprehensive genome based analysis of Vibrio parahaemolyticus for identifying novel drug and vaccine molecules: Subtractive proteomics and vaccinomics approach.基于全基因组分析副溶血性弧菌以鉴定新型药物和疫苗分子:消减蛋白质组学和疫苗组学方法。
PLoS One. 2020 Aug 19;15(8):e0237181. doi: 10.1371/journal.pone.0237181. eCollection 2020.
Phys Rev E Stat Nonlin Soft Matter Phys. 2013 Jan;87(1):012707. doi: 10.1103/PhysRevE.87.012707. Epub 2013 Jan 11.
4
Protein structure prediction from sequence variation.从序列变异预测蛋白质结构。
Nat Biotechnol. 2012 Nov;30(11):1072-80. doi: 10.1038/nbt.2419.
5
Genomics-aided structure prediction.基于基因组学的结构预测。
Proc Natl Acad Sci U S A. 2012 Jun 26;109(26):10340-5. doi: 10.1073/pnas.1207864109. Epub 2012 Jun 12.
6
Accurate de novo structure prediction of large transmembrane protein domains using fragment-assembly and correlated mutation analysis.利用片段组装和相关突变分析准确从头预测大型跨膜蛋白结构域。
Proc Natl Acad Sci U S A. 2012 Jun 12;109(24):E1540-7. doi: 10.1073/pnas.1120036109. Epub 2012 May 29.
7
Three-dimensional structures of membrane proteins from genomic sequencing.从基因组测序中提取膜蛋白的三维结构。
Cell. 2012 Jun 22;149(7):1607-21. doi: 10.1016/j.cell.2012.04.012. Epub 2012 May 10.
8
Protein 3D structure computed from evolutionary sequence variation.基于进化序列变异计算的蛋白质 3D 结构。
PLoS One. 2011;6(12):e28766. doi: 10.1371/journal.pone.0028766. Epub 2011 Dec 7.
9
The Pfam protein families database.Pfam 蛋白质家族数据库。
Nucleic Acids Res. 2012 Jan;40(Database issue):D290-301. doi: 10.1093/nar/gkr1065. Epub 2011 Nov 29.
10
PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments.PSICOV:使用基于稀疏逆协方差估计的大型多重序列比对进行精确结构接触预测。
Bioinformatics. 2012 Jan 15;28(2):184-90. doi: 10.1093/bioinformatics/btr638. Epub 2011 Nov 17.