• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于探索构象空间的蛋白质结构表示:速度与准确性的权衡。

Representations of protein structure for exploring the conformational space: A speed-accuracy trade-off.

作者信息

Postic Guillaume, Janel Nathalie, Moroy Gautier

机构信息

Université de Paris, BFA, UMR 8251, CNRS, ERL U1133, Inserm, F-75013 Paris, France.

Université de Paris, BFA, UMR 8251, CNRS, F-75013 Paris, France.

出版信息

Comput Struct Biotechnol J. 2021 Apr 28;19:2618-2625. doi: 10.1016/j.csbj.2021.04.049. eCollection 2021.

DOI:10.1016/j.csbj.2021.04.049
PMID:34025948
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8120936/
Abstract

The recent breakthrough in the field of protein structure prediction shows the relevance of using knowledge-based based scoring functions in combination with a low-resolution 3D representation of protein macromolecules. The choice of not using all atoms is barely supported by any data in the literature, and is mostly motivated by empirical and practical reasons, such as the computational cost of assessing the numerous folds of the protein conformational space. Here, we present a comprehensive study, carried on a large and balanced benchmark of predicted protein structures, to see how different types of structural representations rank in either accuracy or calculation speed, and which ones offer the best compromise between these two criteria. We tested ten representations, including low-resolution, high-resolution, and coarse-grained approaches. We also investigated the generalization of the findings to other formalisms than the widely-used "potential of mean force" (PMF) method. Thus, we observed that representing protein structures by their β carbons-combined or not with Cα-provides the best speedaccuracy trade-off, when using a "total information gain" scoring function. For statistical PMFs, using MARTINI backbone and side-chains beads is the best option. Finally, we also demonstrated the necessity of training the reference state on all atom types, and of including the Cα atoms of glycine residues, in a Cβ-based representation.

摘要

蛋白质结构预测领域的最新突破表明,将基于知识的评分函数与蛋白质大分子的低分辨率三维表示相结合具有重要意义。不使用所有原子的选择在文献中几乎没有得到任何数据支持,其主要动机是经验和实际原因,例如评估蛋白质构象空间众多折叠的计算成本。在此,我们对大量且平衡的预测蛋白质结构基准进行了一项全面研究,以了解不同类型的结构表示在准确性或计算速度方面的排名情况,以及哪些表示在这两个标准之间提供了最佳折衷方案。我们测试了十种表示方法,包括低分辨率、高分辨率和粗粒度方法。我们还研究了这些发现对除广泛使用的“平均力势”(PMF)方法之外的其他形式主义的通用性。因此,我们观察到,当使用“总信息增益”评分函数时,用β碳原子(结合或不结合Cα)表示蛋白质结构能提供最佳的速度-准确性权衡。对于统计PMF,使用MARTINI主链和侧链珠子是最佳选择。最后,我们还证明了在基于Cβ的表示中,对所有原子类型训练参考状态以及包括甘氨酸残基的Cα原子的必要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/067a/8120936/5b06cdab7a8d/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/067a/8120936/1ab8c78ecd81/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/067a/8120936/a70b7987fa14/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/067a/8120936/5b06cdab7a8d/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/067a/8120936/1ab8c78ecd81/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/067a/8120936/a70b7987fa14/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/067a/8120936/5b06cdab7a8d/gr2.jpg

相似文献

1
Representations of protein structure for exploring the conformational space: A speed-accuracy trade-off.用于探索构象空间的蛋白质结构表示:速度与准确性的权衡。
Comput Struct Biotechnol J. 2021 Apr 28;19:2618-2625. doi: 10.1016/j.csbj.2021.04.049. eCollection 2021.
2
Separation of time scale and coupling in the motion governed by the coarse-grained and fine degrees of freedom in a polypeptide backbone.多肽主链中粗粒度和细粒度自由度所支配运动的时间尺度分离与耦合
J Chem Phys. 2007 Oct 21;127(15):155103. doi: 10.1063/1.2784200.
3
Template-Guided Protein Structure Prediction and Refinement Using Optimized Folding Landscape Force Fields.基于优化折叠景观力场的模板引导蛋白结构预测和精修。
J Chem Theory Comput. 2018 Nov 13;14(11):6102-6116. doi: 10.1021/acs.jctc.8b00683. Epub 2018 Oct 8.
4
All-Atom Four-Body Knowledge-Based Statistical Potentials to Distinguish Native Protein Structures from Nonnative Folds.全原子四体基于知识的统计势能,用于区分天然蛋白质结构和非天然折叠。
Biomed Res Int. 2017;2017:5760612. doi: 10.1155/2017/5760612. Epub 2017 Oct 8.
5
MARTINI-Based Protein-DNA Coarse-Grained HADDOCKing.基于MARTINI的蛋白质-脱氧核糖核酸粗粒度对接
Front Mol Biosci. 2019 Oct 1;6:102. doi: 10.3389/fmolb.2019.00102. eCollection 2019.
6
Minimalist representations and the importance of nearest neighbor effects in protein folding simulations.简约表示法及最近邻效应在蛋白质折叠模拟中的重要性。
J Mol Biol. 2006 Nov 3;363(4):835-57. doi: 10.1016/j.jmb.2006.08.035. Epub 2006 Aug 18.
7
Necessity of high-resolution for coarse-grained modeling of flexible proteins.柔性蛋白质粗粒化建模需要高分辨率。
J Comput Chem. 2016 Jul 5;37(18):1725-33. doi: 10.1002/jcc.24391. Epub 2016 Apr 29.
8
Statistically based reduced representation of amino acid side chains.基于统计学的氨基酸侧链简化表示法。
J Chem Inf Comput Sci. 2004 May-Jun;44(3):817-30. doi: 10.1021/ci034177z.
9
A coarse-grained elastic network atom contact model and its use in the simulation of protein dynamics and the prediction of the effect of mutations.一种粗粒度弹性网络原子接触模型及其在蛋白质动力学模拟和突变效应预测中的应用。
PLoS Comput Biol. 2014 Apr 24;10(4):e1003569. doi: 10.1371/journal.pcbi.1003569. eCollection 2014 Apr.
10
Improved Angle Potentials for Coarse-Grained Molecular Dynamics Simulations.用于粗粒度分子动力学模拟的改进角度势
J Chem Theory Comput. 2013 Aug 13;9(8):3282-92. doi: 10.1021/ct400219n. Epub 2013 Jul 16.

引用本文的文献

1
Deep learning-enhanced clustering and classification of protein molecule tertiary structures using weighted distance matrices.利用加权距离矩阵的深度学习增强蛋白质分子三级结构的聚类和分类
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf331.
2
An integrated protein structure fitness scoring approach for identifying native-like model structures.一种用于识别类天然模型结构的综合蛋白质结构适应性评分方法。
Comput Struct Biotechnol J. 2022 Nov 17;20:6467-6472. doi: 10.1016/j.csbj.2022.11.032. eCollection 2022.

本文引用的文献

1
CATH: increased structural coverage of functional space.CATH:增加功能空间的结构覆盖率。
Nucleic Acids Res. 2021 Jan 8;49(D1):D266-D273. doi: 10.1093/nar/gkaa1079.
2
An information gain-based approach for evaluating protein structure models.一种基于信息增益的蛋白质结构模型评估方法。
Comput Struct Biotechnol J. 2020;18:2228-2236. doi: 10.1016/j.csbj.2020.08.013. Epub 2020 Aug 18.
3
Protein Thermal Stability Engineering Using HoTMuSiC.利用 HoTMuSiC 进行蛋白质热稳定性工程。
Methods Mol Biol. 2020;2112:59-73. doi: 10.1007/978-1-0716-0270-6_5.
4
Improved protein structure prediction using potentials from deep learning.利用深度学习势进行蛋白质结构预测的改进。
Nature. 2020 Jan;577(7792):706-710. doi: 10.1038/s41586-019-1923-7. Epub 2020 Jan 15.
5
Accurate prediction of protein beta-aggregation with generalized statistical potentials.用广义统计势能准确预测蛋白质β聚集
Bioinformatics. 2020 Apr 1;36(7):2076-2081. doi: 10.1093/bioinformatics/btz912.
6
SOLart: a structure-based method to predict protein solubility and aggregation.SOLart:一种基于结构的预测蛋白质可溶性和聚集性的方法。
Bioinformatics. 2020 Mar 1;36(5):1445-1452. doi: 10.1093/bioinformatics/btz773.
7
A Practical View of the Martini Force Field.对Martini力场的实际观点。
Methods Mol Biol. 2019;2022:105-127. doi: 10.1007/978-1-4939-9608-7_5.
8
What is the best reference state for building statistical potentials in RNA 3D structure evaluation?在 RNA 3D 结构评估中构建统计势时,最佳的参考状态是什么?
RNA. 2019 Jul;25(7):793-812. doi: 10.1261/rna.069872.118. Epub 2019 Apr 17.
9
An Overview of Scoring Functions Used for Protein-Ligand Interactions in Molecular Docking.用于分子对接中蛋白质-配体相互作用的评分函数概述。
Interdiscip Sci. 2019 Jun;11(2):320-328. doi: 10.1007/s12539-019-00327-w. Epub 2019 Mar 15.
10
MyPMFs: a simple tool for creating statistical potentials to assess protein structural models.MyPMFs:一种用于创建统计势以评估蛋白质结构模型的简单工具。
Biochimie. 2018 Aug;151:37-41. doi: 10.1016/j.biochi.2018.05.013. Epub 2018 May 29.