Feature space resampling for protein conformational search.

Affiliation

Department of Electrical Engineering and Computer Science, University of California, Berkeley, 94720, USA.

Publication information

Proteins. 2010 May 1;78(6):1583-93. doi: 10.1002/prot.22677.

DOI:10.1002/prot.22677

PMID:20131376

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2854516/

Abstract

De novo protein structure prediction requires locating the lowest energy state of the polypeptide chain among a vast set of possible conformations. Powerful approaches include conformational space annealing, in which the search progressively focuses on the most promising regions of conformational space, and genetic algorithms, in which features of the best conformations identified so far are recombined. We describe a new method that combines the strengths of these two approaches. Protein conformations are projected onto a discrete feature space that includes backbone torsion angles, secondary structure, and beta pairings. For each of these features there is one "native" value: the one found in the native structure. We begin with a large number of conformations generated in independent Monte Carlo structure prediction trajectories from Rosetta. Native values for each feature are predicted from the frequencies of feature value occurrences and the energy distribution in conformations containing them. A second round of structure prediction trajectories is then guided by the predicted native feature distributions. We show that native features can be predicted at much higher than background rates, and that using the predicted feature distributions improves structure prediction in a benchmark of 28 proteins. The advantages of our approach are that features from many different input structures can be combined simultaneously without producing atomic clashes or otherwise physically inviable models, and that the features being recombined have a relatively high chance of being correct.
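The feature-prediction step described in the abstract can be illustrated with a minimal sketch. The abstract only states that native values are predicted from feature value frequencies and the energies of the conformations containing them; the Boltzmann-style weighting, the `temperature` and `pseudocount` parameters, the feature names, and both function names below are assumptions made for illustration and are not the paper's estimator or part of Rosetta.

```python
import math
import random
from collections import defaultdict


def predict_feature_distributions(decoys, temperature=1.0, pseudocount=1e-3):
    """Estimate a distribution over values for each discrete feature.

    decoys: list of (features, energy) pairs, where features maps a feature
    name (e.g. "ss_5") to a discrete value (e.g. "H").
    ASSUMPTION: each decoy contributes exp(-(E - E_min) / temperature), so
    values that are frequent AND found in low-energy decoys score highest.
    """
    if not decoys:
        raise ValueError("need at least one decoy")
    e_min = min(energy for _, energy in decoys)
    weights = defaultdict(lambda: defaultdict(float))
    for features, energy in decoys:
        w = math.exp(-(energy - e_min) / temperature)
        for name, value in features.items():
            weights[name][value] += w
    distributions = {}
    for name, value_weights in weights.items():
        # The pseudocount keeps rarely seen values from reaching zero probability.
        total = sum(value_weights.values()) + pseudocount * len(value_weights)
        distributions[name] = {
            value: (w + pseudocount) / total for value, w in value_weights.items()
        }
    return distributions


def sample_feature_values(distributions, rng):
    """Draw one value per feature, e.g. to bias a second-round trajectory."""
    return {
        name: rng.choices(list(dist), weights=list(dist.values()))[0]
        for name, dist in distributions.items()
    }


# Toy first-round decoys (hypothetical feature names and energies).
decoys = [
    ({"ss_5": "H", "torsion_5": "A"}, -120.0),
    ({"ss_5": "H", "torsion_5": "B"}, -118.0),
    ({"ss_5": "E", "torsion_5": "B"}, -100.0),
]
distributions = predict_feature_distributions(decoys, temperature=2.0)
guide = sample_feature_values(distributions, random.Random(0))
```

Because each feature is sampled independently in this sketch, the drawn values only serve as a bias for a new trajectory; the structure itself would still be built by the folding simulation, which is how the approach avoids the clashes that direct recombination of coordinates can produce.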

Similar articles

Sampling bottlenecks in de novo protein structure prediction.

J Mol Biol. 2009 Oct 16;393(1):249-60. doi: 10.1016/j.jmb.2009.07.063. Epub 2009 Jul 28.

A population-based evolutionary search approach to the multiple minima problem in de novo protein structure prediction.

BMC Struct Biol. 2013;13 Suppl 1(Suppl 1):S4. doi: 10.1186/1472-6807-13-S1-S4. Epub 2013 Nov 8.

Exploratory studies of ab initio protein structure prediction: multiple copy simulated annealing, AMBER energy functions, and a generalized born/solvent accessibility solvation model.

Proteins. 2002 Jan 1;46(1):128-46. doi: 10.1002/prot.10020.

A Novel Method Using Abstract Convex Underestimation in Ab-Initio Protein Structure Prediction for Guiding Search in Conformational Feature Space.

IEEE/ACM Trans Comput Biol Bioinform. 2016 Sep-Oct;13(5):887-900. doi: 10.1109/TCBB.2015.2497226. Epub 2015 Nov 2.

Improved beta-protein structure prediction by multilevel optimization of nonlocal strand pairings and local backbone conformation.

Proteins. 2006 Dec 1;65(4):922-9. doi: 10.1002/prot.21133.

LOOPER: a molecular mechanics-based algorithm for protein loop prediction.

Protein Eng Des Sel. 2008 Feb;21(2):91-100. doi: 10.1093/protein/gzm083. Epub 2008 Jan 14.

Constructing effective energy functions for protein structure prediction through broadening attraction-basin and reverse Monte Carlo sampling.

BMC Bioinformatics. 2019 Mar 29;20(Suppl 3):135. doi: 10.1186/s12859-019-2652-5.

Improving fragment quality for de novo structure prediction.

Proteins. 2014 Sep;82(9):2240-52. doi: 10.1002/prot.24587. Epub 2014 May 2.

Sixty-five years of the long march in protein secondary structure prediction: the final stretch?

Brief Bioinform. 2018 May 1;19(3):482-494. doi: 10.1093/bib/bbw129.

Cited by

Enhancing fragment-based protein structure prediction by customising fragment cardinality according to local secondary structure.

BMC Bioinformatics. 2020 May 1;21(1):170. doi: 10.1186/s12859-020-3491-0.

Chemical shift-based methods in NMR structure determination.

Prog Nucl Magn Reson Spectrosc. 2018 Jun-Aug;106-107:1-25. doi: 10.1016/j.pnmrs.2018.03.002. Epub 2018 Mar 11.

CS-ROSETTA.

Methods Enzymol. 2019;614:321-362. doi: 10.1016/bs.mie.2018.07.005. Epub 2018 Sep 11.

Combining physicochemical and evolutionary information for protein contact prediction.

PLoS One. 2014 Oct 22;9(10):e108438. doi: 10.1371/journal.pone.0108438. eCollection 2014.

Profile of Michael I. Jordan.

Proc Natl Acad Sci U S A. 2013 Jan 22;110(4):1141-3. doi: 10.1073/pnas.1222664110.

A probabilistic fragment-based protein structure prediction algorithm.

PLoS One. 2012;7(7):e38799. doi: 10.1371/journal.pone.0038799. Epub 2012 Jul 19.

Resolution-adapted recombination of structural features significantly improves sampling in restraint-guided structure calculation.

Proteins. 2012 Mar;80(3):884-95. doi: 10.1002/prot.23245.
