Feature space resampling for protein conformational search.

Affiliation

Department of Electrical Engineering and Computer Science, University of California, Berkeley, 94720, USA.

Publication information

Proteins. 2010 May 1;78(6):1583-93. doi: 10.1002/prot.22677.

Abstract

De novo protein structure prediction requires locating the lowest energy state of the polypeptide chain among a vast set of possible conformations. Powerful approaches include conformational space annealing, in which the search progressively focuses on the most promising regions of conformational space, and genetic algorithms, in which features of the best conformations identified so far are recombined. We describe a new approach that combines the strengths of these two methods. Protein conformations are projected onto a discrete feature space that includes backbone torsion angles, secondary structure, and beta pairings. Each feature has one "native" value: the one found in the native structure. We begin with a large number of conformations generated in independent Monte Carlo structure prediction trajectories from Rosetta. Native values for each feature are predicted from the frequencies of feature value occurrences and the energy distribution in the conformations containing them. A second round of structure prediction trajectories is then guided by the predicted native feature distributions. We show that native features can be predicted at much higher than background rates, and that using the predicted feature distributions improves structure prediction in a benchmark of 28 proteins. The advantages of our approach are that features from many different input structures can be combined simultaneously without producing atomic clashes or otherwise physically inviable models, and that the features being recombined have a relatively high chance of being correct.
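The resampling idea in the abstract can be illustrated with a minimal sketch. This is not the authors' estimator: the abstract only states that native values are predicted from feature value frequencies and the energy distribution, so the Boltzmann-style weighting, the `temperature` and `pseudocount` parameters, the feature names (`ss_10`, `torsion_10`), and the toy energies below are all assumptions made for illustration.

```python
import math
import random
from collections import defaultdict


def predict_feature_distributions(decoys, temperature=2.0, pseudocount=1e-3):
    """Estimate a distribution over values for each discrete feature.

    decoys: list of (features, energy) pairs, where features maps a
    feature name to its discrete value in that conformation.
    Assumption: low-energy conformations get exponentially more weight,
    so both frequency and energy influence the estimate.
    """
    e_min = min(energy for _, energy in decoys)
    scores = defaultdict(lambda: defaultdict(float))
    for features, energy in decoys:
        weight = math.exp(-(energy - e_min) / temperature)
        for name, value in features.items():
            scores[name][value] += weight

    distributions = {}
    for name, value_scores in scores.items():
        # The pseudocount keeps rarely seen values from reaching zero probability.
        total = sum(s + pseudocount for s in value_scores.values())
        distributions[name] = {
            value: (s + pseudocount) / total for value, s in value_scores.items()
        }
    return distributions


def sample_feature_constraints(distributions, rng):
    """Draw one value per feature to guide a second-round trajectory."""
    return {
        name: rng.choices(list(dist), weights=list(dist.values()))[0]
        for name, dist in distributions.items()
    }


# Toy first-round population (hypothetical features and energies).
decoys = [
    ({"ss_10": "H", "torsion_10": "A"}, -120.0),
    ({"ss_10": "H", "torsion_10": "B"}, -118.0),
    ({"ss_10": "E", "torsion_10": "B"}, -95.0),
]

distributions = predict_feature_distributions(decoys)
constraints = sample_feature_constraints(distributions, random.Random(0))
```

Because each feature is sampled independently from its own distribution, values drawn from many different input conformations can be combined in a single set of constraints; in the approach described above, the second-round trajectories, rather than direct splicing of coordinates, are what turn those constraints into physically plausible models.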
