Department of Chemistry, Rice University, Houston, Texas 77005, USA.
J Chem Phys. 2013 Sep 28;139(12):121908. doi: 10.1063/1.4813504.
We explore the similarities and differences between the energy landscapes of proteins that have been selected by nature and those of some proteins designed by humans. Natural proteins have evolved to function as well as fold, and this is a source of energetic frustration. The sequence of Top7, on the other hand, was designed with architecture alone in mind using only native state stability as the optimization criterion. Its topology had not previously been observed in nature. Experimental studies show that the folding kinetics of Top7 is more complex than the kinetics of folding of otherwise comparable naturally occurring proteins. In this paper, we use structure prediction tools, frustration analysis, and free energy profiles to illustrate the folding landscapes of Top7 and two other proteins designed by Takada. We use both perfectly funneled (structure-based) and predictive (transferable) models to gain insight into the role of topological versus energetic frustration in these systems and show how they differ from those found for natural proteins. We also study how robust the folding of these designs would be to the simplification of the sequences using fewer amino acid types. Simplification using a five amino acid type code results in comparable quality of structure prediction to the full sequence in some cases, while the two-letter simplification scheme dramatically reduces the quality of structure prediction.
我们探索了经过自然选择的蛋白质和人类设计的一些蛋白质的能量景观之间的相似性和差异。天然蛋白质的功能与其折叠有关,这是能量受挫的一个来源。另一方面,Top7 的序列是仅使用天然状态稳定性作为优化标准,完全基于结构设计的。它的拓扑结构以前在自然界中没有观察到。实验研究表明,Top7 的折叠动力学比其他可比天然存在的蛋白质的折叠动力学更为复杂。在本文中,我们使用结构预测工具、挫折分析和自由能曲线来阐明 Top7 以及 Takada 设计的另外两种蛋白质的折叠景观。我们使用完全有沟(基于结构)和预测(可转移)模型来深入了解拓扑和能量挫折在这些系统中的作用,并展示它们与天然蛋白质的不同之处。我们还研究了这些设计在使用较少的氨基酸类型简化序列时的折叠稳定性。在某些情况下,使用五种氨基酸类型代码进行简化可以与完整序列的结构预测质量相媲美,而两个字母的简化方案则大大降低了结构预测的质量。