Department of Chemistry, University of Texas at Austin, TX 78712, USA.
Biophysics Program, Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, USA.
Soft Matter. 2017 May 21;13(19):3622-3638. doi: 10.1039/c7sm00074j. Epub 2017 Apr 27.
Folded states of single domain globular proteins are compact with high packing density. The radius of gyration, R, of both the folded and unfolded states increase as N where N is the number of amino acids in the protein. The values of the Flory exponent ν are, respectively, ≈⅓ and ≈0.6 in the folded and unfolded states, coinciding with those for homopolymers. However, the extent of compaction of the unfolded state of a protein under low denaturant concentration (collapsibility), conditions favoring the formation of the folded state, is unknown. We develop a theory that uses the contact map of proteins as input to quantitatively assess collapsibility of proteins. Although collapsibility is universal, the propensity to be compact depends on the protein architecture. Application of the theory to over two thousand proteins shows that collapsibility depends not only on N but also on the contact map reflecting the native structure. A major prediction of the theory is that β-sheet proteins are far more collapsible than structures dominated by α-helices. The theory and the accompanying simulations, validating the theoretical predictions, provide insights into the differing conclusions reached using different experimental probes assessing the extent of compaction of proteins. By calculating the criterion for collapsibility as a function of protein length we provide quantitative insights into the reasons why single domain proteins are small and the physical reasons for the origin of multi-domain proteins. Collapsibility of non-coding RNA molecules is similar β-sheet proteins structures adding support to "Compactness Selection Hypothesis".
折叠状态的单域球状蛋白结构紧凑,堆积密度高。无论是折叠态还是去折叠态,其回转半径 R 随蛋白质中氨基酸数量 N 的增加呈 N 的幂次增长,其中 Flory 指数 ν 的数值分别约为 2/3 和 0.6,与均聚物的数值一致。然而,在低变性剂浓度(可折叠性)下,蛋白质去折叠态的紧凑程度,有利于形成折叠态的条件,目前还不清楚。我们提出了一种理论,该理论使用蛋白质的接触图作为输入,以定量评估蛋白质的可折叠性。尽管可折叠性是普遍存在的,但紧凑的倾向取决于蛋白质的结构。该理论应用于两千多种蛋白质表明,可折叠性不仅取决于 N,还取决于反映天然结构的接触图。该理论的一个主要预测是,β-折叠蛋白比由α-螺旋主导的结构更具可折叠性。该理论和伴随的模拟验证了理论预测,为理解使用不同实验探针评估蛋白质紧凑程度时得出的不同结论提供了新的视角。通过计算作为蛋白质长度函数的可折叠性标准,我们深入了解了单域蛋白为什么小以及多域蛋白起源的物理原因。非编码 RNA 分子的可折叠性与β-折叠蛋白结构相似,为“紧凑选择假说”提供了支持。