Department of Biomedical Engineering and Center for Biological Systems Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA.
Proc Natl Acad Sci U S A. 2013 Aug 13;110(33):13392-7. doi: 10.1073/pnas.1304749110. Epub 2013 Jul 30.
The functions of intrinsically disordered proteins (IDPs) are governed by relationships between information encoded in their amino acid sequences and the ensembles of conformations that they sample as autonomous units. Most IDPs are polyampholytes, with sequences that include both positively and negatively charged residues. Accordingly, we focus here on the sequence-ensemble relationships of polyampholytic IDPs. The fraction of charged residues discriminates between weak and strong polyampholytes. Using atomistic simulations, we show that weak polyampholytes form globules, whereas the conformational preferences of strong polyampholytes are determined by a combination of fraction of charged residues values and the linear sequence distributions of oppositely charged residues. We quantify the latter using a patterning parameter κ that lies between zero and one. The value of κ is low for well-mixed sequences, and in these sequences, intrachain electrostatic repulsions and attractions are counterbalanced, leading to the unmasking of preferences for conformations that resemble either self-avoiding random walks or generic Flory random coils. Segregation of oppositely charged residues within linear sequences leads to high κ-values and preferences for hairpin-like conformations caused by long-range electrostatic attractions induced by conformational fluctuations. We propose a scaling theory to explain the sequence-encoded conformational properties of strong polyampholytes. We show that naturally occurring strong polyampholytes have low κ-values, and this feature implies a selection for random coil ensembles. The design of sequences with different κ-values demonstrably alters the conformational preferences of polyampholytic IDPs, and this ability could become a useful tool for enabling direct inquiries into connections between sequence-ensemble relationships and functions of IDPs.
无规卷曲蛋白质(IDP)的功能由其氨基酸序列中编码的信息与它们作为自主单元采样的构象集合之间的关系决定。大多数 IDP 都是多电荷的,其序列中既包含正电荷残基又包含负电荷残基。因此,我们在这里重点关注多电荷 IDP 的序列-集合关系。带电荷残基的比例区分了弱多电荷和强多电荷。我们使用原子模拟表明,弱多电荷形成球体,而强多电荷的构象偏好由带电荷残基的比例值和相反电荷残基的线性序列分布的组合决定。我们使用介于 0 和 1 之间的模式参数 κ 来量化后者。对于充分混合的序列,κ 值较低,在这些序列中,链内静电斥力和吸引力相互平衡,导致类似于自回避随机行走或通用 Flory 无规线团的构象偏好的暴露。线性序列中相反电荷残基的分离导致高 κ 值和发夹样构象的偏好,这是由构象波动引起的长程静电吸引力引起的。我们提出了一种标度理论来解释强多电荷的序列编码构象特性。我们表明,天然存在的强多电荷具有低 κ 值,这一特征意味着对无规线团集合的选择。具有不同 κ 值的序列的设计明显改变了多电荷 IDP 的构象偏好,这种能力可能成为一种有用的工具,能够直接探究 IDP 的序列-集合关系和功能之间的联系。