Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States.
Biochemistry. 2020 May 26;59(20):1881-1895. doi: 10.1021/acs.biochem.9b01114. Epub 2020 May 11.
The degree of hydrophobicity and net charge per residue are physical properties that enable the discrimination of folded from intrinsically disordered proteins (IDPs) solely on the basis of amino acid sequence. Here, we improve upon the existing classification of proteins and IDPs based on the parameters mentioned above by adopting the scale of nonpolar content of Rose et al. and by taking amino acid side-chain acidity and basicity into account. The resulting algorithm, denoted here as net charge nonpolar or NECNOP, enables the facile prediction of the folded and disordered status of proteins under physiologically relevant conditions with >95% accuracy, based on amino-acid sequence alone. The NECNOP approach displays a much-enhanced performance for proteins with >140 residues, suggesting that small proteins are more likely to have irregular charge and hydrophobicity features. NECNOP analysis of the entire proteome identifies specific net charge and nonpolar regions peculiar to soluble, integral membrane, and non-integral membrane proteins. Surprisingly, protein net charge and hydrophobicity are found to converge to specific values as chain length increases, across the proteome. In addition, NECNOP plots enable the straightforward identification of protein sequences corresponding to prion proteins and promise to serve as a powerful predictive tool for the design of large proteins. In summary, NECNOP plots are a straightforward approach that improves our understanding of the relation between the amino acid sequence and three-dimensional structure of proteins as a function of molecular mass.
疏水性和每个残基的净电荷是物理性质,仅基于氨基酸序列就能区分折叠和无规卷曲蛋白质(IDP)。在这里,我们通过采用 Rose 等人的非极性含量尺度,并考虑氨基酸侧链的酸碱性,对基于上述参数的蛋白质和 IDP 分类进行了改进。所得算法,这里称为净电荷非极性或 NECNOP,仅基于氨基酸序列,就能轻松预测生理相关条件下蛋白质的折叠和无规卷曲状态,准确率>95%。对于>140 个残基的蛋白质,NECNOP 方法的性能有了很大提高,这表明小蛋白质更可能具有不规则的电荷和疏水性特征。对整个蛋白质组的 NECNOP 分析确定了可溶性、整膜和非整膜蛋白质特有的特定净电荷和非极性区域。令人惊讶的是,随着链长的增加,蛋白质净电荷和疏水性在整个蛋白质组中收敛到特定值。此外,NECNOP 图能够直接识别与朊病毒蛋白相对应的蛋白质序列,并有望成为设计大蛋白质的强大预测工具。总之,NECNOP 图是一种简单的方法,可以加深我们对蛋白质的氨基酸序列和三维结构之间关系的理解,这是一个随分子量变化的函数。