HCEMM-BRC Metabolic Systems Biology Lab, Szeged, Hungary.
Biological Research Centre, Institute of Biochemistry, Synthetic and Systems Biology Unit, Eötvös Loránd Research Network (ELKH), Szeged, Hungary.
Sci Rep. 2022 Apr 21;12(1):6547. doi: 10.1038/s41598-022-10427-1.
Proteins are prone to aggregate when expressed above their solubility limits. Aggregation may occur rapidly, potentially as early as proteins emerge from the ribosome, or slowly, following synthesis. However, in vivo data on aggregation rates are scarce. Here, we classified the Escherichia coli proteome into rapidly and slowly aggregating proteins using an in vivo image-based screen coupled with machine learning. We find that the majority (70%) of cytosolic proteins that become insoluble upon overexpression have relatively low rates of aggregation and are unlikely to aggregate co-translationally. Remarkably, such proteins exhibit higher folding rates compared to rapidly aggregating proteins, potentially implying that they aggregate after reaching their folded states. Furthermore, we find that a substantial fraction (~ 35%) of the proteome remain soluble at concentrations much higher than those found naturally, indicating a large margin of safety to tolerate gene expression changes. We show that high disorder content and low surface stickiness are major determinants of high solubility and are favored in abundant bacterial proteins. Overall, our study provides a global view of aggregation rates and hence solubility limits of proteins in a bacterial cell.
当蛋白质的表达量超过其溶解度极限时,它们很容易发生聚集。聚集可能会很快发生,可能早在核糖体上出现蛋白质时就已经发生,也可能在合成后缓慢发生。然而,关于聚集速率的体内数据非常有限。在这里,我们使用基于体内图像的筛选方法结合机器学习,将大肠杆菌蛋白质组分类为快速和缓慢聚集的蛋白质。我们发现,在过量表达时变得不溶的大多数(70%)细胞质蛋白的聚集速度相对较低,不太可能在共翻译时聚集。值得注意的是,与快速聚集的蛋白质相比,这些蛋白质具有更高的折叠速率,这可能意味着它们在达到折叠状态后才聚集。此外,我们发现相当一部分(~35%)蛋白质在远高于天然存在浓度的情况下仍保持可溶性,这表明在容忍基因表达变化方面有很大的安全裕度。我们表明,高无序含量和低表面粘性是高可溶性的主要决定因素,在丰富的细菌蛋白中受到青睐。总的来说,我们的研究提供了细菌细胞中蛋白质聚集速率和因此溶解度极限的全局视图。