Monsellier Elodie, Ramazzotti Matteo, Taddei Niccolò, Chiti Fabrizio
Dipartimento di Scienze Biochimiche, Università degli studi di Firenze, Florence, Italy.
PLoS Comput Biol. 2008 Oct;4(10):e1000199. doi: 10.1371/journal.pcbi.1000199. Epub 2008 Oct 17.
Formation of amyloid-like fibrils is involved in numerous human protein deposition diseases, but is also an intrinsic property of polypeptide chains in general. Progress achieved recently now allows the aggregation propensity of proteins to be analyzed over large scales. In this work we used a previously developed predictive algorithm to analyze the propensity of the 34,180 protein sequences of the human proteome to form amyloid-like fibrils. We show that long proteins have, on average, less intense aggregation peaks than short ones. Human proteins involved in protein deposition diseases do not differ extensively from the rest of the proteome, further demonstrating the generality of protein aggregation. We were also able to reproduce some of the results obtained with other algorithms, demonstrating that they do not depend on the type of computational tool employed. For example, proteins with different subcellular localizations were found to have different aggregation propensities, in relation to the various efficiencies of quality control mechanisms. Membrane proteins, intrinsically disordered proteins, and folded proteins were confirmed to have very different aggregation propensities, as a consequence of their different structures and cellular microenvironments. In addition, gatekeeper residues at strategic positions of the sequences were found to protect human proteins from aggregation. The results of these comparative analyses highlight the existence of intimate links between the propensity of proteins to form aggregates with beta-structure and their biology. In particular, they emphasize the existence of a negative selection pressure that finely modulates protein sequences in order to adapt their aggregation propensity to their biological context.
淀粉样纤维的形成与多种人类蛋白质沉积疾病有关,但总体而言也是多肽链的一种内在特性。最近取得的进展使得现在能够大规模分析蛋白质的聚集倾向。在这项工作中,我们使用了先前开发的预测算法来分析人类蛋白质组中34,180个蛋白质序列形成淀粉样纤维的倾向。我们表明,平均而言,长蛋白质的聚集峰强度低于短蛋白质。参与蛋白质沉积疾病的人类蛋白质与蛋白质组的其他部分没有太大差异,这进一步证明了蛋白质聚集的普遍性。我们还能够重现其他算法获得的一些结果,表明这些结果不依赖于所使用的计算工具类型。例如,发现具有不同亚细胞定位的蛋白质具有不同的聚集倾向,这与质量控制机制的各种效率有关。由于其不同的结构和细胞微环境,膜蛋白、内在无序蛋白和折叠蛋白被证实具有非常不同的聚集倾向。此外,发现在序列的关键位置的守门残基可保护人类蛋白质不发生聚集。这些比较分析的结果突出了蛋白质形成β结构聚集体的倾向与其生物学之间存在密切联系。特别是,它们强调存在一种负选择压力,该压力精细调节蛋白质序列,以便使其聚集倾向适应其生物学背景。