Galea Charles A, High Anthony A, Obenauer John C, Mishra Ashutosh, Park Cheon-Gil, Punta Marco, Schlessinger Avner, Ma Jing, Rost Burkhard, Slaughter Clive A, Kriwacki Richard W
Department of Structural Biology, St. Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, Tennessee 38105, USA.
J Proteome Res. 2009 Jan;8(1):211-26. doi: 10.1021/pr800308v.
Intrinsically disordered proteins are predicted to be highly abundant and play broad biological roles in eukaryotic cells. In particular, by virtue of their structural malleability and propensity to interact with multiple binding partners, disordered proteins are thought to be specialized for roles in signaling and regulation. However, these concepts are based on in silico analyses of translated whole genome sequences, not on large-scale analyses of proteins expressed in living cells. Therefore, whether these concepts broadly apply to expressed proteins is currently unknown. Previous studies have shown that heat-treatment of cell extracts lead to partial enrichment of soluble, disordered proteins. On the basis of this observation, we sought to address the current dearth of knowledge about expressed, disordered proteins by performing a large-scale proteomics study of thermostable proteins isolated from mouse fibroblast cells. With the use of novel multidimensional chromatography methods and mass spectrometry, we identified a total of 1320 thermostable proteins from these cells. Further, we used a variety of bioinformatics methods to analyze the structural and biological properties of these proteins. Interestingly, more than 900 of these expressed proteins were predicted to be substantially disordered. These were divided into two categories, with 514 predicted to be predominantly disordered and 395 predicted to exhibit both disordered and ordered/folded features. In addition, 411 of the thermostable proteins were predicted to be folded. Despite the use of heat treatment (60 min at 98 degrees C) to partially enrich for disordered proteins, which might have been expected to select for small proteins, the sequences of these proteins exhibited a wide range of lengths (622 +/- 555 residues (average length +/- standard deviation) for disordered proteins and 569 +/- 598 residues for folded proteins). Computational structural analyses revealed several unexpected features of the thermostable proteins: (1) disordered domains and coiled-coil domains occurred together in a large number of disordered proteins, suggesting functional interplay between these domains; and (2) more than 170 proteins contained lengthy domains (>300 residues) known to be folded. Reference to Gene Ontology Consortium functional annotations revealed that, while disordered proteins play diverse biological roles in mouse fibroblasts, they do exhibit heightened involvement in several functional categories, including, cytoskeletal structure and cell movement, metabolic and biosynthetic processes, organelle structure, cell division, gene transcription, and ribonucleoprotein complexes. We believe that these results reflect the general properties of the mouse intrinsically disordered proteome (IDP-ome) although they also reflect the specialized physiology of fibroblast cells. Large-scale identification of expressed, thermostable proteins from other cell types in the future, grown under varied physiological conditions, will dramatically expand our understanding of the structural and biological properties of disordered eukaryotic proteins.
据预测,内在无序蛋白质在真核细胞中含量极高且发挥着广泛的生物学作用。特别是,由于其结构的可塑性以及与多个结合伴侣相互作用的倾向,无序蛋白质被认为在信号传导和调节方面具有特殊作用。然而,这些概念是基于对翻译后的全基因组序列的计算机分析,而非对活细胞中表达的蛋白质进行的大规模分析。因此,这些概念是否广泛适用于表达的蛋白质目前尚不清楚。先前的研究表明,对细胞提取物进行热处理会导致可溶性无序蛋白质部分富集。基于这一观察结果,我们试图通过对从小鼠成纤维细胞中分离出的热稳定蛋白质进行大规模蛋白质组学研究,来解决目前关于表达的无序蛋白质的知识匮乏问题。利用新型多维色谱方法和质谱技术,我们从这些细胞中总共鉴定出1320种热稳定蛋白质。此外,我们使用了多种生物信息学方法来分析这些蛋白质的结构和生物学特性。有趣的是,这些表达的蛋白质中有900多种被预测为基本上无序。它们被分为两类,其中514种被预测主要为无序,395种被预测同时具有无序和有序/折叠特征。此外,411种热稳定蛋白质被预测为折叠状态。尽管使用了热处理(98摄氏度下60分钟)来部分富集无序蛋白质,而这可能会选择小蛋白质,但这些蛋白质的序列显示出广泛的长度范围(无序蛋白质为622±555个残基(平均长度±标准差),折叠蛋白质为569±598个残基)。计算结构分析揭示了热稳定蛋白质几个意想不到的特征:(1)无序结构域和卷曲螺旋结构域在大量无序蛋白质中共同出现,表明这些结构域之间存在功能相互作用;(2)超过170种蛋白质包含已知为折叠状态的长结构域(>300个残基)。参考基因本体联合会的功能注释发现,虽然无序蛋白质在小鼠成纤维细胞中发挥着多种生物学作用,但它们在几个功能类别中的参与度确实有所提高,包括细胞骨架结构和细胞运动、代谢和生物合成过程、细胞器结构、细胞分裂、基因转录以及核糖核蛋白复合物。我们认为,这些结果反映了小鼠内在无序蛋白质组(IDP组)的一般特性,尽管它们也反映了成纤维细胞的特殊生理学特性。未来,在不同生理条件下培养其他细胞类型并大规模鉴定表达的热稳定蛋白质,将极大地扩展我们对无序真核蛋白质的结构和生物学特性的理解。