Peng Zhenling, Yan Jing, Fan Xiao, Mizianty Marcin J, Xue Bin, Wang Kui, Hu Gang, Uversky Vladimir N, Kurgan Lukasz
Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada.
Cell Mol Life Sci. 2015 Jan;72(1):137-51. doi: 10.1007/s00018-014-1661-9. Epub 2014 Jun 18.
Recent years witnessed increased interest in intrinsically disordered proteins and regions. These proteins and regions are abundant and possess unique structural features and a broad functional repertoire that complements ordered proteins. However, modern studies on the abundance and functions of intrinsically disordered proteins and regions are relatively limited in size and scope of their analysis. To fill this gap, we performed a broad and detailed computational analysis of over 6 million proteins from 59 archaea, 471 bacterial, 110 eukaryotic and 325 viral proteomes. We used arguably more accurate consensus-based disorder predictions, and for the first time comprehensively characterized intrinsic disorder at proteomic and protein levels from all significant perspectives, including abundance, cellular localization, functional roles, evolution, and impact on structural coverage. We show that intrinsic disorder is more abundant and has a unique profile in eukaryotes. We map disorder into archaea, bacterial and eukaryotic cells, and demonstrate that it is preferentially located in some cellular compartments. Functional analysis that considers over 1,200 annotations shows that certain functions are exclusively implemented by intrinsically disordered proteins and regions, and that some of them are specific to certain domains of life. We reveal that disordered regions are often targets for various post-translational modifications, but primarily in the eukaryotes and viruses. Using a phylogenetic tree for 14 eukaryotic and 112 bacterial species, we analyzed relations between disorder, sequence conservation and evolutionary speed. We provide a complete analysis that clearly shows that intrinsic disorder is exceptionally and uniquely abundant in each domain of life.
近年来,人们对内在无序蛋白质及区域的兴趣与日俱增。这些蛋白质及区域数量众多,具有独特的结构特征和广泛的功能,可补充有序蛋白质的功能。然而,目前关于内在无序蛋白质及区域的丰度和功能的现代研究在分析规模和范围上相对有限。为填补这一空白,我们对来自59个古菌、471个细菌、110个真核生物和325个病毒蛋白质组的600多万种蛋白质进行了广泛而详细的计算分析。我们使用了可能更为准确的基于共识的无序预测方法,并首次从蛋白质组和蛋白质水平的所有重要角度全面表征了内在无序,包括丰度、细胞定位、功能作用、进化以及对结构覆盖率的影响。我们发现内在无序在真核生物中更为丰富且具有独特的特征。我们将无序映射到古菌、细菌和真核细胞中,并证明其优先位于某些细胞区室。考虑了1200多种注释的功能分析表明,某些功能仅由内在无序蛋白质及区域实现,其中一些功能特定于某些生命域。我们发现无序区域常常是各种翻译后修饰的靶点,但主要是在真核生物和病毒中。利用14种真核生物和112种细菌物种的系统发育树,我们分析了无序、序列保守性和进化速度之间的关系。我们提供了一个完整的分析,清楚地表明内在无序在生命的每个域中都异常且独特地丰富。