Physics Department, University of California Merced, Merced, California, United States of America.
PLoS One. 2013 Sep 16;8(9):e73831. doi: 10.1371/journal.pone.0073831. eCollection 2013.
Bioinformatics of disordered proteins is especially challenging given high mutation rates for homologous proteins and that functionality may not be strongly related to sequence. Here we have performed a novel bioinformatic analysis, based on the spatial clustering of physically relevant features such as binding motifs and charges within disordered proteins, on thousands of Nuclear Pore Complex (NPC) FG motif containing proteins (FG nups). The biophysical mechanism by which FG nups regulate nucleocytoplasmic transport has remained elusive. Our analysis revealed a set of highly conserved spatial features in the sequence structure of individual FG nups, such as the separation, localization, and ordering of FG motifs and charged residues along the protein chain. These functionally conserved features provide insight into the particular biophysical mechanisms responsible for regulation of nucleocytoplasmic traffic in the NPC, strongly constraining current models. Additionally this method allows us to identify potentially functionally analogous disordered proteins across distantly related species.
鉴于同源蛋白的突变率较高,且功能可能与序列没有很强的相关性,因此无序蛋白的生物信息学研究尤其具有挑战性。在这里,我们对数千种含有核孔复合物(NPC)FG 模体的 FG 核孔蛋白(FG nup)进行了一项新颖的生物信息学分析,该分析基于物理相关特征(如结合基序和无序蛋白中的电荷)在空间上的聚类。FG nup 调节核质转运的生物物理机制仍然难以捉摸。我们的分析揭示了单个 FG nup 序列结构中的一组高度保守的空间特征,例如 FG 模体和带电残基沿蛋白质链的分离、定位和排序。这些功能保守的特征为 NPC 中负责调节核质运输的特定生物物理机制提供了深入了解,强烈限制了当前的模型。此外,该方法还允许我们识别不同远缘物种中潜在功能类似的无序蛋白。