Gollery Martin, Harper Jeff, Cushman John, Mittler Taliah, Girke Thomas, Zhu Jian-Kang, Bailey-Serres Julia, Mittler Ron
Department of Biochemistry and Molecular Biology, University of Nevada, Reno, NV 89557, USA.
Genome Biol. 2006;7(7):R57. doi: 10.1186/gb-2006-7-7-r57.
Proteins with obscure features (POFs), which lack currently defined motifs or domains, represent between 18% and 38% of a typical eukaryotic proteome. To evaluate the contribution of this class of proteins to the diversity of eukaryotes, we performed a comparative analysis of the predicted proteomes derived from 10 different sequenced genomes, including budding and fission yeast, worm, fly, mosquito, Arabidopsis, rice, mouse, rat, and human.
Only 1,650 protein groups were found to be conserved among these proteomes (BLAST E-value threshold of 10(-6)). Of these, only three were designated as POFs. Surprisingly, we found that, on average, 60% of the POFs identified in these 10 proteomes (44,236 in total) were species specific. In contrast, only 7.5% of the proteins with defined features (PDFs) were species specific (17,554 in total). As a group, POFs appear similar to PDFs in their relative contribution to biological functions, as indicated by their expression, participation in protein-protein interactions and association with mutant phenotypes. However, POF have more predicted disordered structure than PDFs, implying that they may exhibit preferential involvement in species-specific regulatory and signaling networks.
Because the majority of eukaryotic POFs are not well conserved, and by definition do not have defined domains or motifs upon which to formulate a functional working hypothesis, understanding their biochemical and biological functions will require species-specific investigations.
具有模糊特征的蛋白质(POF)缺乏当前已定义的基序或结构域,在典型的真核生物蛋白质组中占比18%至38%。为评估这类蛋白质对真核生物多样性的贡献,我们对来自10个不同测序基因组(包括芽殖酵母、裂殖酵母、线虫、果蝇、蚊子、拟南芥、水稻、小鼠、大鼠和人类)的预测蛋白质组进行了比较分析。
在这些蛋白质组中仅发现1650个蛋白质组保守(BLAST E值阈值为10^(-6))。其中,只有三个被指定为POF。令人惊讶的是,我们发现,平均而言,在这10个蛋白质组中鉴定出的POF中有60%(总共44236个)是物种特异性的。相比之下,具有明确特征的蛋白质(PDF)中只有7.5%是物种特异性的(总共17554个)。作为一个群体,POF在对生物学功能的相对贡献方面似乎与PDF相似,这体现在它们的表达、参与蛋白质 - 蛋白质相互作用以及与突变表型的关联上。然而,POF比PDF具有更多预测的无序结构,这意味着它们可能在物种特异性调控和信号网络中表现出优先参与。
由于大多数真核生物POF保守性不佳,并且根据定义没有可用于构建功能工作假设的明确结构域或基序,因此了解它们的生化和生物学功能将需要进行物种特异性研究。