Staub Eike, Mackowiak Sebastian, Vingron Martin
Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany.
Genome Biol. 2006;7(10):R98. doi: 10.1186/gb-2006-7-10-r98. Epub 2006 Oct 26.
Although baker's yeast is a primary model organism for research on eukaryotic ribosome assembly and nucleoli, the list of its proteins that are functionally associated with nucleoli or ribosomes is still incomplete. We trained a naïve Bayesian classifier to predict novel proteins that are associated with yeast nucleoli or ribosomes based on parts lists of nucleoli in model organisms and large-scale protein interaction data sets. Phylogenetic profiling and gene expression analysis were carried out to shed light on evolutionary and regulatory aspects of nucleoli and ribosome assembly.
We predict that, in addition to 439 known proteins, a further 62 yeast proteins are associated with components of the nucleolus or the ribosome. The complete set comprises a large core of archaeal-type proteins, several bacterial-type proteins, but mostly eukaryote-specific inventions. Expression of nucleolar and ribosomal genes tends to be strongly co-regulated compared to other yeast genes.
The number of proteins associated with nucleolar or ribosomal components in yeast is at least 14% higher than known before. The nucleolus probably evolved from an archaeal-type ribosome maturation machinery by recruitment of several bacterial-type and mostly eukaryote-specific factors. Not only expression of ribosomal protein genes, but also expression of genes encoding the 90S processosome, are strongly co-regulated and both regulatory programs are distinct from each other.
尽管面包酵母是研究真核生物核糖体组装和核仁的主要模式生物,但其与核仁或核糖体功能相关的蛋白质列表仍不完整。我们训练了一个朴素贝叶斯分类器,以基于模式生物中核仁的部件列表和大规模蛋白质相互作用数据集来预测与酵母核仁或核糖体相关的新蛋白质。进行了系统发育分析和基因表达分析,以阐明核仁和核糖体组装的进化和调控方面。
我们预测,除了439种已知蛋白质外,还有62种酵母蛋白质与核仁或核糖体的成分相关。完整的集合包括大量的古细菌型蛋白质核心、几种细菌型蛋白质,但主要是真核生物特有的发明。与其他酵母基因相比,核仁和核糖体基因的表达倾向于受到强烈的共同调控。
与酵母核仁或核糖体成分相关的蛋白质数量比之前已知的至少高出14%。核仁可能是通过招募几种细菌型和主要是真核生物特有的因子,从古细菌型核糖体成熟机制进化而来的。不仅核糖体蛋白基因的表达,而且编码90S加工体的基因的表达,都受到强烈的共同调控,并且这两种调控程序彼此不同。