Kaplan Noam, Linial Michal
Department of Biological Chemistry, Life Science Institute, The Hebrew University, Jerusalem 91904, Israel.
Genome Res. 2006 Nov;16(11):1431-8. doi: 10.1101/gr.4916306. Epub 2006 Oct 25.
The recently sequenced genome of the honey bee (Apis mellifera) has produced 10,157 predicted protein sequences, calling for a computational effort to extract biological insights from them. We have applied an unsupervised hierarchical protein-clustering method, which was previously used in the ProtoNet system, to nearly 200,000 proteins consisting of the predicted honey bee proteins, the SWISS-PROT protein database, and the complete set of proteins of the mouse (Mus musculus) and the fruit fly (Drosophila melanogaster). The hierarchy produced by this method has been entitled ProtoBee. In ProtoBee, the proteins are hierarchically organized into 18,936 separate tree hierarchies, each representing a protein functional family. By using the mouse and Drosophila complete proteomes as reference, we are able to highlight functional groups of putative gene-loss events, putative novel proteins of unique functionality, and bee-specific paralogs. We have studied some of the ProtoBee findings and suggest their biological relevance. Examples include novel opsin genes and intriguing nuclear matches of mitochondrial genes. The organization of bee sequences into functional clusters suggests a natural way of automatically inferring functional annotation. Following this notion, we were able to assign functional annotation to about 70% of the sequences. ProtoBee is available at http://www.protobee.cs.huji.ac.il.
最近测序的蜜蜂(意大利蜜蜂)基因组产生了10157个预测的蛋白质序列,这需要通过计算从这些序列中提取生物学见解。我们应用了一种先前在ProtoNet系统中使用的无监督层次蛋白质聚类方法,该方法用于由预测的蜜蜂蛋白质、SWISS-PROT蛋白质数据库以及小鼠(小家鼠)和果蝇(黑腹果蝇)的全套蛋白质组成的近200000个蛋白质。通过这种方法产生的层次结构被命名为ProtoBee。在ProtoBee中,蛋白质被层次化地组织成18936个独立的树形层次结构,每个层次结构代表一个蛋白质功能家族。通过将小鼠和果蝇的完整蛋白质组作为参考,我们能够突出推定的基因缺失事件的功能组、具有独特功能的推定新蛋白质以及蜜蜂特有的旁系同源物。我们研究了一些ProtoBee的发现,并提出了它们的生物学相关性。例子包括新的视蛋白基因和线粒体基因有趣的核匹配。将蜜蜂序列组织成功能簇暗示了一种自动推断功能注释的自然方式。基于这一概念,我们能够为大约70%的序列分配功能注释。ProtoBee可在http://www.protobee.cs.huji.ac.il上获取。