Suppr超能文献

原蜂(ProtoBee):蜜蜂蛋白质组的分层分类与注释

ProtoBee: hierarchical classification and annotation of the honey bee proteome.

作者信息

Kaplan Noam, Linial Michal

机构信息

Department of Biological Chemistry, Life Science Institute, The Hebrew University, Jerusalem 91904, Israel.

出版信息

Genome Res. 2006 Nov;16(11):1431-8. doi: 10.1101/gr.4916306. Epub 2006 Oct 25.

Abstract

The recently sequenced genome of the honey bee (Apis mellifera) has produced 10,157 predicted protein sequences, calling for a computational effort to extract biological insights from them. We have applied an unsupervised hierarchical protein-clustering method, which was previously used in the ProtoNet system, to nearly 200,000 proteins consisting of the predicted honey bee proteins, the SWISS-PROT protein database, and the complete set of proteins of the mouse (Mus musculus) and the fruit fly (Drosophila melanogaster). The hierarchy produced by this method has been entitled ProtoBee. In ProtoBee, the proteins are hierarchically organized into 18,936 separate tree hierarchies, each representing a protein functional family. By using the mouse and Drosophila complete proteomes as reference, we are able to highlight functional groups of putative gene-loss events, putative novel proteins of unique functionality, and bee-specific paralogs. We have studied some of the ProtoBee findings and suggest their biological relevance. Examples include novel opsin genes and intriguing nuclear matches of mitochondrial genes. The organization of bee sequences into functional clusters suggests a natural way of automatically inferring functional annotation. Following this notion, we were able to assign functional annotation to about 70% of the sequences. ProtoBee is available at http://www.protobee.cs.huji.ac.il.

摘要

最近测序的蜜蜂(意大利蜜蜂)基因组产生了10157个预测的蛋白质序列,这需要通过计算从这些序列中提取生物学见解。我们应用了一种先前在ProtoNet系统中使用的无监督层次蛋白质聚类方法,该方法用于由预测的蜜蜂蛋白质、SWISS-PROT蛋白质数据库以及小鼠(小家鼠)和果蝇(黑腹果蝇)的全套蛋白质组成的近200000个蛋白质。通过这种方法产生的层次结构被命名为ProtoBee。在ProtoBee中,蛋白质被层次化地组织成18936个独立的树形层次结构,每个层次结构代表一个蛋白质功能家族。通过将小鼠和果蝇的完整蛋白质组作为参考,我们能够突出推定的基因缺失事件的功能组、具有独特功能的推定新蛋白质以及蜜蜂特有的旁系同源物。我们研究了一些ProtoBee的发现,并提出了它们的生物学相关性。例子包括新的视蛋白基因和线粒体基因有趣的核匹配。将蜜蜂序列组织成功能簇暗示了一种自动推断功能注释的自然方式。基于这一概念,我们能够为大约70%的序列分配功能注释。ProtoBee可在http://www.protobee.cs.huji.ac.il上获取。

相似文献

2

引用本文的文献

4
8

本文引用的文献

1
Functional annotation prediction: all for one and one for all.功能注释预测:人人为我,我为人人。
Protein Sci. 2006 Jun;15(6):1557-62. doi: 10.1110/ps.062185706. Epub 2006 May 2.
3
InterProScan: protein domains identifier.InterProScan:蛋白质结构域识别工具。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W116-20. doi: 10.1093/nar/gki442.
4
Ensembl 2005.Ensembl 2005。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D447-53. doi: 10.1093/nar/gki138.
6
InterPro, progress and status in 2005.InterPro 2005年的进展与现状
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D201-5. doi: 10.1093/nar/gki106.
7
The Universal Protein Resource (UniProt).通用蛋白质资源(UniProt)。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D154-9. doi: 10.1093/nar/gki070.
10
Signatures of selection among sex-determining alleles of the honey bee.蜜蜂性别决定等位基因间的选择特征
Proc Natl Acad Sci U S A. 2004 Apr 6;101(14):4888-93. doi: 10.1073/pnas.0307147101. Epub 2004 Mar 29.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验