Schneider Michel, Lane Lydie, Boutet Emmanuel, Lieberherr Damien, Tognolli Michael, Bougueleret Lydie, Bairoch Amos
Swiss Institute of Bioinformatics, Centre Médical Universitaire, Genève 4, Switzerland.
J Proteomics. 2009 Apr 13;72(3):567-73. doi: 10.1016/j.jprot.2008.11.010. Epub 2008 Nov 24.
The UniProt knowledgebase, UniProtKB, is the main product of the UniProt consortium. It consists of two sections, UniProtKB/Swiss-Prot, the manually curated section, and UniProtKB/TrEMBL, the computer translation of the EMBL/GenBank/DDBJ nucleotide sequence database. Taken together, these two sections cover all the proteins characterized or inferred from all publicly available nucleotide sequences. The Plant Proteome Annotation Program (PPAP) of UniProtKB/Swiss-Prot focuses on the manual annotation of plant-specific proteins and protein families. Our major effort is currently directed towards the two model plants Arabidopsis thaliana and Oryza sativa. In UniProtKB/Swiss-Prot, redundancy is minimized by merging all data from different sources in a single entry. The proposed protein sequence is frequently modified after comparison with ESTs, full length transcripts or homologous proteins from other species. The information present in manually curated entries allows the reconstruction of all described isoforms. The annotation also includes proteomics data such as PTM and protein identification MS experimental results. UniProtKB and the other products of the UniProt consortium are accessible online at www.uniprot.org.
通用蛋白质数据库(UniProt知识库,UniProtKB)是通用蛋白质数据库协会的主要产品。它由两个部分组成,即UniProtKB/Swiss-Prot(人工注释部分)和UniProtKB/TrEMBL(EMBL/GenBank/DDBJ核苷酸序列数据库的计算机翻译结果)。这两个部分合起来涵盖了所有从公开可用核苷酸序列中鉴定或推断出的蛋白质。UniProtKB/Swiss-Prot的植物蛋白质组注释计划(PPAP)专注于对植物特异性蛋白质和蛋白质家族的人工注释。我们目前的主要工作方向是两种模式植物——拟南芥和水稻。在UniProtKB/Swiss-Prot中,通过将来自不同来源的所有数据合并到一个条目中,使冗余度降至最低。与EST、全长转录本或其他物种的同源蛋白质进行比较后,所提交的蛋白质序列常常会被修改。人工注释条目中的信息可用于重建所有已描述的异构体。注释还包括蛋白质组学数据,如PTM和蛋白质鉴定质谱实验结果。可通过www.uniprot.org在线访问UniProtKB及通用蛋白质数据库协会的其他产品。