The role SWISS-PROT and TrEMBL play in the genome research environment.

Author information

Junker V, Contrino S, Fleischmann W, Hermjakob H, Lang F, Magrane M, Martin M J, Mitaritonna N, O'Donovan C, Apweiler R

Affiliation

EMBL Outstation, The European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, UK.

Publication information

J Biotechnol. 2000 Mar 31;78(3):221-34. doi: 10.1016/s0168-1656(00)00198-x.

Abstract

SWISS-PROT, a curated protein sequence data bank, contains not only sequence data but also annotation relevant to a particular sequence. The annotation added to each entry is done by a team of biologists and comes, primarily, from articles in journals reporting the actual sequencing and sometimes characterisation. Review articles and collaboration with external experts also play a role along with the use of secondary databases like PROSITE and Pfam in addition to a variety of feature prediction methods. Annotation added by these methods is checked for relevance and likelihood to a particular sequence. The onset of genome sequencing has led to a dramatic increase in sequence data to be included in SWISS-PROT. This has led to the production of TrEMBL (Translation of the EMBL database). TrEMBL consists of entries in a SWISS-PROT format that are derived from the translation of all coding sequences in the EMBL nucleotide sequence database, that are not in SWISS-PROT. Unlike SWISS-PROT entries those in TrEMBL are awaiting manual annotation. However, rather than just representing basic sequence and source information, steps have been taken to add features and annotation automatically. In taking these steps it is hoped that TrEMBL entries are enhanced with some indication as to what a protein is, could or may be.

