• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因组时代的蛋白质序列注释:SWISS-PROT+TREMBL注释概念

Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL.

作者信息

Apweiler R, Gateau A, Contrino S, Martin M J, Junker V, O'Donovan C, Lang F, Mitaritonna N, Kappus S, Bairoch A

机构信息

EMBL Outstation-The European Bioinformatics Institute, Cambridge, UK.

出版信息

Proc Int Conf Intell Syst Mol Biol. 1997;5:33-43.

PMID:9322012
Abstract

SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotation, a minimal level of redundancy and high level of integration with other databases. Ongoing genome sequencing projects have dramatically increased the number of protein sequences to be incorporated into SWISS-PROT. Since we do not want to dilute the quality standards of SWISS-PROT by incorporating sequences without proper sequence analysis and annotation, we cannot speed up the incorporation of new incoming data indefinitely. However, as we also want to make the sequences available as fast as possible, we introduced TREMBL (TRanslation of EMBL nucleotide sequence database), a supplement to SWISS-PROT. TREMBL consists of computer-annotated entries in SWISS-PROT format derived from the translation of all coding sequences (CDS) in the EMBL nucleotide sequence database, except for CDS already included in SWISS-PROT. While TREMBL is already of immense value, its computer-generated annotation does not match the quality of SWISS-PROTs. The main difference is in the protein functional information attached to sequences. With this in mind, we are dedicating substantial effort to develop and apply computer methods to enhance the functional information attached to TREMBL entries.

摘要

SWISS-PROT是一个经过精心整理的蛋白质序列数据库,致力于提供高水平的注释、最低限度的冗余以及与其他数据库的高度整合。正在进行的基因组测序项目极大地增加了要纳入SWISS-PROT的蛋白质序列数量。由于我们不想通过纳入未经适当序列分析和注释的序列来稀释SWISS-PROT的质量标准,所以我们不能无限制地加快新输入数据的纳入速度。然而,由于我们也希望尽快提供这些序列,我们引入了TREMBL(EMBL核苷酸序列数据库的翻译),作为SWISS-PROT的补充。TREMBL由以SWISS-PROT格式计算机注释的条目组成,这些条目源自EMBL核苷酸序列数据库中所有编码序列(CDS)的翻译,但不包括已包含在SWISS-PROT中的CDS。虽然TREMBL已经具有巨大价值,但其计算机生成的注释与SWISS-PROT的质量不匹配。主要区别在于附加到序列上的蛋白质功能信息。考虑到这一点,我们正在投入大量精力开发和应用计算机方法,以增强附加到TREMBL条目的功能信息。

相似文献

1
Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL.基因组时代的蛋白质序列注释:SWISS-PROT+TREMBL注释概念
Proc Int Conf Intell Syst Mol Biol. 1997;5:33-43.
2
The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.2000年的SWISS-PROT蛋白质序列数据库及其补充数据库TrEMBL。
Nucleic Acids Res. 2000 Jan 1;28(1):45-8. doi: 10.1093/nar/28.1.45.
3
The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999.1999年的SWISS-PROT蛋白质序列数据库及其补充数据库TrEMBL。
Nucleic Acids Res. 1999 Jan 1;27(1):49-54. doi: 10.1093/nar/27.1.49.
4
The SWISS-PROT protein sequence data bank and its new supplement TREMBL.SWISS-PROT蛋白质序列数据库及其新的补充数据库TREMBL。
Nucleic Acids Res. 1996 Jan 1;24(1):21-5. doi: 10.1093/nar/24.1.21.
5
The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1998.1998年的SWISS-PROT蛋白质序列数据库及其补充数据库TrEMBL。
Nucleic Acids Res. 1998 Jan 1;26(1):38-42. doi: 10.1093/nar/26.1.38.
6
The SWISS-PROT protein sequence data bank and its supplement TrEMBL.SWISS-PROT蛋白质序列数据库及其补充数据库TrEMBL。
Nucleic Acids Res. 1997 Jan 1;25(1):31-6. doi: 10.1093/nar/25.1.31.
7
The role SWISS-PROT and TrEMBL play in the genome research environment.SWISS-PROT和TrEMBL在基因组研究环境中所起的作用。
J Biotechnol. 2000 Mar 31;78(3):221-34. doi: 10.1016/s0168-1656(00)00198-x.
8
High-quality protein knowledge resource: SWISS-PROT and TrEMBL.高质量蛋白质知识资源:SWISS-PROT和TrEMBL。
Brief Bioinform. 2002 Sep;3(3):275-84. doi: 10.1093/bib/3.3.275.
9
Removing redundancy in SWISS-PROT and TrEMBL.去除SWISS-PROT和TrEMBL中的冗余信息。
Bioinformatics. 1999 Mar;15(3):258-9. doi: 10.1093/bioinformatics/15.3.258.
10
Database verification studies of SWISS-PROT and GenBank.SWISS-PROT和GenBank的数据库验证研究。
Bioinformatics. 2001 Jun;17(6):526-32; discussion 533-4. doi: 10.1093/bioinformatics/17.6.526.

引用本文的文献

1
Isolation of a novel multiple-heavy metal resistant Lampropedia aestuarii GYF-1 and investigation of its bioremediation potential.一株新型耐多种重金属 Lampropedia aestuarii GYF-1 的分离及其生物修复潜力研究。
BMC Microbiol. 2023 Nov 7;23(1):330. doi: 10.1186/s12866-023-03093-4.
2
Prediction of carbohydrate-binding proteins from sequences using support vector machines.使用支持向量机从序列预测碳水化合物结合蛋白。
Adv Bioinformatics. 2010;2010. doi: 10.1155/2010/289301. Epub 2010 Sep 27.
3
SCANMOT: searching for similar sequences using a simultaneous scan of multiple sequence motifs.
SCANMOT:通过同时扫描多个序列基序来搜索相似序列。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W274-6. doi: 10.1093/nar/gki493.
4
Prediction of functional residues in water channels and related proteins.水通道及相关蛋白中功能残基的预测
Protein Sci. 1998 Jun;7(6):1458-68. doi: 10.1002/pro.5560070623.