通用蛋白质资源（UniProt）：不断扩展的蛋白质信息宇宙。

The Universal Protein Resource (UniProt): an expanding universe of protein information.

作者信息

Wu Cathy H, Apweiler Rolf, Bairoch Amos, Natale Darren A, Barker Winona C, Boeckmann Brigitte, Ferro Serenella, Gasteiger Elisabeth, Huang Hongzhan, Lopez Rodrigo, Magrane Michele, Martin Maria J, Mazumder Raja, O'Donovan Claire, Redaschi Nicole, Suzek Baris

机构信息

Department of Biochemistry and Molecular Biology, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20057-1414, USA.

出版信息

Nucleic Acids Res. 2006 Jan 1;34(Database issue):D187-91. doi: 10.1093/nar/gkj161.

DOI:10.1093/nar/gkj161

PMID:16381842

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1347523/

Abstract

The Universal Protein Resource (UniProt) provides a central resource on protein sequences and functional annotation with three database components, each addressing a key need in protein bioinformatics. The UniProt Knowledgebase (UniProtKB), comprising the manually annotated UniProtKB/Swiss-Prot section and the automatically annotated UniProtKB/TrEMBL section, is the preeminent storehouse of protein annotation. The extensive cross-references, functional and feature annotations and literature-based evidence attribution enable scientists to analyse proteins and query across databases. The UniProt Reference Clusters (UniRef) speed similarity searches via sequence space compression by merging sequences that are 100% (UniRef100), 90% (UniRef90) or 50% (UniRef50) identical. Finally, the UniProt Archive (UniParc) stores all publicly available protein sequences, containing the history of sequence data with links to the source databases. UniProt databases continue to grow in size and in availability of information. Recent and upcoming changes to database contents, formats, controlled vocabularies and services are described. New download availability includes all major releases of UniProtKB, sequence collections by taxonomic division and complete proteomes. A bibliography mapping service has been added, and an ID mapping service will be available soon. UniProt databases can be accessed online at http://www.uniprot.org or downloaded at ftp://ftp.uniprot.org/pub/databases/.

摘要

通用蛋白质资源（UniProt）提供了一个关于蛋白质序列和功能注释的核心资源，它有三个数据库组件，每个组件都满足蛋白质生物信息学中的一个关键需求。UniProt知识库（UniProtKB）由人工注释的UniProtKB/Swiss-Prot部分和自动注释的UniProtKB/TrEMBL部分组成，是蛋白质注释的卓越仓库。广泛的交叉引用、功能和特征注释以及基于文献的证据归属使科学家能够分析蛋白质并跨数据库进行查询。UniProt参考簇（UniRef）通过合并100%相同（UniRef100）、90%相同（UniRef90）或50%相同（UniRef50）的序列，通过序列空间压缩加快相似性搜索。最后，UniProt存档（UniParc）存储所有公开可用的蛋白质序列，包含序列数据的历史记录以及到源数据库的链接。UniProt数据库在规模和信息可用性方面持续增长。描述了数据库内容（格式、控制词汇和服务）的近期和即将发生的变化。新的下载可用性包括UniProtKB的所有主要版本、按分类划分的序列集合和完整的蛋白质组。增加了一个文献映射服务，并且一个ID映射服务将很快可用。可以通过http://www.uniprot.org在线访问UniProt数据库，或从ftp://ftp.uniprot.org/pub/databases/下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e609/1347523/c7d7679dd728/gkj161f1.jpg

相似文献

The Universal Protein Resource (UniProt): an expanding universe of protein information.

Nucleic Acids Res. 2006 Jan 1;34(Database issue):D187-91. doi: 10.1093/nar/gkj161.

UniProtKB/Swiss-Prot.

Methods Mol Biol. 2007;406:89-112. doi: 10.1007/978-1-59745-535-0_4.

UniProtKB/Swiss-Prot, the Manually Annotated Section of the UniProt KnowledgeBase: How to Use the Entry View.

Methods Mol Biol. 2016;1374:23-54. doi: 10.1007/978-1-4939-3167-5_2.

The Universal Protein Resource (UniProt).

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D154-9. doi: 10.1093/nar/gki070.

UniProt: the Universal Protein knowledgebase.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D115-9. doi: 10.1093/nar/gkh131.

UniRef: comprehensive and non-redundant UniProt reference clusters.

Bioinformatics. 2007 May 15;23(10):1282-8. doi: 10.1093/bioinformatics/btm098. Epub 2007 Mar 22.

Plant protein annotation in the UniProt Knowledgebase.

Plant Physiol. 2005 May;138(1):59-66. doi: 10.1104/pp.104.058933.

The Universal Protein Resource (UniProt) 2009.

Nucleic Acids Res. 2009 Jan;37(Database issue):D169-74. doi: 10.1093/nar/gkn664. Epub 2008 Oct 4.

The Universal Protein Resource (UniProt).

Nucleic Acids Res. 2007 Jan;35(Database issue):D193-7. doi: 10.1093/nar/gkl929. Epub 2006 Nov 16.

The universal protein resource (UniProt).

Nucleic Acids Res. 2008 Jan;36(Database issue):D190-5. doi: 10.1093/nar/gkm895. Epub 2007 Nov 27.

引用本文的文献

Mechanistic Study of NT5E in Reg3β-Induced Macrophage Polarization and Cooperation with Plasma Proteins in Myocarditis Injury and Repair.

Biology (Basel). 2025 Aug 7;14(8):1017. doi: 10.3390/biology14081017.

mRNA vaccine design using the proteome of through immunoinformatics approaches.

mSphere. 2025 May 27;10(5):e0080924. doi: 10.1128/msphere.00809-24. Epub 2025 May 1.

Genomic evidence for low genetic diversity but purging of strong deleterious variants in snow leopards.

Genome Biol. 2025 Apr 14;26(1):94. doi: 10.1186/s13059-025-03555-0.

Hindguts of harbor phylogenetically and genomically distinct capable of degrading algal polysaccharides and diazotrophy.

mSystems. 2025 Jan 21;10(1):e0100724. doi: 10.1128/msystems.01007-24. Epub 2024 Dec 23.

BioMedGraphica: An All-in-One Platform for Biomedical Prior Knowledge and Omic Signaling Graph Generation.

bioRxiv. 2024 Dec 9:2024.12.05.627020. doi: 10.1101/2024.12.05.627020.

Bioassay and Pharmacokinetic Characteristics of Xanthium strumarium Plant Extract as Possible Acaricidal Agent.

Curr Pharm Des. 2025;31(12):992-1005. doi: 10.2174/0113816128317849241108064144.

Comparative analysis of adhesion virulence protein FadA from gut-associated bacteria of colorectal cancer patients () and healthy individuals ().

J Cancer. 2024 Aug 19;15(17):5492-5505. doi: 10.7150/jca.98951. eCollection 2024.

PPI-hotspot for detecting protein-protein interaction hot spots from the free protein structure.

Elife. 2024 Sep 16;13:RP96643. doi: 10.7554/eLife.96643.

Preventive and Therapeutic Effects of HD02 and MD159 through Mast Cell Degranulation Inhibition in Mouse Models of Atopic Dermatitis.

Nutrients. 2024 Sep 6;16(17):3021. doi: 10.3390/nu16173021.

and are equipped to degrade a cascade of polysaccharides along the hindgut of the herbivorous fish .

ISME Commun. 2024 Aug 1;4(1):ycae102. doi: 10.1093/ismeco/ycae102. eCollection 2024 Jan.

本文引用的文献

The Mouse Genome Database (MGD): from genes to mice--a community resource for mouse biology.

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D471-5. doi: 10.1093/nar/gki113.

Reactome: a knowledgebase of biological pathways.

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D428-32. doi: 10.1093/nar/gki072.

Database resources of the National Center for Biotechnology Information.

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D39-45. doi: 10.1093/nar/gki062.

Fungal BLAST and Model Organism BLASTP Best Hits: new comparison resources at the Saccharomyces Genome Database (SGD).

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D374-7. doi: 10.1093/nar/gki023.

The EMBL Nucleotide Sequence Database.

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D29-33. doi: 10.1093/nar/gki098.

The RCSB Protein Data Bank: a redesigned query system and relational database based on the mmCIF schema.

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D233-7. doi: 10.1093/nar/gki057.

InterPro, progress and status in 2005.

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D201-5. doi: 10.1093/nar/gki106.

The iProClass integrated database for protein functional analysis.

Comput Biol Chem. 2004 Feb;28(1):87-96. doi: 10.1016/j.compbiolchem.2003.10.003.

The Gene Ontology (GO) database and informatics resource.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D258-61. doi: 10.1093/nar/gkh036.

MEROPS: the peptidase database.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D160-4. doi: 10.1093/nar/gkh071.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通用蛋白质资源（UniProt）：不断扩展的蛋白质信息宇宙。

The Universal Protein Resource (UniProt): an expanding universe of protein information.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献