通用蛋白质数据库（UniProt）：蛋白质信息中心。

UniProt: a hub for protein information.

出版信息

Nucleic Acids Res. 2015 Jan;43(Database issue):D204-12. doi: 10.1093/nar/gku989. Epub 2014 Oct 27.

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4384041/

Abstract

UniProt is an important collection of protein sequences and their annotations, which has doubled in size to 80 million sequences during the past year. This growth in sequences has prompted an extension of UniProt accession number space from 6 to 10 characters. An increasing fraction of new sequences are identical to a sequence that already exists in the database with the majority of sequences coming from genome sequencing projects. We have created a new proteome identifier that uniquely identifies a particular assembly of a species and strain or subspecies to help users track the provenance of sequences. We present a new website that has been designed using a user-experience design process. We have introduced an annotation score for all entries in UniProt to represent the relative amount of knowledge known about each protein. These scores will be helpful in identifying which proteins are the best characterized and most informative for comparative analysis. All UniProt data is provided freely and is available on the web at http://www.uniprot.org/.

摘要

通用蛋白质数据库（UniProt）是蛋白质序列及其注释的重要集合，在过去一年中其规模已翻倍至8000万个序列。序列数量的增长促使通用蛋白质数据库登录号空间从6个字符扩展到10个字符。新序列中与数据库中已存在序列相同的比例越来越高，其中大多数序列来自基因组测序项目。我们创建了一个新的蛋白质组标识符，用于唯一标识一个物种、菌株或亚种的特定组装体，以帮助用户追踪序列的来源。我们展示了一个采用用户体验设计流程设计的新网站。我们为通用蛋白质数据库中的所有条目引入了注释分数，以表示关于每种蛋白质已知知识的相对数量。这些分数将有助于确定哪些蛋白质特征最明确且对比较分析最具信息价值。所有通用蛋白质数据库数据均免费提供，可在网站http://www.uniprot.org/上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5149/4384041/b49398c04a10/gku989fig1.jpg

相似文献

UniProt: a hub for protein information.通用蛋白质数据库（UniProt）：蛋白质信息中心。

Nucleic Acids Res. 2015 Jan;43(Database issue):D204-12. doi: 10.1093/nar/gku989. Epub 2014 Oct 27.

UniProt: a worldwide hub of protein knowledge.UniProt：蛋白质知识的全球枢纽。

Nucleic Acids Res. 2019 Jan 8;47(D1):D506-D515. doi: 10.1093/nar/gky1049.

UniProt: the universal protein knowledgebase.通用蛋白质知识库：UniProt

Nucleic Acids Res. 2017 Jan 4;45(D1):D158-D169. doi: 10.1093/nar/gkw1099. Epub 2016 Nov 29.

Reorganizing the protein space at the Universal Protein Resource (UniProt).重新组织通用蛋白质资源库（UniProt）中的蛋白质空间。

Nucleic Acids Res. 2012 Jan;40(Database issue):D71-5. doi: 10.1093/nar/gkr981. Epub 2011 Nov 18.

UniProt Knowledgebase: a hub of integrated protein data.UniProt 知识库：一个集成蛋白质数据的中心。

Database (Oxford). 2011 Mar 29;2011:bar009. doi: 10.1093/database/bar009. Print 2011.

UniProt: the Universal Protein Knowledgebase in 2023.UniProt：2023 年的通用蛋白质知识库。

Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.

UniProt: the universal protein knowledgebase in 2021.UniProt：2021 年的通用蛋白质知识库。

Nucleic Acids Res. 2021 Jan 8;49(D1):D480-D489. doi: 10.1093/nar/gkaa1100.

The Universal Protein Resource (UniProt): an expanding universe of protein information.通用蛋白质资源（UniProt）：不断扩展的蛋白质信息宇宙。

Nucleic Acids Res. 2006 Jan 1;34(Database issue):D187-91. doi: 10.1093/nar/gkj161.

Infrastructure for the life sciences: design and implementation of the UniProt website.生命科学基础设施：UniProt网站的设计与实现

BMC Bioinformatics. 2009 May 8;10:136. doi: 10.1186/1471-2105-10-136.

UniProt: the Universal Protein Knowledgebase in 2025.通用蛋白质知识库（UniProt）：2025年的情况

Nucleic Acids Res. 2025 Jan 6;53(D1):D609-D617. doi: 10.1093/nar/gkae1010.

引用本文的文献

ResLysEmbed: a ResNet-based framework for succinylated lysine residue prediction using sequence and language model embeddings.ResLysEmbed：一种基于ResNet的框架，用于使用序列和语言模型嵌入预测琥珀酰化赖氨酸残基。

Bioinform Adv. 2025 Aug 22;5(1):vbaf198. doi: 10.1093/bioadv/vbaf198. eCollection 2025.

B-vac a robust software package for bacterial vaccine design.B-vac是一个用于细菌疫苗设计的强大软件包。

Sci Rep. 2025 Aug 28;15(1):31745. doi: 10.1038/s41598-025-01201-0.

Functional Genomic Characteristics of Marine Sponge-Associated MI-G.海洋海绵相关微生物群的功能基因组特征

Microorganisms. 2025 Aug 20;13(8):1940. doi: 10.3390/microorganisms13081940.

Comparative genome analysis of patulin-producing OM1 isolated from pears.从梨中分离出的产棒曲霉素的OM1的比较基因组分析。

PeerJ. 2025 Aug 22;13:e19848. doi: 10.7717/peerj.19848. eCollection 2025.

Nascent liver proteome reveals enzymes and transcription regulators under physiological and alcohol exposure conditions.新生肝脏蛋白质组揭示了生理条件和酒精暴露条件下的酶和转录调节因子。

Nat Commun. 2025 Aug 26;16(1):7945. doi: 10.1038/s41467-025-63212-9.

Common molecular links and therapeutic insights between type 2 diabetes and kidney cancer.2型糖尿病与肾癌之间的常见分子联系及治疗见解

PLoS One. 2025 Aug 20;20(8):e0330619. doi: 10.1371/journal.pone.0330619. eCollection 2025.

Global coral genomic vulnerability explains recent reef losses.全球珊瑚基因组脆弱性解释了近期珊瑚礁的损失。

bioRxiv. 2025 Aug 11:2024.03.25.586253. doi: 10.1101/2024.03.25.586253.

Finding the dark matter: Large language model-based enzyme kinetic data extractor and its validation.寻找暗物质：基于大语言模型的酶动力学数据提取器及其验证

Protein Sci. 2025 Sep;34(9):e70251. doi: 10.1002/pro.70251.

MKFGO: integrating multi-source knowledge fusion with pretrained language model for high-accuracy protein function prediction.MKFGO：将多源知识融合与预训练语言模型相结合用于高精度蛋白质功能预测

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf420.

RPIPLM: Prediction of ncRNA-protein interaction by post-training a dual-tower pretrained biological model with supervised contrastive learning.RPIPLM：通过使用监督对比学习对双塔预训练生物模型进行训练后预测非编码RNA与蛋白质的相互作用

PLoS One. 2025 Aug 14;20(8):e0329174. doi: 10.1371/journal.pone.0329174. eCollection 2025.

本文引用的文献

HAMAP in 2015: updates to the protein family classification and annotation system.2015年的HAMAP：蛋白质家族分类与注释系统的更新

Nucleic Acids Res. 2015 Jan;43(Database issue):D1064-70. doi: 10.1093/nar/gku1002. Epub 2014 Oct 27.

Profiling the orphan enzymes.鉴定孤儿酶。

Biol Direct. 2014 Jun 6;9:10. doi: 10.1186/1745-6150-9-10.

A code for RanGDP binding in ankyrin repeats defines a nuclear import pathway.RanGDP 与锚蛋白重复序列的结合密码子定义了核输入途径。

Cell. 2014 May 22;157(5):1130-45. doi: 10.1016/j.cell.2014.05.006.

Finding sequences for over 270 orphan enzymes.找到超过270种孤儿酶的序列。

PLoS One. 2014 May 14;9(5):e97250. doi: 10.1371/journal.pone.0097250. eCollection 2014.

Enzymatic and structural characterization of rTSγ provides insights into the function of rTSβ.rTSγ 的酶学和结构特征为 rTSβ 的功能提供了深入了解。

Biochemistry. 2014 Apr 29;53(16):2732-8. doi: 10.1021/bi500349e. Epub 2014 Apr 15.

Expert curation in UniProtKB: a case study on dealing with conflicting and erroneous data.UniProtKB中的专家管理：处理冲突和错误数据的案例研究

Database (Oxford). 2014 Mar 12;2014:bau016. doi: 10.1093/database/bau016. Print 2014.

The International Nucleotide Sequence Database Collaboration.国际核苷酸序列数据库协作组织。

Nucleic Acids Res. 2013 Jan;41(Database issue):D21-4. doi: 10.1093/nar/gks1084. Epub 2012 Nov 24.

The ChEBI reference database and ontology for biologically relevant chemistry: enhancements for 2013.《ChEBI 参考数据库和生物学相关化学本体：2013 年的增强》

Nucleic Acids Res. 2013 Jan;41(Database issue):D456-63. doi: 10.1093/nar/gks1146. Epub 2012 Nov 24.

Rhea--a manually curated resource of biochemical reactions.雷亚--一个人工 curated 的生化反应资源。

Nucleic Acids Res. 2012 Jan;40(Database issue):D754-60. doi: 10.1093/nar/gkr1126. Epub 2011 Dec 1.

The UniProt-GO Annotation database in 2011.2011 年的 UniProt-GO Annotation 数据库。

Nucleic Acids Res. 2012 Jan;40(Database issue):D565-70. doi: 10.1093/nar/gkr1048. Epub 2011 Nov 28.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通用蛋白质数据库（UniProt）：蛋白质信息中心。

UniProt: a hub for protein information.

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献