From protein sequences to 3D-structures and beyond: the example of the UniProt knowledgebase.

Affiliation

Swiss-Prot Group, Swiss Institute of Bioinformatics, 1 rue Michel Servet, 1211, Geneva, Switzerland.

Publication information

Cell Mol Life Sci. 2010 Apr;67(7):1049-64. doi: 10.1007/s00018-009-0229-6. Epub 2009 Dec 31.

DOI: 10.1007/s00018-009-0229-6

PMID: 20043185

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2835715/

Abstract

With the dramatic increase in the volume of experimental results in every domain of life sciences, assembling pertinent data and combining information from different fields has become a challenge. Information is dispersed over numerous specialized databases and is presented in many different formats. Rapid access to experiment-based information about well-characterized proteins helps predict the function of uncharacterized proteins identified by large-scale sequencing. In this context, universal knowledgebases play essential roles in providing access to data from complementary types of experiments and serving as hubs with cross-references to many specialized databases. This review outlines how the value of experimental data is optimized by combining high-quality protein sequences with complementary experimental results, including information derived from protein 3D-structures, using as an example the UniProt knowledgebase (UniProtKB) and the tools and links provided on its website (http://www.uniprot.org/). It also evokes precautions that are necessary for successful predictions and extrapolations.
