The Protein Information Resource: an integrated public resource of functional annotation of proteins.

Authors

Wu Cathy H, Huang Hongzhan, Arminski Leslie, Castro-Alvear Jorge, Chen Yongxing, Hu Zhang-Zhi, Ledley Robert S, Lewis Kali C, Mewes Hans-Werner, Orcutt Bruce C, Suzek Baris E, Tsugita Akira, Vinayaka C R, Yeh Lai-Su L, Zhang Jian, Barker Winona C

Affiliation

National Biomedical Research Foundation, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20007, USA.

Publication information

Nucleic Acids Res. 2002 Jan 1;30(1):35-7. doi: 10.1093/nar/30.1.35.

Abstract

The Protein Information Resource (PIR) serves as an integrated public resource of functional annotation of protein data to support genomic/proteomic research and scientific discovery. The PIR, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the PIR-International Protein Sequence Database (PSD), the major annotated protein sequence database in the public domain, containing about 250 000 proteins. To improve protein annotation and the coverage of experimentally validated data, a bibliography submission system is developed for scientists to submit, categorize and retrieve literature information. Comprehensive protein information is available from iProClass, which includes family classification at the superfamily, domain and motif levels, structural and functional features of proteins, as well as cross-references to over 40 biological databases. To provide timely and comprehensive protein data with source attribution, we have introduced a non-redundant reference protein database, PIR-NREF. The database consists of about 800 000 proteins collected from PIR-PSD, SWISS-PROT, TrEMBL, GenPept, RefSeq and PDB, with composite protein names and literature data. To promote database interoperability, we provide XML data distribution and open database schema, and adopt common ontologies. The PIR web site (http://pir.georgetown.edu/) features data mining and sequence analysis tools for information retrieval and functional identification of proteins based on both sequence and annotation information. The PIR databases and other files are also available by FTP (ftp://nbrfa.georgetown.edu/pir_databases).
