Martens Lennart, Hermjakob Henning, Jones Philip, Adamski Marcin, Taylor Chris, States David, Gevaert Kris, Vandekerckhove Joël, Apweiler Rolf
Department of Biochemistry, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium.
Proteomics. 2005 Aug;5(13):3537-45. doi: 10.1002/pmic.200401303.
The advent of high-throughput proteomics has enabled the identification of ever-increasing numbers of proteins. Correspondingly, the number of publications centered on these protein identifications has increased dramatically. With the first results of the HUPO Plasma Proteome Project being analyzed and many other large-scale proteomics projects about to disseminate their data, this trend is not likely to flatten out any time soon. However, the mechanism for publishing these identified proteins has lagged behind in technical terms. Often, very long lists of identifications are either published directly with the article, resulting in a voluminous and rather tedious read, or are included on the publisher's website as supplementary information. In either case, these lists are typically provided only as Portable Document Format (PDF) documents with a custom-made layout, making it practically impossible for computer programs to interpret them, let alone query them efficiently. Here we propose the proteomics identifications (PRIDE) database (http://www.ebi.ac.uk/pride) as a means to finally turn publicly available data into publicly accessible data. PRIDE offers a web-based query interface, a user-friendly data upload facility, and a documented application programming interface (API) for direct computational access. The complete PRIDE database, source code, data, and support tools are freely available for web access or for download and local installation.