EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
J Proteomics. 2010 Oct 10;73(11):2136-46. doi: 10.1016/j.jprot.2010.06.008. Epub 2010 Jul 6.
Although data deposition is not yet common practice in proteomics, several mass spectrometry (MS)-based proteomics repositories are publicly available to the scientific community. The main existing resources are the Global Proteome Machine Database (GPMDB), PeptideAtlas, the PRoteomics IDEntifications database (PRIDE), Tranche, and NCBI Peptidome. This review describes the capabilities of each, paying special attention to four key properties: the data types stored, the applicable data submission strategies, the supported formats, and the available data mining and visualization tools. The data content from model organisms is also enumerated for each resource. Other valuable repositories exist that are smaller and/or more specialized, but they are not covered in this review. Finally, the review introduces the concept behind the ProteomeXchange consortium, a collaborative effort among the main resources in the field.