Institute for Systems Biology, Seattle, WA 98109, USA.
Mol Cell Proteomics. 2011 Sep;10(9):M110.006353. doi: 10.1074/mcp.M110.006353. Epub 2011 Jun 1.
Human blood plasma can be obtained relatively noninvasively and contains proteins from most, if not all, tissues of the body. Therefore, an extensive, quantitative catalog of plasma proteins is an important starting point for the discovery of disease biomarkers. In 2005, we showed that different proteomics measurements using different sample preparation and analysis techniques identify significantly different sets of proteins, and that a comprehensive plasma proteome can be compiled only by combining data from many different experiments. Applying advanced computational methods developed for the analysis and integration of very large and diverse data sets generated by tandem MS measurements of tryptic peptides, we have now compiled a high-confidence human plasma proteome reference set with well over twice the identified proteins of previous high-confidence sets. It includes a hierarchy of protein identifications at different levels of redundancy following a clearly defined scheme, which we propose as a standard that can be applied to any proteomics data set to facilitate cross-proteome analyses. Further, to aid in development of blood-based diagnostics using techniques such as selected reaction monitoring, we provide a rough estimate of protein concentrations using spectral counting. We identified 20,433 distinct peptides, from which we inferred a highly nonredundant set of 1929 protein sequences at a false discovery rate of 1%. We have made this resource available via PeptideAtlas, a large, multiorganism, publicly accessible compendium of peptides identified in tandem MS experiments conducted by laboratories around the world.
人血浆可以相对无创地获得,并且包含来自身体大多数(如果不是全部)组织的蛋白质。因此,广泛的、定量的血浆蛋白质目录是发现疾病生物标志物的重要起点。2005 年,我们表明,使用不同的样品制备和分析技术的不同蛋白质组学测量方法会鉴定出明显不同的蛋白质组,并且只有通过结合来自许多不同实验的数据,才能综合编制血浆蛋白质组。我们应用了为串联 MS 测量的肽的分析和集成非常大且多样化的数据集而开发的先进计算方法,现在已经编译了一个高可信度的人类血浆蛋白质组参考集,其中包含的鉴定蛋白质是以前的高可信度蛋白质组的两倍多。它包括一个按照明确定义的方案在不同冗余级别上的蛋白质鉴定层次结构,我们将其作为一个标准,可以应用于任何蛋白质组数据集,以促进跨蛋白质组分析。此外,为了使用诸如选择反应监测等技术开发基于血液的诊断方法,我们使用谱计数提供了蛋白质浓度的粗略估计。我们鉴定了 20433 个独特的肽,从中推断出 1929 个蛋白质序列,具有 1%的错误发现率。我们通过 PeptideAtlas 提供了这个资源,PeptideAtlas 是一个大型的、多器官的、公共访问的肽综合目录,其中包含了世界各地实验室进行的串联 MS 实验中鉴定的肽。