BuildSummary: using a group-based approach to improve the sensitivity of peptide/protein identification in shotgun proteomics.

Affiliation

Key Laboratory of Systems Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China.

Publication information

J Proteome Res. 2012 Mar 2;11(3):1494-502. doi: 10.1021/pr200194p. Epub 2012 Feb 8.

DOI:10.1021/pr200194p

PMID:22217156

Abstract

The target-decoy database search strategy is widely accepted as a standard method for estimating the false discovery rate (FDR) of peptide identification; peptide-spectrum matches (PSMs) from the target database are then filtered on the basis of that estimate. To improve the sensitivity of protein identification at a fixed accuracy (frequently defined by a protein FDR threshold), a postprocessing procedure is often used that integrates the results of different peptide search engines applied to the same data set. In this work, we show that PSMs grouped by precursor charge, number of missed internal cleavage sites, modification state, and number of protease termini, and proteins grouped by their unique peptide count, should each be filtered separately according to the given FDR. We also develop an iterative procedure that filters PSMs and proteins simultaneously according to the given FDR. Finally, we present a general framework for integrating the results from different peptide search engines using the same FDR threshold. Our method was tested with several shotgun proteomics data sets that were acquired on multiple LC/MS instruments from two different biological samples. The results showed satisfactory performance. We implemented the method in a user-friendly software package called BuildSummary, which can be downloaded for free from http://www.proteomics.ac.cn/software/proteomicstools/index.htm as part of the software suite ProteomicsTools.
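The group-wise PSM filtering described in the abstract can be illustrated with a minimal Python sketch. This is not the BuildSummary implementation: the `PSM` record, its field names, the function names, the FDR estimate (decoys divided by targets above the cutoff), and the higher-is-better score are all assumptions made for illustration. Protein-level grouping, the iterative PSM/protein procedure, and search-engine integration are not covered.

```python
from collections import defaultdict
from typing import NamedTuple


class PSM(NamedTuple):
    """Hypothetical PSM record; field names are illustrative only."""
    score: float            # assumed: higher score means a better match
    is_decoy: bool          # True if the match came from the decoy database
    charge: int             # precursor charge
    missed_cleavages: int   # number of missed internal cleavage sites
    modified: bool          # modification state
    protease_termini: int   # number of protease termini (0, 1, or 2)


def group_key(psm):
    """The four PSM properties named in the abstract define the group."""
    return (psm.charge, psm.missed_cleavages, psm.modified, psm.protease_termini)


def filter_group(psms, max_fdr):
    """Keep target PSMs above the most permissive cutoff with FDR <= max_fdr.

    Assumed estimate: FDR = decoys / targets among PSMs at or above the
    cutoff. Score ties are not handled specially (a simplification).
    """
    ranked = sorted(psms, key=lambda p: p.score, reverse=True)
    targets = decoys = 0
    best_cut = 0  # number of top-ranked PSMs that pass
    for i, psm in enumerate(ranked, start=1):
        if psm.is_decoy:
            decoys += 1
        else:
            targets += 1
        if targets > 0 and decoys / targets <= max_fdr:
            best_cut = i
    return [p for p in ranked[:best_cut] if not p.is_decoy]


def filter_by_group(psms, max_fdr):
    """Apply the same FDR threshold separately within each PSM group."""
    groups = defaultdict(list)
    for psm in psms:
        groups[group_key(psm)].append(psm)
    accepted = []
    for members in groups.values():
        accepted.extend(filter_group(members, max_fdr))
    return accepted
```

Calling `filter_by_group(psms, 0.01)` on a list of such records would return the target PSMs accepted in each group at an estimated 1% FDR. Because every group receives its own score cutoff, a group with well-separated target and decoy scores is not penalized by a noisier group.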
