Laboratory for Bioinformatics and Computational Biology, Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Hong Kong, China.
Bioinformatics. 2009 Nov 15;25(22):2969-74. doi: 10.1093/bioinformatics/btp501. Epub 2009 Aug 18.
Database searching is the major peptide identification method in shotgun proteomics. It searches tandem mass spectrometry (MS/MS) spectra against a protein database to identify target peptides. The success of such a database searching method relies on a scoring algorithm that can evaluate the quality of peptide-spectrum matches (PSMs) accurately. However, current scoring algorithms frequently generate inaccurate assignments due to variations and noises in the MS/MS spectra. To address this issue, we like to improve peptide identification by using additional information from other data sources.
Single-stage MS data is complementary to MS/MS data in the sense that it provides broader mass coverage but less sequence information. In this article, we show that single-stage MS data can be used to re-rank PSMs. The proposed method explores a linear combination of scores between MS and MS/MS data to perform re-ranking. Experimental results on real data show that such a re-ranking strategy improves the identification performance significantly.
数据库搜索是 shotgun 蛋白质组学中的主要肽段鉴定方法。它根据蛋白质数据库搜索串联质谱(MS/MS)谱,以鉴定目标肽段。这种数据库搜索方法的成功依赖于一种评分算法,该算法可以准确地评估肽段谱匹配(PSM)的质量。然而,由于 MS/MS 谱中的变化和噪声,当前的评分算法经常会产生不准确的分配。为了解决这个问题,我们希望通过使用来自其他数据源的附加信息来改进肽段鉴定。
单级 MS 数据在提供更广泛的质量覆盖范围但序列信息较少的意义上与 MS/MS 数据互补。在本文中,我们表明单级 MS 数据可用于重新排列 PSM。所提出的方法探索了 MS 和 MS/MS 数据之间的得分线性组合,以进行重新排序。对真实数据的实验结果表明,这种重新排序策略可以显著提高鉴定性能。