MacCoss Michael J, Wu Christine C, Yates John R
Department of Cell Biology, The Scripps Research Institute, La Jolla, California 92037, USA.
Anal Chem. 2002 Nov 1;74(21):5593-9. doi: 10.1021/ac025826t.
Database-searching algorithms compatible with shotgun proteomics match a peptide tandem mass spectrum to a predicted mass spectrum for an amino acid sequence within a database. SEQUEST is one of the most common software algorithms used for the analysis of peptide tandem mass spectra by using a cross-correlation (XCorr) scoring routine to match tandem mass spectra to model spectra derived from peptide sequences. To assess a match, SEQUEST uses the difference between the first- and second-ranked sequences (ACn). This value is dependent on the database size, search parameters, and sequence homologies. In this report, we demonstrate the use of a scoring routine (SEQUEST-NORM) that normalizes XCorr values to be independent of peptide size and the database used to perform the search. This new scoring routine is used to objectively calculate the percent confidence of protein identifications and posttranslational modifications based solely on the XCorr value.
与鸟枪法蛋白质组学兼容的数据库搜索算法,会将肽段串联质谱与数据库中氨基酸序列的预测质谱进行匹配。SEQUEST是用于分析肽段串联质谱的最常用软件算法之一,它通过互相关(XCorr)评分程序,将串联质谱与从肽段序列推导出来的模型质谱进行匹配。为了评估匹配情况,SEQUEST使用排名第一和第二的序列之间的差异(ACn)。该值取决于数据库大小、搜索参数和序列同源性。在本报告中,我们展示了一种评分程序(SEQUEST-NORM)的使用,该程序将XCorr值标准化,使其独立于肽段大小和用于执行搜索的数据库。这种新的评分程序用于仅基于XCorr值客观地计算蛋白质鉴定和翻译后修饰的置信度百分比。