Bern Marshall, Cai Yuhan, Goldberg David
Palo Alto Research Center, 3333 Coyote Hill Road, Palo Alto, California 94304, USA.
Anal Chem. 2007 Feb 15;79(4):1393-400. doi: 10.1021/ac0617013. Epub 2007 Jan 23.
A powerful technique for peptide and protein identification is tandem mass spectrometry followed by database search using a program such as SEQUEST or Mascot. These programs, however, become slow and lose sensitivity when allowing nonspecific cleavages or peptide modifications. De novo sequencing and hybrid methods such as sequence tagging offer speed and robustness for wider searches, yet these approaches require better spectra with more complete and consecutive fragmentation and, hence, are less sensitive to low-abundance peptides. Here we describe a new hybrid method that retains the sensitivity of pure database search. The method uses a small amount of de novo analysis to identify likely b- and y-ion peaks--"lookup peaks"--that can then be used to extract candidate peptides from the database, with the number of candidates tunable to fit a computing budget. We describe a program called ByOnic that implements this method, and we benchmark ByOnic on several data sets, including one of mouse blood plasma spiked with low concentrations of recombinant human proteins. We demonstrate that ByOnic is more sensitive than sequence tagging and, indeed, more sensitive than the three most popular pure database search tools--SEQUEST, Mascot, and X!Tandem--on both the peptide and protein levels. On the mouse plasma samples, ByOnic consistently found spiked proteins missed by the other tools.
一种用于肽和蛋白质鉴定的强大技术是串联质谱,随后使用诸如SEQUEST或Mascot等程序进行数据库搜索。然而,当允许非特异性裂解或肽修饰时,这些程序会变慢并失去灵敏度。从头测序和诸如序列标签等混合方法为更广泛的搜索提供了速度和稳健性,但这些方法需要具有更完整和连续片段化的更好光谱,因此对低丰度肽不太敏感。在这里,我们描述了一种新的混合方法,该方法保留了纯数据库搜索的灵敏度。该方法使用少量的从头分析来识别可能的b离子和y离子峰——“查找峰”——然后可用于从数据库中提取候选肽,候选肽的数量可进行调整以适应计算预算。我们描述了一个名为ByOnic的程序来实现这种方法,并且我们在几个数据集上对ByOnic进行了基准测试,包括一个添加了低浓度重组人蛋白质的小鼠血浆数据集。我们证明ByOnic比序列标签更灵敏,实际上,在肽和蛋白质水平上,它比三种最流行的纯数据库搜索工具——SEQUEST、Mascot和X!Tandem——更灵敏。在小鼠血浆样本上,ByOnic始终能发现其他工具遗漏的添加蛋白质。