Payne Samuel H, Yau Margaret, Smolka Marcus B, Tanner Stephen, Zhou Huilin, Bafna Vineet
Bioinformatics Program, University of California San Diego, Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093, USA.
J Proteome Res. 2008 Aug;7(8):3373-81. doi: 10.1021/pr800129m. Epub 2008 Jun 19.
The promise of mass spectrometry as a tool for probing signal-transduction is predicated on reliable identification of post-translational modifications. Phosphorylations are key mediators of cellular signaling, yet are hard to detect, partly because of unusual fragmentation patterns of phosphopeptides. In addition to being accurate, MS/MS identification software must be robust and efficient to deal with increasingly large spectral data sets. Here, we present a new scoring function for the Inspect software for phosphorylated peptide tandem mass spectra for ion-trap instruments, without the need for manual validation. The scoring function was modeled by learning fragmentation patterns from 7677 validated phosphopeptide spectra. We compare our algorithm against SEQUEST and X!Tandem on testing and training data sets. At a 1% false positive rate, Inspect identified the greatest total number of phosphorylated spectra, 13% more than SEQUEST and 39% more than X!Tandem. Spectra identified by Inspect tended to score better in several spectral quality measures. Furthermore, Inspect runs much faster than either SEQUEST or X!Tandem, making desktop phosphoproteomics feasible. Finally, we used our new models to reanalyze a corpus of 423,000 LTQ spectra acquired for a phosphoproteome analysis of Saccharomyces cerevisiae DNA damage and repair pathways and discovered 43% more phosphopeptides than the previous study.
质谱作为一种探测信号转导的工具,其前景取决于能否可靠地识别翻译后修饰。磷酸化是细胞信号传导的关键介质,但难以检测,部分原因是磷酸化肽段的碎片化模式异常。除了准确之外,串联质谱(MS/MS)识别软件还必须强大且高效,以处理日益庞大的光谱数据集。在此,我们为离子阱仪器的磷酸化肽段串联质谱的Inspect软件提出了一种新打分函数,无需人工验证。该打分函数通过从7677个经验证的磷酸化肽段光谱中学习碎片化模式进行建模。我们在测试和训练数据集上,将我们的算法与SEQUEST和X!Tandem进行了比较。在1%的假阳性率下,Inspect识别出的磷酸化光谱总数最多,比SEQUEST多13%,比X!Tandem多39%。Inspect识别出的光谱在几种光谱质量指标上往往得分更高。此外,Inspect的运行速度比SEQUEST或X!Tandem快得多,使得桌面磷酸化蛋白质组学成为可能。最后,我们使用新模型重新分析了为酿酒酵母DNA损伤和修复途径的磷酸化蛋白质组分析获取的423,000个线性离子阱(LTQ)光谱数据集,发现的磷酸化肽段比之前的研究多43%。