College of Chemistry and Chemical Engineering, Central South University, Changsha, People's Republic of China.
J Am Soc Mass Spectrom. 2013 Jun;24(6):857-67. doi: 10.1007/s13361-013-0591-3. Epub 2013 Mar 16.
A comprehensive investigation was performed to understand the influence of sequence scrambling in peptide ions on peptide identification results. To achieve this, four tandem mass spectrometry datasets with scrambled ions included and with them excluded were analyzed by Crux, X!Tandem, SpectraST, Lutefisk, and PepNovo. While the different algorithms differed in their performance, an increase in the number of correctly identified peptides was generally observed when removing scrambled ions, with the exception of the SpectraST algorithm. However, the variation of the match scores upon removal was unpredictable. Following these investigations, an interpretation was given on how the scrambled ions affect peptide identification. Lastly, a simulated theoretical mass spectral library derived from the NIST peptide Libraries was constructed and searched by SpectraST to study whether scrambled ions in predicted mass spectra could affect peptide identification. Consistent with the peptide library search results, no significant variations for dot product scores as well as peptide identification results were observed when these ions were included in the theoretical MS/MS spectra. From the five adopted algorithms, the SpectraST and Crux provided the most robust results, whereas X!Tandem, PepNovo, and Lutefisk were sensitive to the existence of the scrambled ions, especially the latter two de novo sequencing algorithms.
进行了全面的调查,以了解肽离子序列乱序对肽鉴定结果的影响。为此,通过 Crux、X!Tandem、SpectraST、Lutefisk 和 PepNovo 分析了四个包含和不包含乱序离子的串联质谱数据集。虽然不同的算法在性能上有所不同,但在去除乱序离子后,通常会观察到正确鉴定的肽数量增加,除了 SpectraST 算法。然而,去除后匹配分数的变化是不可预测的。在这些研究之后,对乱序离子如何影响肽鉴定进行了解释。最后,构建了一个源自 NIST 肽库的模拟理论质谱文库,并由 SpectraST 进行搜索,以研究预测质谱中的乱序离子是否会影响肽鉴定。与肽文库搜索结果一致,当将这些离子包含在理论 MS/MS 光谱中时,点积分数和肽鉴定结果没有观察到显著变化。在采用的五个算法中,SpectraST 和 Crux 提供了最稳健的结果,而 X!Tandem、PepNovo 和 Lutefisk 对乱序离子的存在敏感,特别是后两个从头测序算法。