Fu Yan, Yang Qiang, Sun Ruixiang, Li Dequan, Zeng Rong, Ling Charles X, Gao Wen
Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China.
Bioinformatics. 2004 Aug 12;20(12):1948-54. doi: 10.1093/bioinformatics/bth186. Epub 2004 Mar 25.
The correlation among fragment ions in a tandem mass spectrum is crucial in reducing stochastic mismatches for peptide identification by database searching. Until now, an efficient scoring algorithm that considers the correlative information in a tunable and comprehensive manner has been lacking.
This paper provides a promising approach to utilizing the correlative information for improving the peptide identification accuracy. The kernel trick, rooted in the statistical learning theory, is exploited to address this issue with low computational effort. The common scoring method, the tandem mass spectral dot product (SDP), is extended to the kernel SDP (KSDP). Experiments on a dataset reported previously demonstrate the effectiveness of the KSDP. The implementation on consecutive fragments shows a decrease of 10% in the error rate compared with the SDP. Our software tool, pFind, using a simple scoring function based on the KSDP, outperforms two SDP-based software tools, SEQUEST and Sonar MS/MS, in terms of identification accuracy.
串联质谱中碎片离子之间的相关性对于减少数据库搜索肽段鉴定中的随机错配至关重要。到目前为止,一直缺乏一种能够以可调节且全面的方式考虑相关信息的高效评分算法。
本文提供了一种利用相关信息提高肽段鉴定准确性的有前景的方法。基于统计学习理论的核技巧被用于以低计算量解决此问题。常见的评分方法,串联质谱点积(SDP),被扩展为核SDP(KSDP)。在先前报道的一个数据集上的实验证明了KSDP的有效性。对连续片段的实现表明,与SDP相比,错误率降低了10%。我们基于KSDP使用简单评分函数的软件工具pFind在鉴定准确性方面优于两个基于SDP的软件工具SEQUEST和Sonar MS/MS。