Liu Yi, Sun Weiping, John Julius, Lajoie Gilles, Ma Bin, Zhang Kaizhong
IEEE Trans Nanobioscience. 2016 Mar;15(2):166-76. doi: 10.1109/TNB.2016.2519841. Epub 2016 Jan 19.
Extensive research has been conducted for the computational analysis of mass spectrometry based proteomics data. However, there are still remaining challenges, among which, one particular challenge is the low identification rate of the collected spectral data. A specific contributing factor is the existence of mixture spectra in the collected MS/MS spectra which are generated by the concurrent fragmentation of multiple precursors in one sequencing attempt. The quite frequently observed mixture spectra necessitates the development of effective computational approaches to characterize those non-conventional spectral data. In this research, we proposed an approach for matching the query mixture spectra with a pair of peptide sequences acquired from the protein database by incorporating a special de novo assisted filtration strategy. The experiment results on two different datasets of MS/MS spectra containing mixed ion fragments from multiple peptides demonstrated the efficiency of the integrated filtration strategy in reducing examination space and verified the effectiveness of the proposed matching scheme as well.
针对基于质谱的蛋白质组学数据的计算分析,已经开展了广泛的研究。然而,仍然存在一些挑战,其中一个特别的挑战是所收集光谱数据的低识别率。一个特定的促成因素是在收集的MS/MS光谱中存在混合光谱,这些混合光谱是在一次测序尝试中由多个前体的同时碎片化产生的。经常观察到的混合光谱使得有必要开发有效的计算方法来表征那些非常规光谱数据。在本研究中,我们提出了一种方法,通过纳入一种特殊的从头测序辅助过滤策略,将查询混合光谱与从蛋白质数据库中获取的一对肽序列进行匹配。对包含来自多个肽的混合离子片段的两个不同MS/MS光谱数据集的实验结果证明了集成过滤策略在减少检查空间方面的效率,并验证了所提出匹配方案的有效性。