Arnold Randy J, Jayasankar Narmada, Aggarwal Divya, Tang Haixu, Radivojac Predrag
Department of Chemistry, Indiana University, Bloomington, IN 47405, USA.
Pac Symp Biocomput. 2006:219-30.
Accurate peptide identification from tandem mass spectrometry experiments is the cornerstone of proteomics. Although various approaches for matching database sequences with experimental spectra have been developed to date (e.g. Sequest, Mascot) the sensitivity and specificity of peptide identification have not yet reached their full potential. This is in part due to the tradeoffs between robustness and accuracy of the existing methods with respect to the non-uniform nature of peptide fragmentation and bond cleavages induced by different mass spectrometers. Accordingly, it is expected that new approaches to de novo predicting peptide fragmentation spectra will enable more accurate peptide identification. To address this problem, here we used a data-driven approach to learn peptide fragmentation rules in mass spectrometry, in the form of posterior probabilities, for various fragment-ion types of doubly and triply charged precursor ions. We show that the accuracy of our neural-network based methodology is useful for subsequent peptide database searches and that the most useful rules of fragmentation significantly differ across ion and precursor types.
从串联质谱实验中准确鉴定肽段是蛋白质组学的基石。尽管迄今为止已经开发了各种将数据库序列与实验光谱进行匹配的方法(例如Sequest、Mascot),但肽段鉴定的灵敏度和特异性尚未充分发挥其潜力。部分原因在于现有方法在肽段碎片化和不同质谱仪诱导的键断裂的非均匀性质方面,在稳健性和准确性之间进行了权衡。因此,预计新的从头预测肽段碎片化光谱的方法将实现更准确的肽段鉴定。为了解决这个问题,我们在这里使用了一种数据驱动的方法,以后验概率的形式学习质谱中肽段的碎片化规则,用于双电荷和三电荷前体离子的各种碎片离子类型。我们表明,基于神经网络的方法的准确性对于后续的肽段数据库搜索很有用,并且最有用的碎片化规则在离子和前体类型之间存在显著差异。