Matthiesen Rune, Trelle Morten Beck, Højrup Peter, Bunkenborg Jakob, Jensen Ole N
Department of Biochemistry & Molecular Biology, University of Southern Denmark, Odense, Denmark.
J Proteome Res. 2005 Nov-Dec;4(6):2338-47. doi: 10.1021/pr050264q.
Protein and peptide mass analysis and amino acid sequencing by mass spectrometry is widely used for identification and annotation of post-translational modifications (PTMs) in proteins. Modification-specific mass increments, neutral losses or diagnostic fragment ions in peptide mass spectra provide direct evidence for the presence of post-translational modifications, such as phosphorylation, acetylation, methylation or glycosylation. However, the commonly used database search engines are not always practical for exhaustive searches for multiple modifications and concomitant missed proteolytic cleavage sites in large-scale proteomic datasets, since the search space is dramatically expanded. We present a formal definition of the problem of searching databases with tandem mass spectra of peptides that are partially (sub-stoichiometrically) modified. In addition, an improved search algorithm and peptide scoring scheme that includes modification specific ion information from MS/MS spectra was implemented and tested using the Virtual Expert Mass Spectrometrist (VEMS) software. A set of 2825 peptide MS/MS spectra were searched with 16 variable modifications and 6 missed cleavages. The scoring scheme returned a large set of post-translationally modified peptides including precise information on modification type and position. The scoring scheme was able to extract and distinguish the near-isobaric modifications of trimethylation and acetylation of lysine residues based on the presence and absence of diagnostic neutral losses and immonium ions. In addition, the VEMS software contains a range of new features for analysis of mass spectrometry data obtained in large-scale proteomic experiments. Windows binaries are available at http://www.yass.sdu.dk/.
通过质谱进行蛋白质和肽质量分析以及氨基酸测序被广泛用于蛋白质翻译后修饰(PTM)的鉴定和注释。肽质谱图中修饰特异性的质量增量、中性丢失或诊断性碎片离子为翻译后修饰(如磷酸化、乙酰化、甲基化或糖基化)的存在提供了直接证据。然而,对于大规模蛋白质组学数据集中多种修饰和伴随的蛋白酶解位点缺失进行详尽搜索时,常用的数据库搜索引擎并不总是实用的,因为搜索空间会大幅扩展。我们给出了使用部分(亚化学计量)修饰的肽串联质谱搜索数据库问题的形式化定义。此外,使用虚拟专家质谱仪(VEMS)软件实现并测试了一种改进的搜索算法和肽评分方案,该方案包含来自MS/MS谱图的修饰特异性离子信息。用16种可变修饰和6个缺失裂解位点搜索了一组2825个肽MS/MS谱图。评分方案返回了大量翻译后修饰的肽,包括修饰类型和位置的精确信息。该评分方案能够基于诊断性中性丢失和铵离子的有无来提取和区分赖氨酸残基三甲基化和乙酰化的近等压修饰。此外,VEMS软件包含一系列用于分析大规模蛋白质组学实验中获得的质谱数据的新功能。Windows二进制文件可在http://www.yass.sdu.dk/获取。