MS Proteomics Research Group, HUN-REN Research Centre for Natural Sciences, Magyar Tudósok körútja 2., Budapest H-1117, Hungary.
Faculty of Science, Institute of Chemistry, Hevesy György PhD School of Chemistry, Eötvös Loránd University, Pázmány Péter sétány 1/A, Budapest H-1117, Hungary.
J Am Soc Mass Spectrom. 2024 Feb 7;35(2):333-343. doi: 10.1021/jasms.3c00375. Epub 2024 Jan 29.
High confidence and reproducibility are still challenges in bottom-up mass spectrometric -glycopeptide identification. The collision energy used in the MS/MS measurements and the database search engine used to identify the species are perhaps the two most decisive factors. We investigated how the structural features of -glycopeptides and the choice of the search engine influence the optimal collision energy, delivering the highest identification confidence. We carried out LC-MS/MS measurements using a series of collision energies on a large set of -glycopeptides with both the glycan and peptide part varied and studied the behavior of Byonic, pGlyco, and GlycoQuest scores. We found that search engines show a range of behavior between peptide-centric and glycan-centric, which manifests itself already in the dependence of optimal collision energy on /. Using classical statistical and machine learning methods, we revealed that peptide hydrophobicity, glycan and peptide masses, and the number of mobile protons also have significant and search-engine-dependent influence, as opposed to a series of other parameters we probed. We envisioned an MS/MS workflow making a smart collision energy choice based on online available features such as the hydrophobicity (described by retention time) and glycan mass (potentially available from a scout MS/MS). Our assessment suggests that this workflow can lead to a significant gain (up to 100%) in the identification confidence, particularly for low-scoring hits close to the filtering limit, which has the potential to enhance reproducibility of -glycopeptide analyses. Data are available via MassIVE (MSV000093110).
在从头测序的质谱-糖肽鉴定中,高可信度和可重复性仍然是挑战。在 MS/MS 测量中使用的碰撞能以及用于鉴定物种的数据库搜索引擎可能是两个最决定性的因素。我们研究了 -糖肽的结构特征以及搜索引擎的选择如何影响最佳碰撞能,从而提供最高的鉴定可信度。我们使用一系列碰撞能对一组具有不同聚糖和肽部分的 -糖肽进行了 LC-MS/MS 测量,并研究了 Byonic、pGlyco 和 GlycoQuest 分数的行为。我们发现搜索引擎表现出介于肽中心和聚糖中心之间的一系列行为,这在最佳碰撞能对/的依赖性上表现出来。使用经典的统计和机器学习方法,我们揭示了肽疏水性、聚糖和肽质量以及可移动质子的数量也具有显著的、依赖于搜索引擎的影响,而不是我们探测到的一系列其他参数。我们设想了一种 MS/MS 工作流程,根据在线可用的特征(如保留时间描述的疏水性和可能从侦察 MS/MS 获得的聚糖质量)做出智能的碰撞能选择。我们的评估表明,该工作流程可以显著提高鉴定可信度(高达 100%),特别是对于接近过滤限的低评分命中,这有可能提高 -糖肽分析的重现性。数据可通过 MassIVE(MSV000093110)获得。