• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

准确的肽段碎裂预测可使数据驱动的方法取代并改进蛋白质组学搜索引擎评分函数。

Accurate peptide fragmentation predictions allow data driven approaches to replace and improve upon proteomics search engine scoring functions.

机构信息

VIB-UGent Center for Medical Biotechnology, Ghent, Belgium.

Department of Biomolecular Medicine, Faculty of Medicine, Ghent, Belgium.

出版信息

Bioinformatics. 2019 Dec 15;35(24):5243-5248. doi: 10.1093/bioinformatics/btz383.

DOI:10.1093/bioinformatics/btz383
PMID:31077310
Abstract

MOTIVATION

The use of post-processing tools to maximize the information gained from a proteomics search engine is widely accepted and used by the community, with the most notable example being Percolator-a semi-supervised machine learning model which learns a new scoring function for a given dataset. The usage of such tools is however bound to the search engine's scoring scheme, which doesn't always make full use of the intensity information present in a spectrum. We aim to show how this tool can be applied in such a way that maximizes the use of spectrum intensity information by leveraging another machine learning-based tool, MS2PIP. MS2PIP predicts fragment ion peak intensities.

RESULTS

We show how comparing predicted intensities to annotated experimental spectra by calculating direct similarity metrics provides enough information for a tool such as Percolator to accurately separate two classes of peptide-to-spectrum matches. This approach allows using more information out of the data (compared with simpler intensity based metrics, like peak counting or explained intensities summing) while maintaining control of statistics such as the false discovery rate.

AVAILABILITY AND IMPLEMENTATION

All of the code is available online at https://github.com/compomics/ms2rescore.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

利用后处理工具最大限度地从蛋白质组学搜索引擎中获取信息,这已被社区广泛接受和使用,其中最著名的例子是 Percolator——一种半监督机器学习模型,它为给定数据集学习新的评分函数。然而,此类工具的使用受限于搜索引擎的评分方案,该方案并非总能充分利用谱中的强度信息。我们旨在展示如何通过利用另一个基于机器学习的工具 MS2PIP 来应用此工具,从而最大限度地利用谱强度信息。MS2PIP 可预测碎片离子峰强度。

结果

我们通过计算直接相似性度量值,展示了如何通过将预测强度与注释的实验谱进行比较,为 Percolator 等工具提供足够的信息,以准确地区分两类肽与谱匹配。这种方法允许使用更多的数据信息(与简单的基于强度的指标相比,如峰计数或解释强度求和),同时保持对错误发现率等统计数据的控制。

可用性和实现

所有代码都可在 https://github.com/compomics/ms2rescore 上在线获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
Accurate peptide fragmentation predictions allow data driven approaches to replace and improve upon proteomics search engine scoring functions.准确的肽段碎裂预测可使数据驱动的方法取代并改进蛋白质组学搜索引擎评分函数。
Bioinformatics. 2019 Dec 15;35(24):5243-5248. doi: 10.1093/bioinformatics/btz383.
2
Sensitive and Specific Spectral Library Searching with CompOmics Spectral Library Searching Tool and Percolator.使用 CompOmics 光谱库检索工具和 percolator 进行敏感和特异的光谱库检索。
J Proteome Res. 2022 May 6;21(5):1365-1370. doi: 10.1021/acs.jproteome.2c00075. Epub 2022 Apr 21.
3
MS2PIP: a tool for MS/MS peak intensity prediction.MS2PIP:用于 MS/MS 峰强度预测的工具。
Bioinformatics. 2013 Dec 15;29(24):3199-203. doi: 10.1093/bioinformatics/btt544. Epub 2013 Sep 27.
4
In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.使用多个搜索引擎和明确的指标对蛋白质推断算法进行深入分析。
J Proteomics. 2017 Jan 6;150:170-182. doi: 10.1016/j.jprot.2016.08.002. Epub 2016 Aug 4.
5
Data-Dependent Scoring Parameter Optimization in MS-GF+ Using Spectrum Quality Filter.基于谱质量过滤的 MS-GF+ 数据相关评分参数优化
J Proteome Res. 2018 Oct 5;17(10):3593-3598. doi: 10.1021/acs.jproteome.8b00415. Epub 2018 Jul 26.
6
Annotation of tandem mass spectrometry data using stochastic neural networks in shotgun proteomics.基于随机神经网络的 shotgun 蛋白质组学串联质谱数据注释。
Bioinformatics. 2020 Jun 1;36(12):3781-3787. doi: 10.1093/bioinformatics/btaa206.
7
Optimization of Search Engines and Postprocessing Approaches to Maximize Peptide and Protein Identification for High-Resolution Mass Data.优化搜索引擎和后处理方法以最大化高分辨率质谱数据的肽段和蛋白质鉴定
J Proteome Res. 2015 Nov 6;14(11):4662-73. doi: 10.1021/acs.jproteome.5b00536. Epub 2015 Sep 30.
8
INFERYS rescoring: Boosting peptide identifications and scoring confidence of database search results.INFERYS重新评分:提高肽段鉴定及数据库搜索结果的评分可信度。
Rapid Commun Mass Spectrom. 2025 May;39 Suppl 1:e9128. doi: 10.1002/rcm.9128. Epub 2021 Jun 28.
9
MSRescore 3.0 Is a Modular, Flexible, and User-Friendly Platform to Boost Peptide Identifications, as Showcased with MS Amanda 3.0.MSRescore 3.0 是一个模块化、灵活且用户友好的平台,可提高肽鉴定的质量,这一点在 MS Amanda 3.0 中得到了很好的展示。
J Proteome Res. 2024 Aug 2;23(8):3200-3207. doi: 10.1021/acs.jproteome.3c00785. Epub 2024 Mar 16.
10
General framework for developing and evaluating database scoring algorithms using the TANDEM search engine.使用串联搜索引擎开发和评估数据库评分算法的总体框架。
Bioinformatics. 2006 Nov 15;22(22):2830-2. doi: 10.1093/bioinformatics/btl379. Epub 2006 Jul 28.

引用本文的文献

1
TIMSRescore: A Data Dependent Acquisition-Parallel Accumulation and Serial Fragmentation-Optimized Data-Driven Rescoring Pipeline Based on MSRescore.TIMS重评分:一种基于MSRescore的数据依赖采集-并行累积与串行碎片化优化的数据驱动重评分流程。
J Proteome Res. 2025 Mar 7;24(3):1067-1076. doi: 10.1021/acs.jproteome.4c00609. Epub 2025 Feb 6.
2
ProtGraph: a tool for the quick and comprehensive exploration and exploitation of the peptide search space derived from protein sequence databases using graphs.ProtGraph:一种利用图形对源自蛋白质序列数据库的肽搜索空间进行快速全面探索和利用的工具。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae671.
3
Proteome-wide non-cleavable crosslink identification with MS Annika 3.0 reveals the structure of the C. elegans Box C/D complex.
利用MS Annika 3.0进行全蛋白质组不可裂解交联鉴定揭示了秀丽隐杆线虫Box C/D复合物的结构。
Commun Chem. 2024 Dec 19;7(1):300. doi: 10.1038/s42004-024-01386-x.
4
Interrogating data-independent acquisition LC-MS/MS for affinity proteomics.用于亲和蛋白质组学的非数据依赖型采集液相色谱-串联质谱分析
J Proteins Proteom. 2024;15(3):281-298. doi: 10.1007/s42485-024-00166-4. Epub 2024 Sep 17.
5
Rescoring Peptide Spectrum Matches: Boosting Proteomics Performance by Integrating Peptide Property Predictors Into Peptide Identification.重新评分肽谱匹配:通过将肽性质预测器集成到肽鉴定中提高蛋白质组学性能。
Mol Cell Proteomics. 2024 Jul;23(7):100798. doi: 10.1016/j.mcpro.2024.100798. Epub 2024 Jun 11.
6
Fit for Purpose Approach To Evaluate Detection of Amino Acid Substitutions in Shotgun Proteomics.适用于目的的方法来评估 shotgun 蛋白质组学中氨基酸取代的检测。
J Proteome Res. 2024 Apr 5;23(4):1263-1271. doi: 10.1021/acs.jproteome.3c00730. Epub 2024 Mar 13.
7
MARS an improved de novo peptide candidate selection method for non-canonical antigen target discovery in cancer.MARS:一种改进的从头多肽候选物选择方法,用于癌症中非规范抗原靶标的发现。
Nat Commun. 2024 Jan 22;15(1):661. doi: 10.1038/s41467-023-44460-z.
8
Deep Learning Prediction Boosts Phosphoproteomics-Based Discoveries Through Improved Phosphopeptide Identification.深度学习预测通过改进磷酸肽鉴定来增强基于磷酸蛋白质组学的发现。
Mol Cell Proteomics. 2024 Feb;23(2):100707. doi: 10.1016/j.mcpro.2023.100707. Epub 2023 Dec 26.
9
The Association of Biomolecular Resource Facilities Proteome Informatics Research Group Study on Metaproteomics (iPRG-2020).生物分子资源设施协会蛋白质组信息学研究组关于宏蛋白质组学的研究 (iPRG-2020)。
J Biomol Tech. 2023 Aug 7;34(3). doi: 10.7171/3fc1f5fe.a058bad4. eCollection 2023 Sep 30.
10
Insight on physicochemical properties governing peptide MS1 response in HPLC-ESI-MS/MS: A deep learning approach.关于HPLC-ESI-MS/MS中控制肽段MS1响应的物理化学性质的见解:一种深度学习方法。
Comput Struct Biotechnol J. 2023 Jul 22;21:3715-3727. doi: 10.1016/j.csbj.2023.07.027. eCollection 2023.