Assessment of false discovery rate control in tandem mass spectrometry analysis using entrapment.

Authors

Wen Bo, Freestone Jack, Riffle Michael, MacCoss Michael J, Noble William S, Keich Uri

Affiliations

Department of Genome Sciences, University of Washington, Seattle, WA, USA.

School of Mathematics and Statistics, University of Sydney, Sydney, New South Wales, Australia.

Publication information

Nat Methods. 2025 Jun 16. doi: 10.1038/s41592-025-02719-x.

DOI:10.1038/s41592-025-02719-x

PMID:40524023

Abstract

A critical challenge in mass spectrometry proteomics is accurately assessing error control, especially given that software tools employ distinct methods for reporting errors. Many tools are closed-source and poorly documented, leading to inconsistent validation strategies. Here we identify three prevalent methods for validating false discovery rate (FDR) control: one invalid, one providing only a lower bound, and one valid but under-powered. The result is that the proteomics community has limited insight into actual FDR control effectiveness, especially for data-independent acquisition (DIA) analyses. We propose a theoretical framework for entrapment experiments, allowing us to rigorously characterize different approaches. Moreover, we introduce a more powerful evaluation method and apply it alongside existing techniques to assess existing tools. We first validate our analysis in the better-understood data-dependent acquisition setup, and then, we analyze DIA data, where we find that no DIA search tool consistently controls the FDR, with particularly poor performance on single-cell datasets.
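
The abstract does not give formulas for the validation methods it compares. As a rough illustration only, the sketch below computes two entrapment-based estimates of the false discovery proportion (FDP) that are commonly described for this kind of experiment: a lower bound and a more conservative "combined" estimate. The function name, the argument names, and the exact formulas are assumptions made for illustration and should be checked against the full paper before use.

```python
def entrapment_fdp_estimates(n_target, n_entrapment, r):
    """Illustrative entrapment-based FDP estimates (assumed formulas).

    n_target      -- discoveries matching the original (target) database
    n_entrapment  -- discoveries matching the entrapment database
    r             -- assumed size ratio of entrapment to target database
    """
    if n_target < 0 or n_entrapment < 0:
        raise ValueError("counts must be non-negative")
    if r <= 0:
        raise ValueError("r must be positive")

    total = n_target + n_entrapment
    if total == 0:
        # No discoveries at all: report zero for both estimates.
        return {"lower_bound": 0.0, "combined": 0.0}

    # Entrapment hits are known to be false, so their share of all
    # discoveries can only underestimate the true FDP.
    lower_bound = n_entrapment / total

    # Assumed "combined" estimate: scale entrapment hits by (1 + 1/r)
    # to also account for false discoveries hidden among the targets.
    combined = n_entrapment * (1.0 + 1.0 / r) / total

    return {"lower_bound": lower_bound, "combined": combined}


if __name__ == "__main__":
    # Hypothetical counts, chosen only to show the call.
    print(entrapment_fdp_estimates(n_target=950, n_entrapment=50, r=1.0))
```

Under these assumptions, a tool reporting discoveries at a 1% FDR threshold would look suspicious if the lower bound itself exceeded 1%, whereas a combined estimate at or below the threshold would be consistent with valid control.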
