用于鸟枪法蛋白质组学的改进型错误发现率估计程序

Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.

作者信息

Keich Uri, Kertesz-Farkas Attila, Noble William Stafford

机构信息

†School of Mathematics and Statistics F07, University of Sydney, Sydney, New South Wales 2006, Australia.

‡Department of Genome Sciences, University of Washington, Foege Building S220B, 3720 15th Avenue North East, Seattle, Washington 98195-5065, United States.

出版信息

J Proteome Res. 2015 Aug 7;14(8):3148-61. doi: 10.1021/acs.jproteome.5b00081. Epub 2015 Jul 27.

DOI:10.1021/acs.jproteome.5b00081

PMID:26152888

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4533616/

Abstract

Interpreting the potentially vast number of hypotheses generated by a shotgun proteomics experiment requires a valid and accurate procedure for assigning statistical confidence estimates to identified tandem mass spectra. Despite the crucial role such procedures play in most high-throughput proteomics experiments, the scientific literature has not reached a consensus about the best confidence estimation methodology. In this work, we evaluate, using theoretical and empirical analysis, four previously proposed protocols for estimating the false discovery rate (FDR) associated with a set of identified tandem mass spectra: two variants of the target-decoy competition protocol (TDC) of Elias and Gygi and two variants of the separate target-decoy search protocol of Käll et al. Our analysis reveals significant biases in the two separate target-decoy search protocols. Moreover, the one TDC protocol that provides an unbiased FDR estimate among the target PSMs does so at the cost of forfeiting a random subset of high-scoring spectrum identifications. We therefore propose the mix-max procedure to provide unbiased, accurate FDR estimates in the presence of well-calibrated scores. The method avoids biases associated with the two separate target-decoy search protocols and also avoids the propensity for target-decoy competition to discard a random subset of high-scoring target identifications.

摘要

解读鸟枪法蛋白质组学实验中可能产生的大量假设，需要一个有效且准确的程序，用于为已识别的串联质谱分配统计置信度估计值。尽管此类程序在大多数高通量蛋白质组学实验中起着关键作用，但科学文献尚未就最佳置信度估计方法达成共识。在这项工作中，我们通过理论和实证分析，评估了四种先前提出的用于估计与一组已识别串联质谱相关的错误发现率（FDR）的方案：Elias和Gygi的目标-诱饵竞争方案（TDC）的两种变体，以及Käll等人的单独目标-诱饵搜索方案的两种变体。我们的分析揭示了两种单独目标-诱饵搜索方案中存在显著偏差。此外，在目标肽段谱匹配（PSM）中提供无偏FDR估计的一种TDC方案，是以放弃一部分高分谱图识别结果的随机子集为代价的。因此，我们提出了混合最大程序，以便在存在校准良好的分数时提供无偏、准确的FDR估计。该方法避免了与两种单独目标-诱饵搜索方案相关的偏差，也避免了目标-诱饵竞争丢弃一部分高分目标识别结果的随机子集的倾向。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8de2/4533616/b3f7fdd73a34/pr-2015-00081t_0001.jpg

相似文献

Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.

J Proteome Res. 2015 Aug 7;14(8):3148-61. doi: 10.1021/acs.jproteome.5b00081. Epub 2015 Jul 27.

Decoy methods for assessing false positives and false discovery rates in shotgun proteomics.

Anal Chem. 2009 Jan 1;81(1):146-59. doi: 10.1021/ac801664q.

Two-dimensional target decoy strategy for shotgun proteomics.

J Proteome Res. 2011 Dec 2;10(12):5296-301. doi: 10.1021/pr200780j. Epub 2011 Nov 7.

Unbiased False Discovery Rate Estimation for Shotgun Proteomics Based on the Target-Decoy Approach.

J Proteome Res. 2017 Feb 3;16(2):393-397. doi: 10.1021/acs.jproteome.6b00144. Epub 2016 Dec 13.

Averaging Strategy To Reduce Variability in Target-Decoy Estimates of False Discovery Rate.

J Proteome Res. 2019 Feb 1;18(2):585-593. doi: 10.1021/acs.jproteome.8b00802. Epub 2019 Jan 3.

Bias in False Discovery Rate Estimation in Mass-Spectrometry-Based Peptide Identification.

J Proteome Res. 2019 May 3;18(5):2354-2358. doi: 10.1021/acs.jproteome.8b00991. Epub 2019 Apr 18.

Common Decoy Distributions Simplify False Discovery Rate Estimation in Shotgun Proteomics.

J Proteome Res. 2022 Feb 4;21(2):339-348. doi: 10.1021/acs.jproteome.1c00600. Epub 2022 Jan 6.

Improving Peptide-Level Mass Spectrometry Analysis via Double Competition.

J Proteome Res. 2022 Oct 7;21(10):2412-2420. doi: 10.1021/acs.jproteome.2c00282. Epub 2022 Sep 27.

Target-decoy false discovery rate estimation using Crema.

Proteomics. 2024 Apr;24(8):e2300084. doi: 10.1002/pmic.202300084. Epub 2024 Feb 21.

Modeling Lower-Order Statistics to Enable Decoy-Free FDR Estimation in Proteomics.

J Proteome Res. 2023 Apr 7;22(4):1159-1171. doi: 10.1021/acs.jproteome.2c00604. Epub 2023 Mar 24.

引用本文的文献

Ketogenic Metabolism in Neurodegenerative Diseases: Mechanisms of Action and Therapeutic Potential.

Metabolites. 2025 Jul 31;15(8):508. doi: 10.3390/metabo15080508.

Query Mix-Max Method for FDR Estimation Supported by Entrapment Queries.

J Proteome Res. 2025 Mar 7;24(3):1135-1147. doi: 10.1021/acs.jproteome.4c00744. Epub 2025 Feb 5.

PyViscount: Validating False Discovery Rate Estimation Methods via Random Search Space Partition.

J Proteome Res. 2025 Mar 7;24(3):1118-1134. doi: 10.1021/acs.jproteome.4c00743. Epub 2025 Feb 5.

Ion entropy and accurate entropy-based FDR estimation in metabolomics.

Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae056.

Amino acid sequence assignment from single molecule peptide sequencing data using a two-stage classifier.

PLoS Comput Biol. 2023 May 30;19(5):e1011157. doi: 10.1371/journal.pcbi.1011157. eCollection 2023 May.

Analyzing rare mutations in metagenomes assembled using long and accurate reads.

Genome Res. 2022 Nov-Dec;32(11-12):2119-2133. doi: 10.1101/gr.276917.122. Epub 2022 Nov 23.

An analysis of proteogenomics and how and when transcriptome-informed reduction of protein databases can enhance eukaryotic proteomics.

Genome Biol. 2022 Jun 20;23(1):132. doi: 10.1186/s13059-022-02701-2.

Considerations for constructing a protein sequence database for metaproteomics.

Comput Struct Biotechnol J. 2022 Jan 21;20:937-952. doi: 10.1016/j.csbj.2022.01.018. eCollection 2022.

Mapping specificity, cleavage entropy, allosteric changes and substrates of blood proteases in a high-throughput screen.

Nat Commun. 2021 Mar 16;12(1):1693. doi: 10.1038/s41467-021-21754-8.

Proteome Analysis of Molecular Events in Oral Pathogenesis and Virus: A Review with a Particular Focus on Periodontitis.

Int J Mol Sci. 2020 Jul 22;21(15):5184. doi: 10.3390/ijms21155184.

本文引用的文献

On the importance of well-calibrated scores for identifying shotgun proteomics spectra.

J Proteome Res. 2015 Feb 6;14(2):1147-60. doi: 10.1021/pr5010983. Epub 2014 Dec 17.

MS-GF+ makes progress towards a universal database search tool for proteomics.

Nat Commun. 2014 Oct 31;5:5277. doi: 10.1038/ncomms6277.

Crux: rapid open source protein tandem mass spectrometry analysis.

J Proteome Res. 2014 Oct 3;13(10):4488-91. doi: 10.1021/pr500741y. Epub 2014 Sep 9.

Computing exact p-values for a cross-correlation shotgun proteomics score function.

Mol Cell Proteomics. 2014 Sep;13(9):2467-79. doi: 10.1074/mcp.O113.036327. Epub 2014 Jun 2.

Global analysis of protein expression and phosphorylation of three stages of Plasmodium falciparum intraerythrocytic development.

J Proteome Res. 2013 Sep 6;12(9):4028-45. doi: 10.1021/pr400394g. Epub 2013 Aug 26.

False discovery rates in spectral identification.

BMC Bioinformatics. 2012;13 Suppl 16(Suppl 16):S2. doi: 10.1186/1471-2105-13-S16-S2. Epub 2012 Nov 5.

Faster SEQUEST searching for peptide identification from tandem mass spectra.

J Proteome Res. 2011 Sep 2;10(9):3871-9. doi: 10.1021/pr101196n. Epub 2011 Jul 29.

Quality assessments of peptide-spectrum matches in shotgun proteomics.

Proteomics. 2011 Mar;11(6):1086-93. doi: 10.1002/pmic.201000432. Epub 2011 Feb 7.

A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.

J Proteomics. 2010 Oct 10;73(11):2092-123. doi: 10.1016/j.jprot.2010.08.009. Epub 2010 Sep 8.

Target-decoy search strategy for mass spectrometry-based proteomics.

Methods Mol Biol. 2010;604:55-71. doi: 10.1007/978-1-60761-444-9_5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于鸟枪法蛋白质组学的改进型错误发现率估计程序

Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.

作者信息

Keich Uri, Kertesz-Farkas Attila, Noble William Stafford

机构信息

†School of Mathematics and Statistics F07, University of Sydney, Sydney, New South Wales 2006, Australia.

‡Department of Genome Sciences, University of Washington, Foege Building S220B, 3720 15th Avenue North East, Seattle, Washington 98195-5065, United States.

出版信息

J Proteome Res. 2015 Aug 7;14(8):3148-61. doi: 10.1021/acs.jproteome.5b00081. Epub 2015 Jul 27.

DOI:10.1021/acs.jproteome.5b00081

PMID:26152888

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4533616/

Abstract

摘要

用于鸟枪法蛋白质组学的改进型错误发现率估计程序

Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于鸟枪法蛋白质组学的改进型错误发现率估计程序

Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.

作者信息

机构信息

出版信息