On the estimation of false positives in peptide identifications using decoy search strategy.

Authors

Shen Changyu, Sheng Quanhu, Dai Jie, Li Yixue, Zeng Rong, Tang Haixu

Affiliation

Division of Biostatistics, Indiana University School of Medicine, Indianapolis, IN 46202, USA.

Publication

Proteomics. 2009 Jan;9(1):194-204. doi: 10.1002/pmic.200800330.

DOI: 10.1002/pmic.200800330
PMID: 19053142
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3076744/
Abstract

False-positive control and estimation in peptide identification by MS are critically important for reliable inference at the protein level and for downstream bioinformatics analysis. Approaches based on searching against decoy databases have become popular for their conceptual simplicity and ease of implementation. Although various decoy search strategies have been proposed, few studies have investigated how they differ in performance. Using datasets collected on a mixture of model proteins, we demonstrate that a single search against the target database coupled with its reversed version offers a good balance between performance and simplicity. In particular, both the accuracy of the estimated number of false positives and the sensitivity are at least comparable to those of the other procedures examined in this study. We also show that scrambling while preserving the frequency of amino acid words can potentially improve the accuracy of the false-positive estimate, though more studies are needed to investigate the optimal scrambling procedure for specific conditions and the variation of the estimate across repeated scramblings.
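
The single search against a target database concatenated with its reversed version can be illustrated with a minimal Python sketch. It is not the authors' implementation. The `REV_` prefix, the function names, and the input format (one best-scoring hit per spectrum, higher score is better) are assumptions made for illustration. The estimator is the common convention that, in a concatenated search, an incorrect match is equally likely to fall on a target or a decoy entry, so the number of accepted decoy hits estimates the number of false positives among accepted target hits. The paper may evaluate other estimators.

```python
from typing import Dict, List, Tuple

DECOY_PREFIX = "REV_"  # hypothetical naming convention for decoy entries


def add_reversed_decoys(targets: Dict[str, str]) -> Dict[str, str]:
    """Return the target database concatenated with its reversed version."""
    combined = dict(targets)
    for name, sequence in targets.items():
        combined[DECOY_PREFIX + name] = sequence[::-1]  # reverse the protein sequence
    return combined


def estimate_false_positives(
    hits: List[Tuple[float, str]], threshold: float
) -> Tuple[int, int, float]:
    """Estimate false positives among target hits that pass a score threshold.

    hits: (score, matched protein name) for the best hit of each spectrum.
    Returns (accepted target hits, estimated false positives, estimated FDR).
    """
    accepted = [name for score, name in hits if score >= threshold]
    decoy_hits = sum(name.startswith(DECOY_PREFIX) for name in accepted)
    target_hits = len(accepted) - decoy_hits
    # Assumption: wrong matches split evenly between target and decoy entries,
    # so each accepted decoy hit stands for one false positive among the targets.
    estimated_fp = decoy_hits
    fdr = estimated_fp / target_hits if target_hits else 0.0
    return target_hits, estimated_fp, fdr


if __name__ == "__main__":
    database = add_reversed_decoys({"P1": "MKTAYIAK"})
    print(sorted(database))  # target entry plus its reversed decoy

    example_hits = [(3.2, "P1"), (2.9, "P2"), (2.5, "REV_P3"), (2.1, "P4"), (1.0, "REV_P5")]
    print(estimate_false_positives(example_hits, threshold=2.0))
```

With the toy hits above and a threshold of 2.0, three target hits and one decoy hit are accepted, so the sketch reports one estimated false positive among three target identifications.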
