• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于统计模型构建的方法识别 MS/MS 谱图与 PeptideProphet。

A statistical model-building perspective to identification of MS/MS spectra with PeptideProphet.

机构信息

Department of Statistics, Purdue University, 250 N. University Street, West Lafayette, Indiana, USA.

出版信息

BMC Bioinformatics. 2012;13 Suppl 16(Suppl 16):S1. doi: 10.1186/1471-2105-13-S16-S1. Epub 2012 Nov 5.

DOI:10.1186/1471-2105-13-S16-S1
PMID:23176103
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3489532/
Abstract

PeptideProphet is a post-processing algorithm designed to evaluate the confidence in identifications of MS/MS spectra returned by a database search. In this manuscript we describe the "what and how" of PeptideProphet in a manner aimed at statisticians and life scientists who would like to gain a more in-depth understanding of the underlying statistical modeling. The theory and rationale behind the mixture-modeling approach taken by PeptideProphet is discussed from a statistical model-building perspective followed by a description of how a model can be used to express confidence in the identification of individual peptides or sets of peptides. We also demonstrate how to evaluate the quality of model fit and select an appropriate model from several available alternatives. We illustrate the use of PeptideProphet in association with the Trans-Proteomic Pipeline, a free suite of software used for protein identification.

摘要

PeptideProphet 是一种后处理算法,旨在评估通过数据库搜索返回的 MS/MS 光谱鉴定的置信度。在本文中,我们以一种旨在让希望更深入了解基础统计建模的统计学家和生命科学家能够理解的方式,描述了 PeptideProphet 的“是什么”和“怎么做”。从统计模型构建的角度讨论了 PeptideProphet 采用的混合模型方法的理论和基本原理,然后描述了如何使用模型来表达对单个肽或肽集合鉴定的置信度。我们还展示了如何评估模型拟合的质量并从几个可用的替代方案中选择合适的模型。我们说明了如何将 PeptideProphet 与 Trans-Proteomic Pipeline 一起使用,Trans-Proteomic Pipeline 是一套免费的用于蛋白质鉴定的软件。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/e78a6bd5c941/1471-2105-13-S16-S1-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/198a509f9892/1471-2105-13-S16-S1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/b920cbdb630f/1471-2105-13-S16-S1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/073b2761e784/1471-2105-13-S16-S1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/eb1dfbcc5bcc/1471-2105-13-S16-S1-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/d22ad55f23d7/1471-2105-13-S16-S1-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/e8132aa3e452/1471-2105-13-S16-S1-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/38013ffd4d60/1471-2105-13-S16-S1-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/170a4886a8e0/1471-2105-13-S16-S1-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/eae36554ac43/1471-2105-13-S16-S1-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/804227a88f9c/1471-2105-13-S16-S1-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/c5cf1523c715/1471-2105-13-S16-S1-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/b4e103507ec1/1471-2105-13-S16-S1-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/e78a6bd5c941/1471-2105-13-S16-S1-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/198a509f9892/1471-2105-13-S16-S1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/b920cbdb630f/1471-2105-13-S16-S1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/073b2761e784/1471-2105-13-S16-S1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/eb1dfbcc5bcc/1471-2105-13-S16-S1-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/d22ad55f23d7/1471-2105-13-S16-S1-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/e8132aa3e452/1471-2105-13-S16-S1-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/38013ffd4d60/1471-2105-13-S16-S1-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/170a4886a8e0/1471-2105-13-S16-S1-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/eae36554ac43/1471-2105-13-S16-S1-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/804227a88f9c/1471-2105-13-S16-S1-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/c5cf1523c715/1471-2105-13-S16-S1-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/b4e103507ec1/1471-2105-13-S16-S1-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d61/3489532/e78a6bd5c941/1471-2105-13-S16-S1-13.jpg

相似文献

1
A statistical model-building perspective to identification of MS/MS spectra with PeptideProphet.基于统计模型构建的方法识别 MS/MS 谱图与 PeptideProphet。
BMC Bioinformatics. 2012;13 Suppl 16(Suppl 16):S1. doi: 10.1186/1471-2105-13-S16-S1. Epub 2012 Nov 5.
2
Added value for tandem mass spectrometry shotgun proteomics data validation through isoelectric focusing of peptides.通过肽段等电聚焦对串联质谱鸟枪法蛋白质组学数据进行验证的附加价值。
J Proteome Res. 2005 Nov-Dec;4(6):2273-82. doi: 10.1021/pr050193v.
3
A statistical method for assessing peptide identification confidence in accurate mass and time tag proteomics.一种用于评估精确质量和时间标签蛋白质组学中肽鉴定置信度的统计方法。
Anal Chem. 2011 Aug 15;83(16):6135-40. doi: 10.1021/ac2009806. Epub 2011 Jul 15.
4
iProphet: multi-level integrative analysis of shotgun proteomic data improves peptide and protein identification rates and error estimates.iProphet:高通量蛋白质组学数据的多层次综合分析可提高肽段和蛋白质的鉴定率和错误评估。
Mol Cell Proteomics. 2011 Dec;10(12):M111.007690. doi: 10.1074/mcp.M111.007690. Epub 2011 Aug 29.
5
Statistical validation of peptide identifications in large-scale proteomics using the target-decoy database search strategy and flexible mixture modeling.使用目标-诱饵数据库搜索策略和灵活混合模型对大规模蛋白质组学中的肽段鉴定进行统计验证。
J Proteome Res. 2008 Jan;7(1):286-92. doi: 10.1021/pr7006818. Epub 2007 Dec 14.
6
Semisupervised model-based validation of peptide identifications in mass spectrometry-based proteomics.基于半监督模型的质谱蛋白质组学中肽段鉴定的验证
J Proteome Res. 2008 Jan;7(1):254-65. doi: 10.1021/pr070542g. Epub 2007 Dec 27.
7
COMPASS: a suite of pre- and post-search proteomics software tools for OMSSA.COMPASS:一套用于 OMSSA 的搜索前和搜索后蛋白质组学软件工具。
Proteomics. 2011 Mar;11(6):1064-74. doi: 10.1002/pmic.201000616. Epub 2011 Feb 7.
8
HMMatch: peptide identification by spectral matching of tandem mass spectra using hidden Markov models.HMMatch:使用隐马尔可夫模型通过串联质谱的谱图匹配进行肽段鉴定。
J Comput Biol. 2007 Oct;14(8):1025-43. doi: 10.1089/cmb.2007.0071.
9
The generating function approach for Peptide identification in spectral networks.光谱网络中肽段鉴定的生成函数方法。
J Comput Biol. 2015 May;22(5):353-66. doi: 10.1089/cmb.2014.0165. Epub 2014 Nov 25.
10
Oscore: a combined score to reduce false negative rates for peptide identification in tandem mass spectrometry analysis.Oscore:一种用于降低串联质谱分析中肽段鉴定假阴性率的综合评分。
J Mass Spectrom. 2009 Jan;44(1):25-31. doi: 10.1002/jms.1466.

引用本文的文献

1
Fecal proteomics of wild capuchins reveals impacts of season, diet, age, and, sex on gut physiology.野生卷尾猴的粪便蛋白质组学揭示了季节、饮食、年龄和性别对肠道生理的影响。
bioRxiv. 2025 Jun 21:2025.06.16.659980. doi: 10.1101/2025.06.16.659980.
2
Prosit-XL: enhanced cross-linked peptide identification by fragment intensity prediction to study protein interactions and structures.Prosit-XL:通过片段强度预测增强交联肽鉴定以研究蛋白质相互作用和结构
Nat Commun. 2025 Jul 1;16(1):5429. doi: 10.1038/s41467-025-61203-4.
3
Grape-Pi: graph-based neural networks for enhanced protein identification in proteomics pipelines.

本文引用的文献

1
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.一种将肽的串联质谱数据与蛋白质数据库中氨基酸序列相关联的方法。
J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.
2
A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.用于在鸟枪法蛋白质组学中鉴定肽和蛋白质的计算方法和错误率估计程序的调查。
J Proteomics. 2010 Oct 10;73(11):2092-123. doi: 10.1016/j.jprot.2010.08.009. Epub 2010 Sep 8.
3
A guided tour of the Trans-Proteomic Pipeline.
葡萄-Pi:蛋白质组学流程中用于增强蛋白质鉴定的基于图的神经网络。
Bioinform Adv. 2025 Apr 26;5(1):vbaf095. doi: 10.1093/bioadv/vbaf095. eCollection 2025.
4
Unifying the analysis of bottom-up proteomics data with CHIMERYS.利用CHIMERYS统一自下而上蛋白质组学数据的分析
Nat Methods. 2025 May;22(5):1017-1027. doi: 10.1038/s41592-025-02663-w. Epub 2025 Apr 22.
5
Classification of Collagens via Peptide Ambiguation, in a Paleoproteomic LC-MS/MS-Based Taxonomic Pipeline.基于古蛋白质组液相色谱-串联质谱的分类学流程中通过肽段歧义对胶原蛋白进行分类
J Proteome Res. 2025 Apr 4;24(4):1907-1925. doi: 10.1021/acs.jproteome.4c00962. Epub 2025 Mar 13.
6
Query Mix-Max Method for FDR Estimation Supported by Entrapment Queries.由截留查询支持的用于错误发现率(FDR)估计的查询混合最大化方法。
J Proteome Res. 2025 Mar 7;24(3):1135-1147. doi: 10.1021/acs.jproteome.4c00744. Epub 2025 Feb 5.
7
Introducing "Identification Probability" for Automated and Transferable Assessment of Metabolite Identification Confidence in Metabolomics and Related Studies.介绍用于代谢组学及相关研究中代谢物鉴定置信度的自动化和可转移评估的“鉴定概率”。
Anal Chem. 2025 Jan 14;97(1):1-11. doi: 10.1021/acs.analchem.4c04060. Epub 2024 Dec 19.
8
Machine learning-enhanced immunopeptidomics applied to T-cell epitope discovery for COVID-19 vaccines.机器学习增强免疫肽组学在 COVID-19 疫苗 T 细胞表位发现中的应用。
Nat Commun. 2024 Nov 28;15(1):10316. doi: 10.1038/s41467-024-54734-9.
9
Chemotherapeutic agents and leucine deprivation induce codon-biased aberrant protein production in cancer.化疗药物和亮氨酸缺乏会在癌症中诱导密码子偏向性异常蛋白质产生。
Nucleic Acids Res. 2024 Dec 11;52(22):13964-13979. doi: 10.1093/nar/gkae1110.
10
DNA repair and anti-cancer mechanisms in the long-lived bowhead whale.长寿的弓头鲸中的DNA修复与抗癌机制。
bioRxiv. 2024 Nov 5:2023.05.07.539748. doi: 10.1101/2023.05.07.539748.
《跨蛋白质组学分析流程指南》
Proteomics. 2010 Mar;10(6):1150-9. doi: 10.1002/pmic.200900375.
4
Adaptive discriminant function analysis and reranking of MS/MS database search results for improved peptide identification in shotgun proteomics.用于鸟枪法蛋白质组学中改进肽段鉴定的自适应判别函数分析及串联质谱数据库搜索结果的重排
J Proteome Res. 2008 Nov;7(11):4878-89. doi: 10.1021/pr800484x. Epub 2008 Sep 13.
5
Semisupervised model-based validation of peptide identifications in mass spectrometry-based proteomics.基于半监督模型的质谱蛋白质组学中肽段鉴定的验证
J Proteome Res. 2008 Jan;7(1):254-65. doi: 10.1021/pr070542g. Epub 2007 Dec 27.
6
Statistical validation of peptide identifications in large-scale proteomics using the target-decoy database search strategy and flexible mixture modeling.使用目标-诱饵数据库搜索策略和灵活混合模型对大规模蛋白质组学中的肽段鉴定进行统计验证。
J Proteome Res. 2008 Jan;7(1):286-92. doi: 10.1021/pr7006818. Epub 2007 Dec 14.
7
Assigning significance to peptides identified by tandem mass spectrometry using decoy databases.使用诱饵数据库对通过串联质谱鉴定的肽段赋予显著性。
J Proteome Res. 2008 Jan;7(1):29-34. doi: 10.1021/pr700600n. Epub 2007 Dec 8.
8
Posterior error probabilities and false discovery rates: two sides of the same coin.后验错误概率与错误发现率:同一枚硬币的两面。
J Proteome Res. 2008 Jan;7(1):40-4. doi: 10.1021/pr700739d. Epub 2007 Dec 4.
9
The standard protein mix database: a diverse data set to assist in the production of improved Peptide and protein identification software tools.标准蛋白质混合物数据库:一个多样化的数据集,用于协助开发改进的肽和蛋白质鉴定软件工具。
J Proteome Res. 2008 Jan;7(1):96-103. doi: 10.1021/pr070244j. Epub 2007 Aug 21.
10
Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry.用于提高质谱法大规模蛋白质鉴定可信度的靶标-诱饵搜索策略。
Nat Methods. 2007 Mar;4(3):207-14. doi: 10.1038/nmeth1019.