• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于质谱的蛋白质组学的高通量蛋白质鉴定的蛋白质概率模型。

Protein Probability Model for High-Throughput Protein Identification by Mass Spectrometry-Based Proteomics.

机构信息

Department of Communications Engineering, University of the Basque Country (UPV/EHU), 48013 Bilbao, Spain.

Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), 28049 Madrid, Spain.

出版信息

J Proteome Res. 2020 Mar 6;19(3):1285-1297. doi: 10.1021/acs.jproteome.9b00819. Epub 2020 Feb 25.

DOI:10.1021/acs.jproteome.9b00819
PMID:32037837
Abstract

Shotgun proteomics is the method of choice for high-throughput protein identification; however, robust statistical methods are essential to automatize this task while minimizing the number of false identifications. The standard method for estimating the false discovery rate (FDR) of individual identifications and keeping it below a threshold (typically 1%) is the target-decoy approach. However, numerous works have shown that FDR at the protein level may become much larger than FDR at the peptide level. The development of an appropriate scoring model to identify proteins from their peptides using high-throughput shotgun proteomics is highly needed. In this study, we present a novel protein-level scoring algorithm that uses the scores of the identified peptides and maintains all of the properties expected for a true protein probability. We also present a refinement of the method to calculate FDR at the protein level. These algorithms can be used together as a robust identification workflow suitable for large-scale proteomics, and we show that the identification performance of this workflow is superior to that of other widely used methods in several samples and using different search engines. Our protein probability model offers the scientific community an algorithm that is easy to integrate into protein identification workflows for the automated analysis of shotgun proteomics data.

摘要

shotgun 蛋白质组学是高通量蛋白质鉴定的首选方法;然而,为了在最小化假阳性鉴定数量的同时实现自动化,稳健的统计方法至关重要。估计单个鉴定的假发现率 (FDR) 并将其保持在阈值以下(通常为 1%)的标准方法是靶标-诱饵方法。然而,许多研究表明,蛋白质水平的 FDR 可能比肽水平的 FDR 大得多。需要开发一种合适的评分模型,以便使用高通量 shotgun 蛋白质组学从其肽中识别蛋白质。在这项研究中,我们提出了一种新的蛋白质水平评分算法,该算法使用鉴定肽的分数,并保持所有真正蛋白质概率的预期特性。我们还提出了一种改进的方法来计算蛋白质水平的 FDR。这些算法可以一起用作适合大规模蛋白质组学的稳健鉴定工作流程,我们表明该工作流程的鉴定性能在几个样本和使用不同的搜索引擎时优于其他广泛使用的方法。我们的蛋白质概率模型为科学界提供了一种算法,该算法易于集成到蛋白质鉴定工作流程中,用于 shotgun 蛋白质组学数据的自动分析。

相似文献

1
Protein Probability Model for High-Throughput Protein Identification by Mass Spectrometry-Based Proteomics.基于质谱的蛋白质组学的高通量蛋白质鉴定的蛋白质概率模型。
J Proteome Res. 2020 Mar 6;19(3):1285-1297. doi: 10.1021/acs.jproteome.9b00819. Epub 2020 Feb 25.
2
Unbiased False Discovery Rate Estimation for Shotgun Proteomics Based on the Target-Decoy Approach.基于目标-诱饵法的鸟枪法蛋白质组学无偏错误发现率估计
J Proteome Res. 2017 Feb 3;16(2):393-397. doi: 10.1021/acs.jproteome.6b00144. Epub 2016 Dec 13.
3
False discovery rates in spectral identification.光谱识别中的假发现率。
BMC Bioinformatics. 2012;13 Suppl 16(Suppl 16):S2. doi: 10.1186/1471-2105-13-S16-S2. Epub 2012 Nov 5.
4
Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.用于鸟枪法蛋白质组学的改进型错误发现率估计程序
J Proteome Res. 2015 Aug 7;14(8):3148-61. doi: 10.1021/acs.jproteome.5b00081. Epub 2015 Jul 27.
5
In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.使用多个搜索引擎和明确的指标对蛋白质推断算法进行深入分析。
J Proteomics. 2017 Jan 6;150:170-182. doi: 10.1016/j.jprot.2016.08.002. Epub 2016 Aug 4.
6
Instance based algorithm for posterior probability calculation by target-decoy strategy to improve protein identifications.基于实例的算法,通过目标-诱饵策略计算后验概率以提高蛋白质鉴定率。
Anal Chem. 2008 Dec 1;80(23):9326-35. doi: 10.1021/ac8017229.
7
Repeat-Preserving Decoy Database for False Discovery Rate Estimation in Peptide Identification.重复保留诱饵数据库用于肽鉴定中的错误发现率估计。
J Proteome Res. 2020 Mar 6;19(3):1029-1036. doi: 10.1021/acs.jproteome.9b00555. Epub 2020 Feb 21.
8
Comparative database search engine analysis on massive tandem mass spectra of pork-based food products for halal proteomics.基于猪肉的食品清真蛋白质组学大规模串联质谱的比较数据库搜索引擎分析
J Proteomics. 2021 Jun 15;241:104240. doi: 10.1016/j.jprot.2021.104240. Epub 2021 Apr 21.
9
False Discovery Rate Estimation for Hybrid Mass Spectral Library Search Identifications in Bottom-up Proteomics.用于 Bottom-up 蛋白质组学中混合质谱文库搜索鉴定的假发现率估计。
J Proteome Res. 2019 Sep 6;18(9):3223-3234. doi: 10.1021/acs.jproteome.8b00863. Epub 2019 Aug 14.
10
A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets.一种用于大规模蛋白质组学数据集中蛋白质错误发现率估计的可扩展方法。
Mol Cell Proteomics. 2015 Sep;14(9):2394-404. doi: 10.1074/mcp.M114.046995. Epub 2015 May 17.

引用本文的文献

1
Immunoglobulin A/PIGR axis as potential mediators of human abdominal aortic aneurysms revealed by topologically resolved proteomics.拓扑解析蛋白质组学揭示免疫球蛋白A/PIGR轴作为人类腹主动脉瘤的潜在介质
J Transl Med. 2025 Jul 7;23(1):747. doi: 10.1186/s12967-025-06758-y.
2
Exercise and tumor proteome: insights from a neuroblastoma model.运动与肿瘤蛋白质组学:神经母细胞瘤模型的研究进展
Physiol Genomics. 2024 Dec 1;56(12):833-844. doi: 10.1152/physiolgenomics.00064.2024. Epub 2024 Sep 23.
3
Proteome Analysis of Molecular Events in Oral Pathogenesis and Virus: A Review with a Particular Focus on Periodontitis.
口腔发病机制和病毒中分子事件的蛋白质组分析:牙周炎为重点的综述
Int J Mol Sci. 2020 Jul 22;21(15):5184. doi: 10.3390/ijms21155184.