• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过分析多个搜索引擎的错误发现率提高蛋白质组学研究的灵敏度。

Improving sensitivity in proteome studies by analysis of false discovery rates for multiple search engines.

作者信息

Jones Andrew R, Siepen Jennifer A, Hubbard Simon J, Paton Norman W

机构信息

Department of Preclinical Veterinary Science, Faculty of Veterinary Science, University of Liverpool, Liverpool, UK.

出版信息

Proteomics. 2009 Mar;9(5):1220-9. doi: 10.1002/pmic.200800473.

DOI:10.1002/pmic.200800473
PMID:19253293
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2899855/
Abstract

LC-MS experiments can generate large quantities of data, for which a variety of database search engines are available to make peptide and protein identifications. Decoy databases are becoming widely used to place statistical confidence in result sets, allowing the false discovery rate (FDR) to be estimated. Different search engines produce different identification sets so employing more than one search engine could result in an increased number of peptides (and proteins) being identified, if an appropriate mechanism for combining data can be defined. We have developed a search engine independent score, based on FDR, which allows peptide identifications from different search engines to be combined, called the FDR Score. The results demonstrate that the observed FDR is significantly different when analysing the set of identifications made by all three search engines, by each pair of search engines or by a single search engine. Our algorithm assigns identifications to groups according to the set of search engines that have made the identification, and re-assigns the score (combined FDR Score). The combined FDR Score can differentiate between correct and incorrect peptide identifications with high accuracy, allowing on average 35% more peptide identifications to be made at a fixed FDR than using a single search engine.

摘要

液相色谱-质谱联用(LC-MS)实验能够产生大量数据,针对这些数据有多种数据库搜索引擎可用于进行肽段和蛋白质鉴定。反向数据库正被广泛用于对结果集进行统计学置信度评估,从而能够估计错误发现率(FDR)。不同的搜索引擎会产生不同的鉴定集,因此如果能够定义一种合适的数据合并机制,使用多个搜索引擎可能会使鉴定出的肽段(和蛋白质)数量增加。我们基于错误发现率开发了一种独立于搜索引擎的评分方法,它能够将来自不同搜索引擎的肽段鉴定结果进行合并,称为错误发现率评分(FDR评分)。结果表明,在分析由所有三个搜索引擎、每对搜索引擎或单个搜索引擎做出的鉴定集时,观察到的错误发现率存在显著差异。我们的算法根据做出鉴定的搜索引擎集将鉴定结果分配到不同组,并重新分配评分(合并后的错误发现率评分)。合并后的错误发现率评分能够以高精度区分正确和错误的肽段鉴定结果,与使用单个搜索引擎相比,在固定的错误发现率下平均能够多鉴定出35%的肽段。

相似文献

1
Improving sensitivity in proteome studies by analysis of false discovery rates for multiple search engines.通过分析多个搜索引擎的错误发现率提高蛋白质组学研究的灵敏度。
Proteomics. 2009 Mar;9(5):1220-9. doi: 10.1002/pmic.200800473.
2
In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.使用多个搜索引擎和明确的指标对蛋白质推断算法进行深入分析。
J Proteomics. 2017 Jan 6;150:170-182. doi: 10.1016/j.jprot.2016.08.002. Epub 2016 Aug 4.
3
Maximizing the sensitivity and reliability of peptide identification in large-scale proteomic experiments by harnessing multiple search engines.利用多个搜索引擎,最大限度地提高大规模蛋白质组学实验中肽鉴定的灵敏度和可靠性。
Proteomics. 2010 Mar;10(6):1172-89. doi: 10.1002/pmic.200900074.
4
MSblender: A probabilistic approach for integrating peptide identifications from multiple database search engines.MSblender:一种整合来自多个数据库搜索引擎的肽鉴定的概率方法。
J Proteome Res. 2011 Jul 1;10(7):2949-58. doi: 10.1021/pr2002116. Epub 2011 Apr 29.
5
Analysis of the resolution limitations of peptide identification algorithms.分析肽鉴定算法的分辨率限制。
J Proteome Res. 2011 Dec 2;10(12):5555-61. doi: 10.1021/pr200913a. Epub 2011 Oct 26.
6
A peptide-retrieval strategy enables significant improvement of quantitative performance without compromising confidence of identification.肽段检索策略可在不影响鉴定置信度的情况下显著提高定量性能。
J Proteomics. 2017 Jan 30;152:276-282. doi: 10.1016/j.jprot.2016.11.020. Epub 2016 Nov 27.
7
Optimization of Search Engines and Postprocessing Approaches to Maximize Peptide and Protein Identification for High-Resolution Mass Data.优化搜索引擎和后处理方法以最大化高分辨率质谱数据的肽段和蛋白质鉴定
J Proteome Res. 2015 Nov 6;14(11):4662-73. doi: 10.1021/acs.jproteome.5b00536. Epub 2015 Sep 30.
8
Improving sensitivity by probabilistically combining results from multiple MS/MS search methodologies.通过概率性合并多种串联质谱(MS/MS)搜索方法的结果来提高灵敏度。
J Proteome Res. 2008 Jan;7(1):245-53. doi: 10.1021/pr070540w.
9
False discovery rates in spectral identification.光谱识别中的假发现率。
BMC Bioinformatics. 2012;13 Suppl 16(Suppl 16):S2. doi: 10.1186/1471-2105-13-S16-S2. Epub 2012 Nov 5.
10
Comparative database search engine analysis on massive tandem mass spectra of pork-based food products for halal proteomics.基于猪肉的食品清真蛋白质组学大规模串联质谱的比较数据库搜索引擎分析
J Proteomics. 2021 Jun 15;241:104240. doi: 10.1016/j.jprot.2021.104240. Epub 2021 Apr 21.

引用本文的文献

1
PgxSAVy: A tool for comprehensive evaluation of variant peptide quality in proteogenomics - catching the (un)usual suspects.PgxSAVy:蛋白质基因组学中变异肽质量综合评估工具——捕捉(不)寻常的可疑对象。
Comput Struct Biotechnol J. 2023 Dec 26;23:711-722. doi: 10.1016/j.csbj.2023.12.033. eCollection 2024 Dec.
2
Top-Down Proteomics and the Challenges of True Proteoform Characterization.自上而下的蛋白质组学和真正的蛋白质组特征分析面临的挑战。
J Proteome Res. 2023 Dec 1;22(12):3663-3675. doi: 10.1021/acs.jproteome.3c00416. Epub 2023 Nov 8.
3
Comparing Top-Down Proteoform Identification: Deconvolution, PrSM Overlap, and PTM Detection.比较自上而下的蛋白组鉴定:去卷积、PrSM 重叠和 PTM 检测。
J Proteome Res. 2023 Jul 7;22(7):2199-2217. doi: 10.1021/acs.jproteome.2c00673. Epub 2023 May 26.
4
Multienzyme deep learning models improve peptide de novo sequencing by mass spectrometry proteomics.多酶深度学习模型通过质谱蛋白质组学提高肽从头测序。
PLoS Comput Biol. 2023 Jan 20;19(1):e1010457. doi: 10.1371/journal.pcbi.1010457. eCollection 2023 Jan.
5
SILAC-based quantitative proteomics to investigate the eicosanoid associated inflammatory response in activated macrophages.基于稳定同位素标记氨基酸的细胞培养定量蛋白质组学技术研究活化巨噬细胞中类花生酸相关的炎症反应
J Inflamm (Lond). 2022 Sep 1;19(1):12. doi: 10.1186/s12950-022-00309-8.
6
Prospects and challenges of cancer systems medicine: from genes to disease networks.癌症系统医学的前景与挑战:从基因到疾病网络。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab343.
7
Cerebrospinal fluid proteome maps detect pathogen-specific host response patterns in meningitis.脑脊液蛋白质组图谱可检测脑膜炎中病原体特异性的宿主反应模式。
Elife. 2021 Apr 6;10:e64159. doi: 10.7554/eLife.64159.
8
Enhancing Open Modification Searches via a Combined Approach Facilitated by Ursgal.通过 Ursgal 辅助的联合方法增强开放修饰搜索。
J Proteome Res. 2021 Apr 2;20(4):1986-1996. doi: 10.1021/acs.jproteome.0c00799. Epub 2021 Jan 29.
9
The Archaeal Proteome Project advances knowledge about archaeal cell biology through comprehensive proteomics.古菌蛋白质组计划通过全面的蛋白质组学研究推进古菌细胞生物学的知识。
Nat Commun. 2020 Jun 19;11(1):3145. doi: 10.1038/s41467-020-16784-7.
10
Automated Nanoflow Two-Dimensional Reversed-Phase Liquid Chromatography System Enables In-Depth Proteome and Phosphoproteome Profiling of Nanoscale Samples.自动化纳流二维反相液相色谱系统可深入分析纳米级样品的蛋白质组和磷酸化蛋白质组图谱。
Anal Chem. 2019 Aug 6;91(15):9707-9715. doi: 10.1021/acs.analchem.9b01248. Epub 2019 Jul 9.

本文引用的文献

1
Improving sensitivity by probabilistically combining results from multiple MS/MS search methodologies.通过概率性合并多种串联质谱(MS/MS)搜索方法的结果来提高灵敏度。
J Proteome Res. 2008 Jan;7(1):245-53. doi: 10.1021/pr070540w.
2
Assigning significance to peptides identified by tandem mass spectrometry using decoy databases.使用诱饵数据库对通过串联质谱鉴定的肽段赋予显著性。
J Proteome Res. 2008 Jan;7(1):29-34. doi: 10.1021/pr700600n. Epub 2007 Dec 8.
3
Posterior error probabilities and false discovery rates: two sides of the same coin.后验错误概率与错误发现率:同一枚硬币的两面。
J Proteome Res. 2008 Jan;7(1):40-4. doi: 10.1021/pr700739d. Epub 2007 Dec 4.
4
Methods, algorithms and tools in computational proteomics: a practical point of view.计算蛋白质组学中的方法、算法与工具:实践视角
Proteomics. 2007 Aug;7(16):2815-32. doi: 10.1002/pmic.200700116.
5
Comparative evaluation of tandem MS search algorithms using a target-decoy search strategy.使用目标-诱饵搜索策略对串联质谱搜索算法进行比较评估。
Mol Cell Proteomics. 2007 Sep;6(9):1599-608. doi: 10.1074/mcp.M600469-MCP200. Epub 2007 May 28.
6
The Association of Biomolecular Resource Facilities Proteomics Research Group 2006 study: relative protein quantitation.生物分子资源设施蛋白质组学研究组2006年的研究:相对蛋白质定量
Mol Cell Proteomics. 2007 Aug;6(8):1291-8. doi: 10.1074/mcp.M700165-MCP200. Epub 2007 May 18.
7
ABRF-PRG04: differentiation of protein isoforms.ABRF-PRG04:蛋白质异构体的分化
J Biomol Tech. 2007 Apr;18(2):124-34.
8
Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry.用于提高质谱法大规模蛋白质鉴定可信度的靶标-诱饵搜索策略。
Nat Methods. 2007 Mar;4(3):207-14. doi: 10.1038/nmeth1019.
9
Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation.绝对蛋白质表达谱分析可估计转录调控和翻译调控的相对贡献。
Nat Biotechnol. 2007 Jan;25(1):117-24. doi: 10.1038/nbt1270. Epub 2006 Dec 24.
10
Overcoming the dynamic range problem in mass spectrometry-based shotgun proteomics.克服基于质谱的鸟枪法蛋白质组学中的动态范围问题。
Expert Rev Proteomics. 2006 Dec;3(6):611-9. doi: 10.1586/14789450.3.6.611.