• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过蛋白质质谱的多次分析实现数据最大化。

Data maximization by multipass analysis of protein mass spectra.

机构信息

Johns Hopkins Bayview Proteomics Center, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA.

出版信息

Proteomics. 2010 Mar;10(6):1160-71. doi: 10.1002/pmic.200900433.

DOI:10.1002/pmic.200900433
PMID:20082346
Abstract

With the proliferation of search engines for the analysis of MS data, multisearch techniques aimed at boosting the discriminating power of the search engines' score functions have recently become popular. Much statistical and algorithmic work has been done, therefore, in order to be able to combine and parse multiple search streams. However, multisearch techniques suffer from long run times, and may have little impact on false negatives because of similar peptide filtering heuristics between searches. This review focuses, rather, on multipass techniques, which use the results of one search to guide the selection of spectra, parameters and sequences in subsequent searches. This reduces the number of false-negative peptide identifications due to peptide candidate filtering while preserving statistical significance of existing (correct) identifications. Furthermore, this technique avoids substantial increases in running time and, by limiting the search space, does not reduce the statistical significance of correct identifications or introduce a statistically significant number of false-positive identifications. However, we argue that the existing combiner tools are not reliably applicable to these multipass situations, because of algorithmic assumptions about search space and statistical assumptions about the rate of true positives. Here we provide an overview of the advantages of and issues in multipass analysis techniques, the existing methods and workflows available to proteomic researchers, and the unsolved statistical and algorithmic issues amenable to future research.

摘要

随着用于分析 MS 数据的搜索引擎的激增,旨在提高搜索引擎评分函数判别能力的多搜索技术最近变得流行起来。因此,为了能够组合和解析多个搜索流,已经完成了大量的统计和算法工作。但是,多搜索技术运行时间长,并且由于搜索之间的类似肽过滤启发式,可能对假阴性的影响不大。本综述侧重于多遍技术,该技术使用一次搜索的结果来指导后续搜索中光谱、参数和序列的选择。这减少了由于肽候选过滤而导致的假阴性肽鉴定数量,同时保留了现有(正确)鉴定的统计显着性。此外,该技术避免了运行时间的大幅增加,并且通过限制搜索空间,不会降低正确鉴定的统计显着性或引入大量假阳性鉴定。然而,我们认为现有的组合工具不适用于这些多遍情况,因为它们对搜索空间的算法假设和对真实阳性率的统计假设。在这里,我们提供了多遍分析技术的优势和问题、蛋白质组学研究人员可用的现有方法和工作流程以及可用于未来研究的未解决的统计和算法问题的概述。

相似文献

1
Data maximization by multipass analysis of protein mass spectra.通过蛋白质质谱的多次分析实现数据最大化。
Proteomics. 2010 Mar;10(6):1160-71. doi: 10.1002/pmic.200900433.
2
Analysis of the resolution limitations of peptide identification algorithms.分析肽鉴定算法的分辨率限制。
J Proteome Res. 2011 Dec 2;10(12):5555-61. doi: 10.1021/pr200913a. Epub 2011 Oct 26.
3
Estimating the statistical significance of peptide identifications from shotgun proteomics experiments.评估鸟枪法蛋白质组学实验中肽段鉴定结果的统计学显著性。
J Proteome Res. 2007 May;6(5):1758-67. doi: 10.1021/pr0605320. Epub 2007 Mar 31.
4
Quality assessments of peptide-spectrum matches in shotgun proteomics.肽谱匹配在鸟枪法蛋白质组学中的质量评估。
Proteomics. 2011 Mar;11(6):1086-93. doi: 10.1002/pmic.201000432. Epub 2011 Feb 7.
5
Maximizing the sensitivity and reliability of peptide identification in large-scale proteomic experiments by harnessing multiple search engines.利用多个搜索引擎,最大限度地提高大规模蛋白质组学实验中肽鉴定的灵敏度和可靠性。
Proteomics. 2010 Mar;10(6):1172-89. doi: 10.1002/pmic.200900074.
6
CHOMPER: a bioinformatic tool for rapid validation of tandem mass spectrometry search results associated with high-throughput proteomic strategies.CHOMPER:一种用于快速验证与高通量蛋白质组学策略相关的串联质谱搜索结果的生物信息学工具。
Proteomics. 2002 Sep;2(9):1097-103. doi: 10.1002/1615-9861(200209)2:9<1097::AID-PROT1097>3.0.CO;2-X.
7
Search and decoy: the automatic identification of mass spectra.搜索与诱饵:质谱的自动识别
Methods Mol Biol. 2012;893:445-88. doi: 10.1007/978-1-61779-885-6_28.
8
Integrated approach for manual evaluation of peptides identified by searching protein sequence databases with tandem mass spectra.通过串联质谱搜索蛋白质序列数据库鉴定肽段的手动评估综合方法。
J Proteome Res. 2005 May-Jun;4(3):998-1005. doi: 10.1021/pr049754t.
9
VEMS 3.0: algorithms and computational tools for tandem mass spectrometry based identification of post-translational modifications in proteins.VEMS 3.0:用于基于串联质谱法鉴定蛋白质翻译后修饰的算法和计算工具
J Proteome Res. 2005 Nov-Dec;4(6):2338-47. doi: 10.1021/pr050264q.
10
Statistical models for protein validation using tandem mass spectral data and protein amino acid sequence databases.使用串联质谱数据和蛋白质氨基酸序列数据库进行蛋白质验证的统计模型。
Anal Chem. 2004 Mar 15;76(6):1664-71. doi: 10.1021/ac035112y.

引用本文的文献

1
An analysis of proteogenomics and how and when transcriptome-informed reduction of protein databases can enhance eukaryotic proteomics.蛋白质基因组学分析,以及转录组信息如何以及何时减少蛋白质数据库可增强真核蛋白质组学。
Genome Biol. 2022 Jun 20;23(1):132. doi: 10.1186/s13059-022-02701-2.
2
Identification of Antibiotic Resistance Proteins via MiCId's Augmented Workflow. A Mass Spectrometry-Based Proteomics Approach.通过 MiCId 增强工作流程鉴定抗生素耐药蛋白。一种基于质谱的蛋白质组学方法。
J Am Soc Mass Spectrom. 2022 Jun 1;33(6):917-931. doi: 10.1021/jasms.1c00347. Epub 2022 May 2.
3
Influence of Post-Translational Modifications on Protein Identification in Database Searches.
翻译后修饰对数据库搜索中蛋白质鉴定的影响。
ACS Omega. 2021 Mar 15;6(11):7469-7477. doi: 10.1021/acsomega.0c05997. eCollection 2021 Mar 23.
4
An Algorithm to Improve the Speed of Semi and Non-Specific Enzyme Searches in Proteomics.一种提高蛋白质组学中半特异性和非特异性酶搜索速度的算法。
Curr Bioinform. 2020;15(9):1065-1074. doi: 10.2174/1574893615999200429123334.
5
Robust Accurate Identification and Biomass Estimates of Microorganisms via Tandem Mass Spectrometry.通过串联质谱法对微生物进行稳健准确的鉴定和生物量估计。
J Am Soc Mass Spectrom. 2020 Jan 2;31(1):85-102. doi: 10.1021/jasms.9b00035. Epub 2019 Nov 20.
6
Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry.基于高分辨串联质谱的高通量、高准确性微生物快速分类鉴定技术
J Am Soc Mass Spectrom. 2018 Aug;29(8):1721-1737. doi: 10.1007/s13361-018-1986-y. Epub 2018 Jun 5.
7
Modifications in acute phase and complement systems predict shifts in cognitive status of HIV-infected patients.急性期和补体系统的改变预示着HIV感染患者认知状态的变化。
AIDS. 2017 Jun 19;31(10):1365-1378. doi: 10.1097/QAD.0000000000001503.
8
Adaptation of Decoy Fusion Strategy for Existing Multi-Stage Search Workflows.诱骗融合策略在现有多阶段搜索工作流中的适应性调整。
J Am Soc Mass Spectrom. 2016 Sep;27(9):1579-82. doi: 10.1007/s13361-016-1436-7. Epub 2016 Jun 27.
9
PyQuant: A Versatile Framework for Analysis of Quantitative Mass Spectrometry Data.PyQuant:用于定量质谱数据分析的通用框架。
Mol Cell Proteomics. 2016 Aug;15(8):2829-38. doi: 10.1074/mcp.O115.056879. Epub 2016 May 26.
10
JUMPg: An Integrative Proteogenomics Pipeline Identifying Unannotated Proteins in Human Brain and Cancer Cells.JUMPg:一种整合蛋白质基因组学流程,用于鉴定人脑中未注释的蛋白质以及癌细胞中的未注释蛋白质。
J Proteome Res. 2016 Jul 1;15(7):2309-20. doi: 10.1021/acs.jproteome.6b00344. Epub 2016 Jun 13.