Tandem Mass Spectrum Identification via Cascaded Search.

Authors

Kertesz-Farkas Attila, Keich Uri, Noble William Stafford

Affiliations

Department of Genome Sciences, University of Washington, Seattle, Washington 98195, United States.

School of Mathematics and Statistics, University of Sydney, Camperdown, NSW 2006, Australia.

Publication information

J Proteome Res. 2015 Aug 7;14(8):3027-38. doi: 10.1021/pr501173s. Epub 2015 Jun 30.

DOI:10.1021/pr501173s
PMID:26084232
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4533645/
Abstract

Accurate assignment of peptide sequences to observed fragmentation spectra is hindered by the large number of hypotheses that must be considered for each observed spectrum. A high score assigned to a particular peptide-spectrum match (PSM) may not end up being statistically significant after multiple testing correction. Researchers can mitigate this problem by controlling the hypothesis space in various ways: considering only peptides resulting from enzymatic cleavages, ignoring possible post-translational modifications or single nucleotide variants, etc. However, these strategies sacrifice identifications of spectra generated by rarer types of peptides. In this work, we introduce a statistical testing framework, cascade search, that directly addresses this problem. The method requires that the user specify a priori a statistical confidence threshold as well as a series of peptide databases. For instance, such a cascade of databases could include fully tryptic, semitryptic, and nonenzymatic peptides or peptides with increasing numbers of modifications. Cascaded search then gradually expands the list of candidate peptides from more likely peptides toward rare peptides, sequestering at each stage any spectrum that is identified with a specified statistical confidence. We compare cascade search to a standard procedure that lumps all of the peptides into a single database, as well as to a previously described group FDR procedure that computes the FDR separately within each database. We demonstrate, using simulated and real data, that cascade search identifies more spectra at a fixed FDR threshold than with either the ungrouped or grouped approach. Cascade search thus provides a general method for maximizing the number of identified spectra in a statistically rigorous fashion.
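
The staged procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that each stage's database search has already been run and reduced to one best match per spectrum, stored as a (score, is_decoy) pair, and it assumes a simple target-decoy estimate of the FDR (decoys divided by targets above the cutoff), which may differ from the estimator used in the paper. The names StagePSMs, accept_at_fdr, and cascade_search, and the toy scores, are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical input format: for one stage, the best match of each spectrum
# against that stage's target+decoy database, stored as (score, is_decoy).
StagePSMs = Dict[str, Tuple[float, bool]]


def accept_at_fdr(psms: StagePSMs, alpha: float) -> List[str]:
    """Return ids of target PSMs accepted at an estimated FDR <= alpha."""
    ranked = sorted(psms.items(), key=lambda kv: kv[1][0], reverse=True)
    targets = decoys = 0
    cutoff = 0  # largest rank at which the FDR estimate is still <= alpha
    for rank, (_, (_, is_decoy)) in enumerate(ranked, start=1):
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        # Assumed estimator: decoys / targets among the top-ranked PSMs.
        if decoys / max(targets, 1) <= alpha:
            cutoff = rank
    return [sid for sid, (_, is_decoy) in ranked[:cutoff] if not is_decoy]


def cascade_search(stages: List[StagePSMs], alpha: float) -> Dict[str, int]:
    """Map each identified spectrum to the index of the stage that identified it."""
    remaining = {sid for psms in stages for sid in psms}
    identified: Dict[str, int] = {}
    for index, psms in enumerate(stages):
        # Only spectra not yet sequestered compete at this stage.
        pending = {sid: psm for sid, psm in psms.items() if sid in remaining}
        for sid in accept_at_fdr(pending, alpha):
            identified[sid] = index
            remaining.discard(sid)
    return identified


# Toy data: stage 0 could stand for fully tryptic peptides,
# stage 1 for a broader (for example semitryptic) database.
stage0 = {"s1": (9.0, False), "s2": (8.0, False), "s3": (3.0, True),
          "s4": (2.0, True), "s5": (1.0, False)}
stage1 = {"s1": (9.5, False), "s3": (7.0, False), "s4": (6.0, False),
          "s5": (2.0, True)}

print(cascade_search([stage0, stage1], alpha=0.01))
# → {'s1': 0, 's2': 0, 's3': 1, 's4': 1}
```

In this toy run, s1 and s2 are sequestered at the first stage, so the higher-scoring stage 1 match for s1 is never considered. Spectra s3 and s4 are identified only once the candidate list is expanded, and s5 remains unidentified.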
