Suppr超能文献

Separating the wheat from the chaff: unbiased filtering of background tandem mass spectra improves protein identification.

作者信息

Junqueira Magno, Spirin Victor, Santana Balbuena Tiago, Waridel Patrice, Surendranath Vineeth, Kryukov Grigoriy, Adzhubei Ivan, Thomas Henrik, Sunyaev Shamil, Shevchenko Andrej

机构信息

Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.

出版信息

J Proteome Res. 2008 Aug;7(8):3382-95. doi: 10.1021/pr800140v. Epub 2008 Jun 18.

Abstract

Only a small fraction of spectra acquired in LC-MS/MS runs matches peptides from target proteins upon database searches. The remaining, operationally termed background, spectra originate from a variety of poorly controlled sources and affect the throughput and confidence of database searches. Here, we report an algorithm and its software implementation that rapidly removes background spectra, regardless of their precise origin. The method estimates the dissimilarity distance between screened MS/MS spectra and unannotated spectra from a partially redundant background library compiled from several control and blank runs. Filtering MS/MS queries enhanced the protein identification capacity when searches lacked spectrum to sequence matching specificity. In sequence-similarity searches it reduced by, on average, 30-fold the number of orphan hits, which were not explicitly related to background protein contaminants and required manual validation. Removing high quality background MS/MS spectra, while preserving in the data set the genuine spectra from target proteins, decreased the false positive rate of stringent database searches and improved the identification of low-abundance proteins.

摘要

相似文献

2
Spectral Library Search Improves Assignment of TMT Labeled MS/MS Spectra.
J Proteome Res. 2018 Sep 7;17(9):3325-3331. doi: 10.1021/acs.jproteome.8b00594. Epub 2018 Aug 16.
3
DISMS2: A flexible algorithm for direct proteome- wide distance calculation of LC-MS/MS runs.
BMC Bioinformatics. 2017 Mar 3;18(1):148. doi: 10.1186/s12859-017-1514-2.
4
Micro-Data-Independent Acquisition for High-Throughput Proteomics and Sensitive Peptide Mass Spectrum Identification.
Anal Chem. 2018 Aug 7;90(15):8905-8911. doi: 10.1021/acs.analchem.8b01026. Epub 2018 Jul 23.
7
A dynamic noise level algorithm for spectral screening of peptide MS/MS spectra.
BMC Bioinformatics. 2010 Aug 23;11:436. doi: 10.1186/1471-2105-11-436.
9
Simplified validation of borderline hits of database searches.
Proteomics. 2008 Oct;8(20):4173-7. doi: 10.1002/pmic.200800250.
10
Enhanced peptide quantification using spectral count clustering and cluster abundance.
BMC Bioinformatics. 2011 Oct 28;12:423. doi: 10.1186/1471-2105-12-423.

引用本文的文献

1
Bipartite graphs in systems biology and medicine: a survey of methods and applications.
Gigascience. 2018 Apr 1;7(4):1-31. doi: 10.1093/gigascience/giy014.
2
Systematic Errors in Peptide and Protein Identification and Quantification by Modified Peptides.
Mol Cell Proteomics. 2016 Aug;15(8):2791-801. doi: 10.1074/mcp.M115.055103. Epub 2016 May 23.
5
One-step purification of assembly-competent tubulin from diverse eukaryotic sources.
Mol Biol Cell. 2012 Nov;23(22):4393-401. doi: 10.1091/mbc.E12-06-0444. Epub 2012 Sep 19.
6
Current challenges in software solutions for mass spectrometry-based quantitative proteomics.
Amino Acids. 2012 Sep;43(3):1087-108. doi: 10.1007/s00726-012-1289-8. Epub 2012 Jul 22.
7
Role for Rif1 in the checkpoint response to damaged DNA in Xenopus egg extracts.
Cell Cycle. 2012 Mar 15;11(6):1183-94. doi: 10.4161/cc.11.6.19636.
8
Direct regulation of Treslin by cyclin-dependent kinase is essential for the onset of DNA replication.
J Cell Biol. 2011 Jun 13;193(6):995-1007. doi: 10.1083/jcb.201102003. Epub 2011 Jun 6.

本文引用的文献

1
Optimization and testing of mass spectral library search algorithms for compound identification.
J Am Soc Mass Spectrom. 1994 Sep;5(9):859-66. doi: 10.1016/1044-0305(94)87009-8.
4
Tandem affinity purification of functional TAP-tagged proteins from human cells.
Nat Protoc. 2007;2(5):1145-51. doi: 10.1038/nprot.2007.172.
5
In-gel digestion for mass spectrometric characterization of proteins and proteomes.
Nat Protoc. 2006;1(6):2856-60. doi: 10.1038/nprot.2006.468.
7
An integrated mass spectrometric and computational framework for the analysis of protein interaction networks.
Nat Biotechnol. 2007 Mar;25(3):345-52. doi: 10.1038/nbt1289. Epub 2007 Feb 25.
9
Proteome informatics I: bioinformatics tools for processing experimental data.
Proteomics. 2006 Oct;6(20):5435-44. doi: 10.1002/pmic.200600273.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验