Large-scale database searching using tandem mass spectra: looking up the answer in the back of the book.

Author information

Sadygov Rovshan G, Cociorva Daniel, Yates John R

Affiliation

Department of Cell Biology, The Scripps Research Institute, La Jolla, California 92037, USA.

Publication information

Nat Methods. 2004 Dec;1(3):195-202. doi: 10.1038/nmeth725.

DOI:10.1038/nmeth725

PMID:15789030

Abstract

Database searching is an essential element of large-scale proteomics. Because these methods are widely used, it is important to understand the rationale of the algorithms. Most algorithms are based on concepts first developed in SEQUEST and PeptideSearch. Four basic approaches are used to determine a match between a spectrum and sequence: descriptive, interpretative, stochastic and probability-based matching. We review the basic concepts used by most search algorithms, the computational modeling of peptide identification and current challenges and limitations of this approach for protein identification.
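
As a rough illustration of what matching a spectrum against a sequence can mean, the sketch below scores candidate peptides by counting theoretical fragment ions that fall near an observed peak. This is a deliberately simplified shared-peak count, not the scoring used by SEQUEST, PeptideSearch, or any of the four approaches named in the abstract. The function names, the 0.5 m/z tolerance, the reduced residue-mass table, and the toy peptides are all assumptions made for this example.

```python
# Monoisotopic residue masses (Da) for a subset of amino acids.
# Assumption: a reduced table that is only large enough for the toy peptides below.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "L": 113.08406, "N": 114.04293,
    "D": 115.02694, "K": 128.09496, "E": 129.04259, "R": 156.10111,
}
WATER = 18.01056   # mass of H2O, added to y ions
PROTON = 1.00728   # mass of one proton, for singly charged ions


def fragment_mzs(peptide):
    """Return singly charged b- and y-ion m/z values for a peptide string."""
    masses = [RESIDUE_MASS[aa] for aa in peptide]
    ions = []
    for i in range(1, len(masses)):
        ions.append(sum(masses[:i]) + PROTON)          # b ion: first i residues
        ions.append(sum(masses[i:]) + WATER + PROTON)  # y ion: remaining residues
    return ions


def shared_peak_count(observed, peptide, tol=0.5):
    """Count theoretical fragments that have an observed peak within tol (m/z)."""
    return sum(
        any(abs(mz - peak) <= tol for peak in observed)
        for mz in fragment_mzs(peptide)
    )


def best_match(observed, candidates, tol=0.5):
    """Return (peptide, score) for the highest-scoring candidate sequence."""
    return max(
        ((pep, shared_peak_count(observed, pep, tol)) for pep in candidates),
        key=lambda pair: pair[1],
    )


# Hypothetical "database" and a synthetic spectrum generated from one entry.
candidates = ["GASVK", "TPEPK", "PEPTK"]
observed = fragment_mzs("PEPTK")
peptide, score = best_match(observed, candidates)
```

A real search engine would also filter candidates by precursor mass, handle multiple charge states and modifications, and use peak intensities; none of that is modeled here.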

Similar articles

On distance and similarity in fold space.

Bioinformatics. 2008 Mar 15;24(6):872-3. doi: 10.1093/bioinformatics/btn040. Epub 2008 Jan 28.

A predictive model for identifying proteins by a single peptide match.

Bioinformatics. 2007 Feb 1;23(3):277-80. doi: 10.1093/bioinformatics/btl595. Epub 2006 Nov 22.

Intensity-based protein identification by machine learning from a library of tandem mass spectra.

Nat Biotechnol. 2004 Feb;22(2):214-9. doi: 10.1038/nbt930. Epub 2004 Jan 18.

Probability-based pattern recognition and statistical framework for randomization: modeling tandem mass spectrum/peptide sequence false match frequencies.

Bioinformatics. 2007 Sep 1;23(17):2210-7. doi: 10.1093/bioinformatics/btm267. Epub 2007 May 17.

Reconsidering complete search algorithms for protein backbone NMR assignment.

Bioinformatics. 2005 Sep 1;21 Suppl 2:ii230-6. doi: 10.1093/bioinformatics/bti1138.

Data pre-processing in liquid chromatography-mass spectrometry-based proteomics.

Bioinformatics. 2005 Nov 1;21(21):4054-9. doi: 10.1093/bioinformatics/bti660. Epub 2005 Sep 8.

Parmodel: a web server for automated comparative modeling of proteins.

Biochem Biophys Res Commun. 2004 Dec 24;325(4):1481-6. doi: 10.1016/j.bbrc.2004.10.192.

Bioinformatics. 2005 Sep 1;21 Suppl 2:ii42-6. doi: 10.1093/bioinformatics/bti1107.

BRAGI: linking and visualization of database information in a 3D viewer and modeling tool.

Bioinformatics. 2005 Apr 1;21(7):1291-3. doi: 10.1093/bioinformatics/bti138. Epub 2004 Nov 16.

Cited by

Detection Methods for Pine Wilt Disease: A Comprehensive Review.

Plants (Basel). 2024 Oct 14;13(20):2876. doi: 10.3390/plants13202876.

Pine wilt disease: what do we know from proteomics?

BMC Plant Biol. 2024 Feb 9;24(1):98. doi: 10.1186/s12870-024-04771-9.

Deep learning-driven fragment ion series classification enables highly precise and sensitive de novo peptide sequencing.

Nat Commun. 2024 Jan 2;15(1):151. doi: 10.1038/s41467-023-44323-7.

Data-Driven Compound Identification in Atmospheric Mass Spectrometry.

Adv Sci (Weinh). 2024 Feb;11(8):e2306235. doi: 10.1002/advs.202306235. Epub 2023 Dec 14.

Starch treatment improves the salivary proteome for subject identification purposes.

Forensic Sci Med Pathol. 2024 Mar;20(1):117-128. doi: 10.1007/s12024-023-00629-y. Epub 2023 Apr 21.

Omics technologies in allergy and asthma research: An EAACI position paper.

Allergy. 2022 Oct;77(10):2888-2908. doi: 10.1111/all.15412. Epub 2022 Jun 30.

Proteomics with Enhanced In-Source Fragmentation/Annotation: Applying XCMS-EISA Informatics and Q-MRM High-Sensitivity Quantification.

J Am Soc Mass Spectrom. 2021 Nov 3;32(11):2644-2654. doi: 10.1021/jasms.1c00188. Epub 2021 Oct 11.

Large-scale tandem mass spectrum clustering using fast nearest neighbor searching.

Rapid Commun Mass Spectrom. 2025 May;39 Suppl 1(Suppl 1):e9153. doi: 10.1002/rcm.9153. Epub 2021 Jul 20.

Machine-learning-enhanced time-of-flight mass spectrometry analysis.

Patterns (N Y). 2021 Jan 21;2(2):100192. doi: 10.1016/j.patter.2020.100192. eCollection 2021 Feb 12.

Proteoform Identification by Combining RNA-Seq and Top-Down Mass Spectrometry.

J Proteome Res. 2021 Jan 1;20(1):261-269. doi: 10.1021/acs.jproteome.0c00369. Epub 2020 Nov 12.
