用于串联质谱中肽段鉴定的改进序列标签生成方法。

Improved sequence tag generation method for peptide identification in tandem mass spectrometry.

作者信息

Cao Xia, Nesvizhskii Alexey I

机构信息

Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, USA.

出版信息

J Proteome Res. 2008 Oct;7(10):4422-34. doi: 10.1021/pr800400q. Epub 2008 Sep 12.

DOI:10.1021/pr800400q

PMID:18785767

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3744226/

Abstract

The sequence tag-based peptide identification methods are a promising alternative to the traditional database search approach. However, a more comprehensive analysis, optimization, and comparison with established methods are necessary before these methods can gain widespread use in the proteomics community. Using the InsPecT open source code base ( Tanner et al., Anal. Chem. 2005, 77, 4626- 39 ), we present an improved sequence tag generation method that directly incorporates multicharged fragment ion peaks present in many tandem mass spectra of higher charge states. We also investigate the performance of sequence tagging under different settings using control data sets generated on five different types of mass spectrometers, as well as using a complex phosphopeptide-enriched sample. We also demonstrate that additional modeling of InsPecT search scores using a semiparametric approach incorporating the accuracy of the precursor ion mass measurement provides additional improvement in the ability to discriminate between correct and incorrect peptide identifications. The overall superior performance of the sequence tag-based peptide identification method is demonstrated by comparison with a commonly used SEQUEST/PeptideProphet approach.

摘要

基于序列标签的肽段鉴定方法是传统数据库搜索方法的一种有前景的替代方法。然而，在这些方法能够在蛋白质组学界广泛应用之前，需要进行更全面的分析、优化以及与现有方法的比较。利用InsPecT开源代码库（Tanner等人，《分析化学》，2005年，77卷，4626 - 39页），我们提出了一种改进的序列标签生成方法，该方法直接纳入了许多更高电荷态串联质谱中存在的多电荷碎片离子峰。我们还使用在五种不同类型质谱仪上生成的对照数据集以及使用复杂的富含磷酸肽的样品，研究了在不同设置下序列标签的性能。我们还证明，使用结合前体离子质量测量准确性的半参数方法对InsPecT搜索分数进行额外建模，在区分正确和错误肽段鉴定的能力方面提供了额外的改进。通过与常用的SEQUEST/PeptideProphet方法比较，证明了基于序列标签的肽段鉴定方法的总体优越性能。

相似文献

Improved sequence tag generation method for peptide identification in tandem mass spectrometry.

J Proteome Res. 2008 Oct;7(10):4422-34. doi: 10.1021/pr800400q. Epub 2008 Sep 12.

Characterization of strategies for obtaining confident identifications in bottom-up proteomics measurements using hybrid FTMS instruments.

Anal Chem. 2008 Nov 15;80(22):8514-25. doi: 10.1021/ac801376g. Epub 2008 Oct 15.

Statistical validation of peptide identifications in large-scale proteomics using the target-decoy database search strategy and flexible mixture modeling.

J Proteome Res. 2008 Jan;7(1):286-92. doi: 10.1021/pr7006818. Epub 2007 Dec 14.

SQID: an intensity-incorporated protein identification algorithm for tandem mass spectrometry.

J Proteome Res. 2011 Apr 1;10(4):1593-602. doi: 10.1021/pr100959y. Epub 2011 Feb 23.

Adaptive discriminant function analysis and reranking of MS/MS database search results for improved peptide identification in shotgun proteomics.

J Proteome Res. 2008 Nov;7(11):4878-89. doi: 10.1021/pr800484x. Epub 2008 Sep 13.

Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?

Brief Bioinform. 2018 Sep 28;19(5):954-970. doi: 10.1093/bib/bbx033.

Multiplexed Post-Experimental Monoisotopic Mass Refinement (mPE-MMR) to Increase Sensitivity and Accuracy in Peptide Identifications from Tandem Mass Spectra of Cofragmentation.

Anal Chem. 2017 Jan 17;89(2):1244-1253. doi: 10.1021/acs.analchem.6b03874. Epub 2016 Dec 22.

Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries.

Anal Chem. 2006 Aug 15;78(16):5678-84. doi: 10.1021/ac060279n.

MAZIE: a mass and charge inference engine to enhance database searching of tandem mass spectra.

J Am Soc Mass Spectrom. 2010 Jan;21(1):80-7. doi: 10.1016/j.jasms.2009.09.007. Epub 2009 Sep 17.

Using SEQUEST with theoretically complete sequence databases.

J Am Soc Mass Spectrom. 2015 Nov;26(11):1858-64. doi: 10.1007/s13361-015-1228-5. Epub 2015 Aug 4.

引用本文的文献

Validation of De Novo Peptide Sequences with Bottom-Up Tag Convolution.

Proteomes. 2021 Dec 29;10(1):1. doi: 10.3390/proteomes10010001.

A graph-based filtering method for top-down mass spectral identification.

BMC Genomics. 2018 Sep 24;19(Suppl 7):666. doi: 10.1186/s12864-018-5026-x.

A Spectrum Graph-Based Protein Sequence Filtering Algorithm for Proteoform Identification by Top-Down Mass Spectrometry.

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2017 Nov;2017:222-229. doi: 10.1109/BIBM.2017.8217653. Epub 2017 Dec 18.

Systematic Evaluation of Protein Sequence Filtering Algorithms for Proteoform Identification Using Top-Down Mass Spectrometry.

Proteomics. 2018 Feb;18(3-4). doi: 10.1002/pmic.201700306. Epub 2018 Feb 6.

Protein analysis by shotgun/bottom-up proteomics.

Chem Rev. 2013 Apr 10;113(4):2343-94. doi: 10.1021/cr3003533. Epub 2013 Feb 26.

Speeding up tandem mass spectral identification using indexes.

Bioinformatics. 2012 Jul 1;28(13):1692-7. doi: 10.1093/bioinformatics/bts244. Epub 2012 Apr 27.

Towards an understanding of wheat chloroplasts: a methodical investigation of thylakoid proteome.

Mol Biol Rep. 2012 May;39(5):5069-83. doi: 10.1007/s11033-011-1302-4. Epub 2011 Dec 11.

A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.

J Proteomics. 2010 Oct 10;73(11):2092-123. doi: 10.1016/j.jprot.2010.08.009. Epub 2010 Sep 8.

Spectral dictionaries: Integrating de novo peptide sequencing with database search of tandem mass spectra.

Mol Cell Proteomics. 2009 Jan;8(1):53-69. doi: 10.1074/mcp.M800103-MCP200. Epub 2008 Aug 14.

本文引用的文献

An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.

J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.

Phosphorylation-specific MS/MS scoring for rapid and accurate phosphoproteome analysis.

J Proteome Res. 2008 Aug;7(8):3373-81. doi: 10.1021/pr800129m. Epub 2008 Jun 19.

Semisupervised model-based validation of peptide identifications in mass spectrometry-based proteomics.

J Proteome Res. 2008 Jan;7(1):254-65. doi: 10.1021/pr070542g. Epub 2007 Dec 27.

Statistical validation of peptide identifications in large-scale proteomics using the target-decoy database search strategy and flexible mixture modeling.

J Proteome Res. 2008 Jan;7(1):286-92. doi: 10.1021/pr7006818. Epub 2007 Dec 14.

Analysis and validation of proteomic data generated by tandem mass spectrometry.

Nat Methods. 2007 Oct;4(10):787-97. doi: 10.1038/nmeth1088.

Investigating MS2/MS3 matching statistics: a model for coupling consecutive stage mass spectrometry data for increased peptide identification confidence.

Mol Cell Proteomics. 2008 Jan;7(1):71-87. doi: 10.1074/mcp.M700128-MCP200. Epub 2007 Sep 13.

The standard protein mix database: a diverse data set to assist in the production of improved Peptide and protein identification software tools.

J Proteome Res. 2008 Jan;7(1):96-103. doi: 10.1021/pr070244j. Epub 2007 Aug 21.

The Paragon Algorithm, a next generation search engine that uses sequence temperature values and feature probabilities to identify peptides from tandem mass spectra.

Mol Cell Proteomics. 2007 Sep;6(9):1638-55. doi: 10.1074/mcp.T600050-MCP200. Epub 2007 May 27.

Modeling and characterization of multi-charge mass spectra for peptide sequencing.

J Bioinform Comput Biol. 2006 Dec;4(6):1329-52. doi: 10.1142/s021972000600248x.

Lookup peaks: a hybrid of de novo sequencing and database search for protein identification by tandem mass spectrometry.

Anal Chem. 2007 Feb 15;79(4):1393-400. doi: 10.1021/ac0617013. Epub 2007 Jan 23.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于串联质谱中肽段鉴定的改进序列标签生成方法。

Improved sequence tag generation method for peptide identification in tandem mass spectrometry.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献