Suppr超能文献

用于串联质谱中肽段鉴定的改进序列标签生成方法。

Improved sequence tag generation method for peptide identification in tandem mass spectrometry.

作者信息

Cao Xia, Nesvizhskii Alexey I

机构信息

Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, USA.

出版信息

J Proteome Res. 2008 Oct;7(10):4422-34. doi: 10.1021/pr800400q. Epub 2008 Sep 12.

Abstract

The sequence tag-based peptide identification methods are a promising alternative to the traditional database search approach. However, a more comprehensive analysis, optimization, and comparison with established methods are necessary before these methods can gain widespread use in the proteomics community. Using the InsPecT open source code base ( Tanner et al., Anal. Chem. 2005, 77, 4626- 39 ), we present an improved sequence tag generation method that directly incorporates multicharged fragment ion peaks present in many tandem mass spectra of higher charge states. We also investigate the performance of sequence tagging under different settings using control data sets generated on five different types of mass spectrometers, as well as using a complex phosphopeptide-enriched sample. We also demonstrate that additional modeling of InsPecT search scores using a semiparametric approach incorporating the accuracy of the precursor ion mass measurement provides additional improvement in the ability to discriminate between correct and incorrect peptide identifications. The overall superior performance of the sequence tag-based peptide identification method is demonstrated by comparison with a commonly used SEQUEST/PeptideProphet approach.

摘要

基于序列标签的肽段鉴定方法是传统数据库搜索方法的一种有前景的替代方法。然而,在这些方法能够在蛋白质组学界广泛应用之前,需要进行更全面的分析、优化以及与现有方法的比较。利用InsPecT开源代码库(Tanner等人,《分析化学》,2005年,77卷,4626 - 39页),我们提出了一种改进的序列标签生成方法,该方法直接纳入了许多更高电荷态串联质谱中存在的多电荷碎片离子峰。我们还使用在五种不同类型质谱仪上生成的对照数据集以及使用复杂的富含磷酸肽的样品,研究了在不同设置下序列标签的性能。我们还证明,使用结合前体离子质量测量准确性的半参数方法对InsPecT搜索分数进行额外建模,在区分正确和错误肽段鉴定的能力方面提供了额外的改进。通过与常用的SEQUEST/PeptideProphet方法比较,证明了基于序列标签的肽段鉴定方法的总体优越性能。

相似文献

1
Improved sequence tag generation method for peptide identification in tandem mass spectrometry.
J Proteome Res. 2008 Oct;7(10):4422-34. doi: 10.1021/pr800400q. Epub 2008 Sep 12.
4
SQID: an intensity-incorporated protein identification algorithm for tandem mass spectrometry.
J Proteome Res. 2011 Apr 1;10(4):1593-602. doi: 10.1021/pr100959y. Epub 2011 Feb 23.
9
MAZIE: a mass and charge inference engine to enhance database searching of tandem mass spectra.
J Am Soc Mass Spectrom. 2010 Jan;21(1):80-7. doi: 10.1016/j.jasms.2009.09.007. Epub 2009 Sep 17.
10
Using SEQUEST with theoretically complete sequence databases.
J Am Soc Mass Spectrom. 2015 Nov;26(11):1858-64. doi: 10.1007/s13361-015-1228-5. Epub 2015 Aug 4.

引用本文的文献

1
Validation of De Novo Peptide Sequences with Bottom-Up Tag Convolution.
Proteomes. 2021 Dec 29;10(1):1. doi: 10.3390/proteomes10010001.
2
A graph-based filtering method for top-down mass spectral identification.
BMC Genomics. 2018 Sep 24;19(Suppl 7):666. doi: 10.1186/s12864-018-5026-x.
3
A Spectrum Graph-Based Protein Sequence Filtering Algorithm for Proteoform Identification by Top-Down Mass Spectrometry.
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2017 Nov;2017:222-229. doi: 10.1109/BIBM.2017.8217653. Epub 2017 Dec 18.
5
Protein analysis by shotgun/bottom-up proteomics.
Chem Rev. 2013 Apr 10;113(4):2343-94. doi: 10.1021/cr3003533. Epub 2013 Feb 26.
6
Speeding up tandem mass spectral identification using indexes.
Bioinformatics. 2012 Jul 1;28(13):1692-7. doi: 10.1093/bioinformatics/bts244. Epub 2012 Apr 27.
7
Towards an understanding of wheat chloroplasts: a methodical investigation of thylakoid proteome.
Mol Biol Rep. 2012 May;39(5):5069-83. doi: 10.1007/s11033-011-1302-4. Epub 2011 Dec 11.
8
A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.
J Proteomics. 2010 Oct 10;73(11):2092-123. doi: 10.1016/j.jprot.2010.08.009. Epub 2010 Sep 8.
9
Spectral dictionaries: Integrating de novo peptide sequencing with database search of tandem mass spectra.
Mol Cell Proteomics. 2009 Jan;8(1):53-69. doi: 10.1074/mcp.M800103-MCP200. Epub 2008 Aug 14.

本文引用的文献

1
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.
J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.
2
Phosphorylation-specific MS/MS scoring for rapid and accurate phosphoproteome analysis.
J Proteome Res. 2008 Aug;7(8):3373-81. doi: 10.1021/pr800129m. Epub 2008 Jun 19.
3
Semisupervised model-based validation of peptide identifications in mass spectrometry-based proteomics.
J Proteome Res. 2008 Jan;7(1):254-65. doi: 10.1021/pr070542g. Epub 2007 Dec 27.
5
Analysis and validation of proteomic data generated by tandem mass spectrometry.
Nat Methods. 2007 Oct;4(10):787-97. doi: 10.1038/nmeth1088.
9
Modeling and characterization of multi-charge mass spectra for peptide sequencing.
J Bioinform Comput Biol. 2006 Dec;4(6):1329-52. doi: 10.1142/s021972000600248x.
10
Lookup peaks: a hybrid of de novo sequencing and database search for protein identification by tandem mass spectrometry.
Anal Chem. 2007 Feb 15;79(4):1393-400. doi: 10.1021/ac0617013. Epub 2007 Jan 23.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验