Probability-based validation of protein identifications using a modified SEQUEST algorithm.

Authors

MacCoss Michael J, Wu Christine C, Yates John R

Affiliation

Department of Cell Biology, The Scripps Research Institute, La Jolla, California 92037, USA.

Publication information

Anal Chem. 2002 Nov 1;74(21):5593-9. doi: 10.1021/ac025826t.

DOI:10.1021/ac025826t

PMID:12433093

Abstract

Database-searching algorithms compatible with shotgun proteomics match a peptide tandem mass spectrum to a predicted mass spectrum for an amino acid sequence within a database. SEQUEST is one of the most common software algorithms used for the analysis of peptide tandem mass spectra by using a cross-correlation (XCorr) scoring routine to match tandem mass spectra to model spectra derived from peptide sequences. To assess a match, SEQUEST uses the difference between the first- and second-ranked sequences (ΔCn). This value is dependent on the database size, search parameters, and sequence homologies. In this report, we demonstrate the use of a scoring routine (SEQUEST-NORM) that normalizes XCorr values to be independent of peptide size and the database used to perform the search. This new scoring routine is used to objectively calculate the percent confidence of protein identifications and posttranslational modifications based solely on the XCorr value.
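
As an illustration of the first- versus second-ranked difference described in the abstract, the following is a minimal Python sketch. It assumes the commonly used normalized form, (XCorr1 - XCorr2) / XCorr1, which the abstract itself does not spell out. The function name `delta_cn` and the example scores are hypothetical. The sketch does not reproduce the SEQUEST-NORM normalization, whose details are not given in the abstract.

```python
def delta_cn(xcorrs):
    """Normalized gap between the two best XCorr scores for one spectrum.

    Assumption: uses (best - second) / best, a commonly used form of the
    first- vs second-ranked difference; not taken from the abstract.
    """
    ranked = sorted(xcorrs, reverse=True)
    if len(ranked) < 2:
        raise ValueError("need at least two candidate scores")
    best, second = ranked[0], ranked[1]
    if best <= 0:
        raise ValueError("top XCorr must be positive")
    return (best - second) / best


# Hypothetical XCorr scores for three candidate peptides of one spectrum.
scores = [2.0, 4.0, 3.0]
print(delta_cn(scores))  # (4.0 - 3.0) / 4.0 = 0.25
```

Because the value depends on which second-ranked candidate the database happens to contain, a closely homologous sequence in the database can shrink it even when the top match is correct, which is the dependence on database content that the abstract points out.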

Similar articles

ProLuCID: An improved SEQUEST-like algorithm with enhanced sensitivity and specificity.

J Proteomics. 2015 Nov 3;129:16-24. doi: 10.1016/j.jprot.2015.07.001. Epub 2015 Jul 11.

Optimization of filtering criterion for SEQUEST database searching to improve proteome coverage in shotgun proteomics.

BMC Bioinformatics. 2007 Aug 31;8:323. doi: 10.1186/1471-2105-8-323.

Added value for tandem mass spectrometry shotgun proteomics data validation through isoelectric focusing of peptides.

J Proteome Res. 2005 Nov-Dec;4(6):2273-82. doi: 10.1021/pr050193v.

Qscore: an algorithm for evaluating SEQUEST database search results.

J Am Soc Mass Spectrom. 2002 Apr;13(4):378-86. doi: 10.1016/S1044-0305(02)00352-5.

Large-scale database searching using tandem mass spectra: looking up the answer in the back of the book.

Nat Methods. 2004 Dec;1(3):195-202. doi: 10.1038/nmeth725.

Comparative evaluation of tandem MS search algorithms using a target-decoy search strategy.

Mol Cell Proteomics. 2007 Sep;6(9):1599-608. doi: 10.1074/mcp.M600469-MCP200. Epub 2007 May 28.

Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database.

Anal Chem. 1995 Apr 15;67(8):1426-36. doi: 10.1021/ac00104a020.

MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis.

J Proteome Res. 2007 Feb;6(2):654-61. doi: 10.1021/pr0604054.

General framework for developing and evaluating database scoring algorithms using the TANDEM search engine.

Bioinformatics. 2006 Nov 15;22(22):2830-2. doi: 10.1093/bioinformatics/btl379. Epub 2006 Jul 28.

Cited by

Phages ZC01 and ZC03 require type-IV pilus for infection and have a potential for therapeutic applications.

Microbiol Spectr. 2024 Oct 29;12(12):e0152724. doi: 10.1128/spectrum.01527-24.

Proteomic Insights into Osteoporosis: Unraveling Diagnostic Markers of and Therapeutic Targets for the Metabolic Bone Disease.

Biomolecules. 2024 May 4;14(5):554. doi: 10.3390/biom14050554.

Python workflow for the selection and identification of marker peptides-proof-of-principle study with heated milk.

Anal Bioanal Chem. 2024 Jun;416(14):3349-3360. doi: 10.1007/s00216-024-05286-w. Epub 2024 Apr 12.

Elevated ITGA1 levels in type 2 diabetes: implications for cardiac function impairment.

Diabetologia. 2024 May;67(5):850-863. doi: 10.1007/s00125-024-06109-4. Epub 2024 Feb 27.

DeGlyPHER: Highly sensitive site-specific analysis of N-linked glycans on proteins.

Methods Enzymol. 2023;682:137-185. doi: 10.1016/bs.mie.2022.09.004. Epub 2022 Dec 26.

Non-canonical role of wild-type SEC23B in the cellular stress response pathway.

Cell Death Dis. 2021 Mar 22;12(4):304. doi: 10.1038/s41419-021-03589-9.

Integrated Glycoproteomics Identifies a Role of N-Glycosylation and Galectin-1 on Myogenesis and Muscle Development.

Mol Cell Proteomics. 2021;20:100030. doi: 10.1074/mcp.RA120.002166. Epub 2020 Dec 19.

A Small Membrane Stabilizing Protein Critical to the Pathogenicity of Staphylococcus aureus.

Infect Immun. 2020 Aug 19;88(9). doi: 10.1128/IAI.00162-20.

A Triple Knockout Isobaric-Labeling Quality Control Platform with an Integrated Online Database Search.

J Am Soc Mass Spectrom. 2020 Jul 1;31(7):1344-1349. doi: 10.1021/jasms.0c00029. Epub 2020 Mar 27.

Single-cell proteomics in complex tissues using microprobe capillary electrophoresis mass spectrometry.

Methods Enzymol. 2019;628:263-292. doi: 10.1016/bs.mie.2019.07.001. Epub 2019 Aug 1.
