A Multivariate Mixture Model to Estimate the Accuracy of Glycosaminoglycan Identifications Made by Tandem Mass Spectrometry (MS/MS) and Database Search.

Author information

Chiu Yulun, Schliekelman Paul, Orlando Ron, Sharp Joshua S

Affiliations

Complex Carbohydrate Research Center.

Institute of Bioinformatics.

Publication information

Mol Cell Proteomics. 2017 Feb;16(2):255-264. doi: 10.1074/mcp.M116.062588. Epub 2016 Dec 9.

Abstract

We present a statistical model to estimate the accuracy of derivatized heparin and heparan sulfate (HS) glycosaminoglycan (GAG) assignments to tandem mass (MS/MS) spectra made by the first published database search application, GAG-ID. Employing a multivariate expectation-maximization algorithm, this statistical model distinguishes correct from ambiguous and incorrect database search results when computing the probability that heparin/HS GAG assignments to spectra are correct based upon database search scores. Using GAG-ID search results for spectra generated from a defined mixture of 21 synthesized tetrasaccharide sequences as well as seven spectra of longer defined oligosaccharides, we demonstrate that the computed probabilities are accurate and have high power to discriminate between correctly, ambiguously, and incorrectly assigned heparin/HS GAGs. This analysis makes it possible to filter large MS/MS database search results with predictable false identification error rates.
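
The idea in the abstract can be illustrated with a minimal sketch: fit a three-class mixture (incorrect, ambiguous, correct) to multivariate search scores with expectation-maximization, read off the probability that each assignment is correct, and accept assignments while the estimated false identification rate stays under a target. This is not the authors' implementation. The Gaussian components with diagonal covariance, the two synthetic score dimensions, the starting means, and the names `fit_mixture`, `posteriors`, and `accept_at_error_rate` are all assumptions made for illustration; the abstract does not specify the component distributions or the GAG-ID score features.

```python
import math
import random

def log_density(x, mean, var):
    # Log of a Gaussian density with diagonal covariance (an assumption made here).
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))

def posteriors(x, weights, means, variances):
    # E-step for one spectrum: probability of each class given its score vector.
    logs = [math.log(w) + log_density(x, m, v)
            for w, m, v in zip(weights, means, variances)]
    top = max(logs)  # subtract the maximum for numerical stability
    unnorm = [math.exp(value - top) for value in logs]
    total = sum(unnorm)
    return [u / total for u in unnorm]

def fit_mixture(scores, init_means, n_iter=50, min_var=1e-3):
    # EM for a mixture with one component per class.
    k, d, n = len(init_means), len(scores[0]), len(scores)
    weights = [1.0 / k] * k
    means = [list(m) for m in init_means]
    variances = [[1.0] * d for _ in range(k)]
    for _ in range(n_iter):
        resp = [posteriors(x, weights, means, variances) for x in scores]
        for j in range(k):  # M-step: re-estimate each component from soft counts
            nj = sum(r[j] for r in resp) + 1e-12  # guard against empty components
            weights[j] = nj / n
            means[j] = [sum(r[j] * x[i] for r, x in zip(resp, scores)) / nj
                        for i in range(d)]
            variances[j] = [max(min_var, sum(r[j] * (x[i] - means[j][i]) ** 2
                                             for r, x in zip(resp, scores)) / nj)
                            for i in range(d)]
    return weights, means, variances

def accept_at_error_rate(probs, max_error):
    # Accept assignments from most to least probable while the estimated
    # false identification rate (mean of 1 - p over accepted) stays <= max_error.
    order = sorted(range(len(probs)), key=lambda i: -probs[i])
    accepted, err_sum = [], 0.0
    for i in order:
        if (err_sum + 1 - probs[i]) / (len(accepted) + 1) > max_error:
            break
        err_sum += 1 - probs[i]
        accepted.append(i)
    return accepted

# Synthetic two-dimensional scores (hypothetical, not GAG-ID output):
# class 0 = incorrect, class 1 = ambiguous, class 2 = correct.
random.seed(0)
centers = [(0.0, 0.0), (4.0, 4.0), (8.0, 8.0)]
scores = [[random.gauss(cx, 1.0), random.gauss(cy, 1.0)]
          for cx, cy in centers for _ in range(100)]
weights, means, variances = fit_mixture(scores, [[1, 1], [4, 4], [7, 7]])
p_correct = [posteriors(x, weights, means, variances)[2] for x in scores]
accepted = accept_at_error_rate(p_correct, max_error=0.01)
print(len(accepted), "assignments accepted at an estimated 1% error rate")
```

Because the accepted set is built in order of decreasing probability, the running mean of `1 - p` never decreases, so stopping at the first violation gives the largest set that meets the target under the fitted model. How well that estimate matches the true error rate depends on how well the assumed component distributions fit real score data.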
