一种从聚类串联质谱数据中鉴定肽段的统计方法。

A statistical approach to peptide identification from clustered tandem mass spectrometry data.

作者信息

Ryu Soyoung, Goodlett David R, Noble William S, Minin Vladimir N

机构信息

Department of Statistics, University of Washington, Seattle, WA, USA,

出版信息

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2012 Oct 4:648-653. doi: 10.1109/BIBMW.2012.6470214.

DOI:10.1109/BIBMW.2012.6470214

PMID:23828149

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3698614/

Abstract

Tandem mass spectrometry experiments generate from thousands to millions of spectra. These spectra can be used to identify the presence of proteins in biological samples. In this work, we propose a new method to identify peptides, substrings of proteins, based on clustered tandem mass spectrometry data. In contrast to previously proposed approaches, which identify one representative spectrum for each cluster using traditional database searching algorithms, our method uses all available information to score all the spectra in a cluster against candidate peptides using Bayesian model selection. We illustrate the performance of our method by applying it to seven-standard-protein mixture data.

摘要

串联质谱实验会产生数千到数百万个光谱。这些光谱可用于识别生物样品中蛋白质的存在。在这项工作中，我们提出了一种基于聚类串联质谱数据来识别肽段（蛋白质的子串）的新方法。与先前提出的方法不同，那些方法使用传统数据库搜索算法为每个聚类识别一个代表性光谱，而我们的方法利用所有可用信息，使用贝叶斯模型选择针对候选肽段对聚类中的所有光谱进行评分。我们通过将该方法应用于七标准蛋白质混合物数据来说明其性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/130a/3698614/5c99cb9937ea/nihms451422f1.jpg

相似文献

A statistical approach to peptide identification from clustered tandem mass spectrometry data.一种从聚类串联质谱数据中鉴定肽段的统计方法。

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2012 Oct 4:648-653. doi: 10.1109/BIBMW.2012.6470214.

Faster and more accurate graphical model identification of tandem mass spectra using trellises.使用格架对串联质谱进行更快、更准确的图形模型识别。

Bioinformatics. 2016 Jun 15;32(12):i322-i331. doi: 10.1093/bioinformatics/btw269.

Comparative database search engine analysis on massive tandem mass spectra of pork-based food products for halal proteomics.基于猪肉的食品清真蛋白质组学大规模串联质谱的比较数据库搜索引擎分析

J Proteomics. 2021 Jun 15;241:104240. doi: 10.1016/j.jprot.2021.104240. Epub 2021 Apr 21.

FPTMS: Frequency-based approach to identify the peptide from the low-energy collision-induced dissociation tandem mass spectra.FPTMS：基于频率的方法，用于从低能量碰撞诱导解离串联质谱中鉴定肽。

J Proteomics. 2021 Mar 20;235:104116. doi: 10.1016/j.jprot.2021.104116. Epub 2021 Jan 13.

Peptide identification by database search of mixture tandem mass spectra.通过混合串联质谱数据库搜索进行肽鉴定。

Mol Cell Proteomics. 2011 Dec;10(12):M111.010017. doi: 10.1074/mcp.M111.010017. Epub 2011 Aug 23.

Peptide reranking with protein-peptide correspondence and precursor peak intensity information.利用蛋白-肽对应关系和前体峰强度信息对肽进行重新排序。

IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1212-9. doi: 10.1109/TCBB.2012.29.

Clustering millions of tandem mass spectra.对数百万个串联质谱进行聚类。

J Proteome Res. 2008 Jan;7(1):113-22. doi: 10.1021/pr070361e. Epub 2007 Dec 8.

msCRUSH: Fast Tandem Mass Spectral Clustering Using Locality Sensitive Hashing.msCRUSH：基于局部敏感哈希的快速串联质谱聚类。

J Proteome Res. 2019 Jan 4;18(1):147-158. doi: 10.1021/acs.jproteome.8b00448. Epub 2018 Dec 14.

Interpretation of Tandem Mass Spectra of Posttranslationally Modified Peptides.翻译后修饰肽段的串联质谱解析

Methods Mol Biol. 2020;2051:199-230. doi: 10.1007/978-1-4939-9744-2_8.

A suffix tree approach to the interpretation of tandem mass spectra: applications to peptides of non-specific digestion and post-translational modifications.一种用于串联质谱解释的后缀树方法：应用于非特异性消化和翻译后修饰的肽段

Bioinformatics. 2003 Oct;19 Suppl 2:ii113-21. doi: 10.1093/bioinformatics/btg1068.

引用本文的文献

Insight on physicochemical properties governing peptide MS1 response in HPLC-ESI-MS/MS: A deep learning approach.关于HPLC-ESI-MS/MS中控制肽段MS1响应的物理化学性质的见解：一种深度学习方法。

Comput Struct Biotechnol J. 2023 Jul 22;21:3715-3727. doi: 10.1016/j.csbj.2023.07.027. eCollection 2023.

JUMP: a tag-based database search tool for peptide identification with high sensitivity and accuracy.JUMP：一种用于肽段鉴定的基于标签的数据库搜索工具，具有高灵敏度和准确性。

Mol Cell Proteomics. 2014 Dec;13(12):3663-73. doi: 10.1074/mcp.O114.039586. Epub 2014 Sep 8.

本文引用的文献

An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.一种将肽的串联质谱数据与蛋白质数据库中氨基酸序列相关联的方法。

J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.

Precursor acquisition independent from ion count: how to dive deeper into the proteomics ocean.前体获取与离子计数无关：如何深入探索蛋白质组学海洋。

Anal Chem. 2009 Aug 1;81(15):6481-8. doi: 10.1021/ac900888s.

Comparison of a label-free quantitative proteomic method based on peptide ion current area to the isotope coded affinity tag method.基于肽离子电流面积的无标记定量蛋白质组学方法与同位素编码亲和标签法的比较。

Cancer Inform. 2008;6:243-55. doi: 10.4137/cin.s385. Epub 2008 Apr 17.

Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification.使用动态贝叶斯网络对肽段进行建模以用于肽段鉴定

Bioinformatics. 2008 Jul 1;24(13):i348-56. doi: 10.1093/bioinformatics/btn189.

Semisupervised model-based validation of peptide identifications in mass spectrometry-based proteomics.基于半监督模型的质谱蛋白质组学中肽段鉴定的验证

J Proteome Res. 2008 Jan;7(1):254-65. doi: 10.1021/pr070542g. Epub 2007 Dec 27.

Clustering millions of tandem mass spectra.对数百万个串联质谱进行聚类。

J Proteome Res. 2008 Jan;7(1):113-22. doi: 10.1021/pr070361e. Epub 2007 Dec 8.

Assigning significance to peptides identified by tandem mass spectrometry using decoy databases.使用诱饵数据库对通过串联质谱鉴定的肽段赋予显著性。

J Proteome Res. 2008 Jan;7(1):29-34. doi: 10.1021/pr700600n. Epub 2007 Dec 8.

Semi-supervised learning for peptide identification from shotgun proteomics datasets.基于鸟枪法蛋白质组学数据集的肽段鉴定的半监督学习

Nat Methods. 2007 Nov;4(11):923-5. doi: 10.1038/nmeth1113. Epub 2007 Oct 21.

A computational approach toward label-free protein quantification using predicted peptide detectability.一种使用预测肽可检测性进行无标记蛋白质定量的计算方法。

Bioinformatics. 2006 Jul 15;22(14):e481-8. doi: 10.1093/bioinformatics/btl237.

Estimation and control of multiple testing error rates for microarray studies.微阵列研究中多重检验错误率的估计与控制。

Brief Bioinform. 2006 Mar;7(1):25-36. doi: 10.1093/bib/bbk002.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验