从贝叶斯观点看蛋白质鉴定问题。

Protein identification problem from a Bayesian point of view.

作者信息

Li Yong Fuga, Arnold Randy J, Radivojac Predrag, Tang Haixu

机构信息

School of Informatics and Computing, Indiana University, Bloomington, IN 47405, USA.

Department of Chemistry, Indiana University, Bloomington, IN 47406, USA.

出版信息

Stat Interface. 2012 Jan 1;5(1):21-37. doi: 10.4310/SII.2012.v5.n1.a3.

DOI:10.4310/SII.2012.v5.n1.a3

PMID:24761189

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3992622/

Abstract

We present a generic Bayesian framework for the peptide and protein identification in proteomics, and provide a unified interpretation for the database searching and the peptide sequencing approaches that are used in peptide identification. We describe several probabilistic graphical models and a variety of prior distributions that can be incorporated into the Bayesian framework to model different types of prior information, such as the known protein sequences, the known protein abundances, the peptide precursor masses, the estimated peptide retention time and the peptide detectabilities. Various applications of the Bayesian framework are discussed theoretically, including its application to the identification of peptides containing mutations and post-translational modifications.

摘要

我们提出了一种用于蛋白质组学中肽和蛋白质鉴定的通用贝叶斯框架，并对肽鉴定中使用的数据库搜索和肽测序方法提供了统一的解释。我们描述了几种概率图形模型和各种先验分布，这些可以纳入贝叶斯框架以对不同类型的先验信息进行建模，例如已知的蛋白质序列、已知的蛋白质丰度、肽前体质量、估计的肽保留时间和肽可检测性。从理论上讨论了贝叶斯框架的各种应用，包括其在鉴定含有突变和翻译后修饰的肽方面的应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc9d/3992622/370c8c19d037/nihms369618f1.jpg

相似文献

Protein identification problem from a Bayesian point of view.从贝叶斯观点看蛋白质鉴定问题。

Stat Interface. 2012 Jan 1;5(1):21-37. doi: 10.4310/SII.2012.v5.n1.a3.

A bayesian approach to protein inference problem in shotgun proteomics.一种用于鸟枪法蛋白质组学中蛋白质推断问题的贝叶斯方法。

J Comput Biol. 2009 Aug;16(8):1183-93. doi: 10.1089/cmb.2009.0018.

Evaluation of an integrative Bayesian peptide detection approach on a combinatorial peptide library.

Eur J Mass Spectrom (Chichester). 2021 Dec;27(6):217-234. doi: 10.1177/14690667211066725. Epub 2022 Jan 6.

NovoHMM: a hidden Markov model for de novo peptide sequencing.NovoHMM：一种用于从头肽测序的隐马尔可夫模型。

Anal Chem. 2005 Nov 15;77(22):7265-73. doi: 10.1021/ac0508853.

Flying blind, or just flying under the radar? The underappreciated power of de novo methods of mass spectrometric peptide identification.盲目飞行，还是只是在雷达下飞行？从头开始的质谱肽鉴定方法的未被充分认识的威力。

Protein Sci. 2020 Sep;29(9):1864-1878. doi: 10.1002/pro.3919. Epub 2020 Aug 17.

sequencing of proteins by mass spectrometry.质谱法对蛋白质进行测序。

Expert Rev Proteomics. 2020 Jul-Aug;17(7-8):595-607. doi: 10.1080/14789450.2020.1831387. Epub 2020 Oct 21.

Database Creator for Mass Analysis of Peptides and Proteins, DC-MAPP: A Standalone Tool for Simplifying Manual Analysis of Mass Spectral Data to Identify Peptide/Protein Sequences.数据库创建工具用于肽和蛋白质的质量分析，简称 DC-MAPP：一种简化手动分析质谱数据以识别肽/蛋白质序列的独立工具。

J Am Soc Mass Spectrom. 2023 Sep 6;34(9):1962-1969. doi: 10.1021/jasms.3c00030. Epub 2023 Aug 1.

Shotgun protein identification and quantification by mass spectrometry.通过质谱法进行鸟枪法蛋白质鉴定和定量分析。

Methods Mol Biol. 2009;564:261-88. doi: 10.1007/978-1-60761-157-8_15.

Shotgun protein identification and quantification by mass spectrometry in neuroproteomics.神经蛋白质组学中通过质谱法进行鸟枪法蛋白质鉴定和定量分析

Methods Mol Biol. 2009;566:229-59. doi: 10.1007/978-1-59745-562-6_16.

Improved de novo peptide sequencing using LC retention time information.利用液相色谱保留时间信息改进从头肽测序

Algorithms Mol Biol. 2018 Aug 29;13:14. doi: 10.1186/s13015-018-0132-5. eCollection 2018.

引用本文的文献

New mixture models for decoy-free false discovery rate estimation in mass spectrometry proteomics.无诱饵的质谱蛋白质组学中假发现率估计的新混合模型。

Bioinformatics. 2020 Dec 30;36(Suppl_2):i745-i753. doi: 10.1093/bioinformatics/btaa807.

Constrained Sequencing of neo-Epitope Peptides using Tandem Mass Spectrometry.使用串联质谱法对新表位肽进行受限测序

Res Comput Mol Biol. 2018;10812:138-153. doi: 10.1007/978-3-319-89929-9_9. Epub 2018 Apr 18.

The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.概率卷积树：用于更快的液相色谱-串联质谱蛋白质推断的高效精确贝叶斯推理

PLoS One. 2014 Mar 13;9(3):e91507. doi: 10.1371/journal.pone.0091507. eCollection 2014.

Computational approaches to protein inference in shotgun proteomics.基于高通量蛋白质组学的蛋白质推断的计算方法。

BMC Bioinformatics. 2012;13 Suppl 16(Suppl 16):S4. doi: 10.1186/1471-2105-13-S16-S4. Epub 2012 Nov 5.

本文引用的文献

The importance of peptide detectability for protein identification, quantification, and experiment design in MS/MS proteomics.肽段可检测性对 MS/MS 蛋白质组学中蛋白质鉴定、定量和实验设计的重要性。

J Proteome Res. 2010 Dec 3;9(12):6288-97. doi: 10.1021/pr1005586. Epub 2010 Nov 10.

Efficient marginalization to compute protein posterior probabilities from shotgun mass spectrometry data.从鸟枪法质谱数据中计算蛋白质后验概率的有效边缘化。

J Proteome Res. 2010 Oct 1;9(10):5346-57. doi: 10.1021/pr100594k.

Combinatorial libraries of synthetic peptides as a model for shotgun proteomics.合成肽组合文库作为鸟枪法蛋白质组学的模型。

Anal Chem. 2010 Aug 1;82(15):6559-68. doi: 10.1021/ac100910a.

Identification of tandem mass spectra of mixtures of isomeric peptides.混合异构体肽串联质谱的鉴定。

J Proteome Res. 2010 Jun 4;9(6):3270-9. doi: 10.1021/pr100205k.

Repeatability and reproducibility in proteomic identifications by liquid chromatography-tandem mass spectrometry.液相色谱-串联质谱法在蛋白质组学鉴定中的可重复性和可再现性。

J Proteome Res. 2010 Feb 5;9(2):761-76. doi: 10.1021/pr9006365.

The Universal Protein Resource (UniProt) in 2010.2010 年的通用蛋白质资源（UniProt）。

Nucleic Acids Res. 2010 Jan;38(Database issue):D142-8. doi: 10.1093/nar/gkp846. Epub 2009 Oct 20.

A guide to the Proteomics Identifications Database proteomics data repository.蛋白质组学鉴定数据库蛋白质组学数据储存库指南。

Proteomics. 2009 Sep;9(18):4276-83. doi: 10.1002/pmic.200900402.

A bayesian approach to protein inference problem in shotgun proteomics.一种用于鸟枪法蛋白质组学中蛋白质推断问题的贝叶斯方法。

J Comput Biol. 2009 Aug;16(8):1183-93. doi: 10.1089/cmb.2009.0018.

Protein identification false discovery rates for very large proteomics data sets generated by tandem mass spectrometry.串联质谱产生的超大蛋白质组学数据集的蛋白质鉴定假发现率。

Mol Cell Proteomics. 2009 Nov;8(11):2405-17. doi: 10.1074/mcp.M900317-MCP200. Epub 2009 Jul 16.

IDPicker 2.0: Improved protein assembly with high discrimination peptide identification filtering.IDPicker 2.0：通过高分辨率肽段鉴定筛选实现蛋白质组装的改进

J Proteome Res. 2009 Aug;8(8):3872-81. doi: 10.1021/pr900360j.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验