Li Yong Fuga, Arnold Randy J, Radivojac Predrag, Tang Haixu
School of Informatics and Computing, Indiana University, Bloomington, IN 47405, USA.
Department of Chemistry, Indiana University, Bloomington, IN 47406, USA.
Stat Interface. 2012 Jan 1;5(1):21-37. doi: 10.4310/SII.2012.v5.n1.a3.
We present a generic Bayesian framework for the peptide and protein identification in proteomics, and provide a unified interpretation for the database searching and the peptide sequencing approaches that are used in peptide identification. We describe several probabilistic graphical models and a variety of prior distributions that can be incorporated into the Bayesian framework to model different types of prior information, such as the known protein sequences, the known protein abundances, the peptide precursor masses, the estimated peptide retention time and the peptide detectabilities. Various applications of the Bayesian framework are discussed theoretically, including its application to the identification of peptides containing mutations and post-translational modifications.
我们提出了一种用于蛋白质组学中肽和蛋白质鉴定的通用贝叶斯框架,并对肽鉴定中使用的数据库搜索和肽测序方法提供了统一的解释。我们描述了几种概率图形模型和各种先验分布,这些可以纳入贝叶斯框架以对不同类型的先验信息进行建模,例如已知的蛋白质序列、已知的蛋白质丰度、肽前体质量、估计的肽保留时间和肽可检测性。从理论上讨论了贝叶斯框架的各种应用,包括其在鉴定含有突变和翻译后修饰的肽方面的应用。