Suppr超能文献

基于谱图的 LC-MS 数据对齐——一种贝叶斯方法。

Profile-Based LC-MS data alignment--a Bayesian approach.

机构信息

Department of Electrical and Computer Engineering,Virginia Tech, Washington, DC 20057, USA

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):494-503. doi: 10.1109/TCBB.2013.25.

Abstract

A Bayesian alignment model (BAM) is proposed for alignment of liquid chromatography-mass spectrometry (LC-MS) data. BAM belongs to the category of profile-based approaches, which are composed of two major components: a prototype function and a set of mapping functions. Appropriate estimation of these functions is crucial for good alignment results. BAM uses Markov chain Monte Carlo (MCMC) methods to draw inference on the model parameters and improves on existing MCMC-based alignment methods through 1) the implementation of an efficient MCMC sampler and 2) an adaptive selection of knots. A block Metropolis-Hastings algorithm that mitigates the problem of the MCMC sampler getting stuck at local modes of the posterior distribution is used for the update of the mapping function coefficients. In addition, a stochastic search variable selection (SSVS) methodology is used to determine the number and positions of knots. We applied BAM to a simulated data set, an LC-MS proteomic data set, and two LC-MS metabolomic data sets, and compared its performance with the Bayesian hierarchical curve registration (BHCR) model, the dynamic time-warping (DTW) model, and the continuous profile model (CPM). The advantage of applying appropriate profile-based retention time correction prior to performing a feature-based approach is also demonstrated through the metabolomic data sets.

摘要

提出了一种用于液相色谱-质谱 (LC-MS) 数据对齐的贝叶斯对齐模型 (BAM)。BAM 属于基于轮廓的方法类别,由两个主要组件组成:原型函数和一组映射函数。这些函数的适当估计对于良好的对齐结果至关重要。BAM 使用马尔可夫链蒙特卡罗 (MCMC) 方法对模型参数进行推断,并通过以下方式改进现有的基于 MCMC 的对齐方法:1)实现有效的 MCMC 采样器;2)自适应选择节点。使用块 Metropolis-Hastings 算法来更新映射函数系数,该算法缓解了 MCMC 采样器卡在后验分布局部模式的问题。此外,使用随机搜索变量选择 (SSVS) 方法来确定节点的数量和位置。我们将 BAM 应用于模拟数据集、LC-MS 蛋白质组学数据集和两个 LC-MS 代谢组学数据集,并将其性能与贝叶斯层次曲线注册 (BHCR) 模型、动态时间扭曲 (DTW) 模型和连续轮廓模型 (CPM) 进行了比较。通过代谢组学数据集还证明了在执行基于特征的方法之前应用适当的基于轮廓的保留时间校正的优势。

相似文献

1
Profile-Based LC-MS data alignment--a Bayesian approach.基于谱图的 LC-MS 数据对齐——一种贝叶斯方法。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):494-503. doi: 10.1109/TCBB.2013.25.
7
Searching for efficient Markov chain Monte Carlo proposal kernels.搜索高效的马尔可夫链蒙特卡罗提议核。
Proc Natl Acad Sci U S A. 2013 Nov 26;110(48):19307-12. doi: 10.1073/pnas.1311790110. Epub 2013 Nov 11.
9
Bayesian phylogeny analysis via stochastic approximation Monte Carlo.通过随机近似蒙特卡罗法进行贝叶斯系统发育分析。
Mol Phylogenet Evol. 2009 Nov;53(2):394-403. doi: 10.1016/j.ympev.2009.06.019. Epub 2009 Jul 7.
10
Probabilistic mixture regression models for alignment of LC-MS data.基于 LC-MS 数据对齐的概率混合回归模型。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1417-24. doi: 10.1109/TCBB.2010.88.

本文引用的文献

3
SIMA: simultaneous multiple alignment of LC/MS peak lists.SIMA:LC/MS 峰列表的同时多重比对。
Bioinformatics. 2011 Apr 1;27(7):987-93. doi: 10.1093/bioinformatics/btr051. Epub 2011 Feb 3.
4
Mass spectrometry and glycomics.质谱分析和糖组学。
OMICS. 2010 Aug;14(4):401-18. doi: 10.1089/omi.2009.0146.
6
Precision proteomics: the case for high resolution and high mass accuracy.精准蛋白质组学:高分辨率和高质量精度的实例
Proc Natl Acad Sci U S A. 2008 Nov 25;105(47):18132-8. doi: 10.1073/pnas.0800788105. Epub 2008 Sep 25.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验