Profile-Based LC-MS data alignment--a Bayesian approach.

Affiliation

Department of Electrical and Computer Engineering, Virginia Tech, Washington, DC 20057, USA

Publication information

IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):494-503. doi: 10.1109/TCBB.2013.25.

DOI: 10.1109/TCBB.2013.25

PMID: 23929872

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3993096/

Abstract

A Bayesian alignment model (BAM) is proposed for alignment of liquid chromatography-mass spectrometry (LC-MS) data. BAM belongs to the category of profile-based approaches, which are composed of two major components: a prototype function and a set of mapping functions. Appropriate estimation of these functions is crucial for good alignment results. BAM uses Markov chain Monte Carlo (MCMC) methods to draw inference on the model parameters and improves on existing MCMC-based alignment methods through 1) the implementation of an efficient MCMC sampler and 2) an adaptive selection of knots. A block Metropolis-Hastings algorithm that mitigates the problem of the MCMC sampler getting stuck at local modes of the posterior distribution is used for the update of the mapping function coefficients. In addition, a stochastic search variable selection (SSVS) methodology is used to determine the number and positions of knots. We applied BAM to a simulated data set, an LC-MS proteomic data set, and two LC-MS metabolomic data sets, and compared its performance with the Bayesian hierarchical curve registration (BHCR) model, the dynamic time-warping (DTW) model, and the continuous profile model (CPM). The advantage of applying appropriate profile-based retention time correction prior to performing a feature-based approach is also demonstrated through the metabolomic data sets.
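The two components named in the abstract, a prototype function and a mapping function updated by block Metropolis-Hastings, can be illustrated with a minimal sketch. This is not the BAM implementation. It assumes a known two-peak prototype, a piecewise-linear mapping in place of splines, a fixed set of knots (no SSVS step), a Gaussian likelihood with known noise level, and a random-walk proposal applied to small blocks of coefficients. All function names and parameter values are hypothetical.

```python
import numpy as np

def prototype(t):
    # Hypothetical prototype function: two Gaussian peaks on a 0-100 retention-time axis.
    return (np.exp(-0.5 * ((t - 30.0) / 2.0) ** 2)
            + 0.6 * np.exp(-0.5 * ((t - 70.0) / 3.0) ** 2))

def mapping(t, knots, coef):
    # Piecewise-linear mapping function: coef[k] is the mapped time at knots[k].
    return np.interp(t, knots, coef)

def log_posterior(coef, t, y, knots, sigma=0.05, tau=5.0):
    # Assumption: a valid mapping is strictly increasing (elution order is preserved).
    if np.any(np.diff(coef) <= 0):
        return -np.inf
    resid = y - prototype(mapping(t, knots, coef))
    log_lik = -0.5 * np.sum(resid ** 2) / sigma ** 2
    # Gaussian prior that shrinks the mapping toward the identity (coef == knots).
    log_prior = -0.5 * np.sum((coef - knots) ** 2) / tau ** 2
    return log_lik + log_prior

def block_mh(t, y, knots, n_iter=3000, block_size=2, step=0.1, seed=0):
    # Block Metropolis-Hastings: coefficients are proposed and accepted
    # jointly in blocks of `block_size` rather than one at a time.
    rng = np.random.default_rng(seed)
    coef = knots.astype(float).copy()  # start from the identity mapping
    logp = log_posterior(coef, t, y, knots)
    samples = np.empty((n_iter, len(knots)))
    for it in range(n_iter):
        for start in range(0, len(knots), block_size):
            block = slice(start, start + block_size)
            prop = coef.copy()
            prop[block] += step * rng.standard_normal(prop[block].shape)
            logp_prop = log_posterior(prop, t, y, knots)
            # Symmetric proposal, so the acceptance ratio is the posterior ratio.
            if np.log(rng.uniform()) < logp_prop - logp:
                coef, logp = prop, logp_prop
        samples[it] = coef
    return samples
```

To try it, simulate a chromatogram as `prototype(mapping(t, knots, true_coef))` plus Gaussian noise, run `block_mh`, and average the samples after a burn-in period to estimate the mapping. In this sketch the knots are supplied by the caller; the abstract states that BAM instead selects their number and positions with SSVS.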

Similar articles

Profile-Based LC-MS data alignment--a Bayesian approach.

IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):494-503. doi: 10.1109/TCBB.2013.25.

Time alignment algorithms based on selected mass traces for complex LC-MS data.

J Proteome Res. 2010 Mar 5;9(3):1483-95. doi: 10.1021/pr9010124.

A Bayesian based functional mixed-effects model for analysis of LC-MS data.

Annu Int Conf IEEE Eng Med Biol Soc. 2009;2009:6743-6. doi: 10.1109/IEMBS.2009.5332859.

Bayesian adaptive Markov chain Monte Carlo estimation of genetic parameters.

Heredity (Edinb). 2012 Oct;109(4):235-45. doi: 10.1038/hdy.2012.35. Epub 2012 Jul 18.

Multi-profile Bayesian alignment model for LC-MS data analysis with integration of internal standards.

Bioinformatics. 2013 Nov 1;29(21):2774-80. doi: 10.1093/bioinformatics/btt461. Epub 2013 Sep 6.

Bayesian coestimation of phylogeny and sequence alignment.

BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.

Searching for efficient Markov chain Monte Carlo proposal kernels.

Proc Natl Acad Sci U S A. 2013 Nov 26;110(48):19307-12. doi: 10.1073/pnas.1311790110. Epub 2013 Nov 11.

Hierarchical Bayesian estimates of distributed MEG sources: theoretical aspects and comparison of variational and MCMC methods.

Neuroimage. 2007 Apr 1;35(2):669-85. doi: 10.1016/j.neuroimage.2006.05.001. Epub 2007 Feb 12.

Bayesian phylogeny analysis via stochastic approximation Monte Carlo.

Mol Phylogenet Evol. 2009 Nov;53(2):394-403. doi: 10.1016/j.ympev.2009.06.019. Epub 2009 Jul 7.

Probabilistic mixture regression models for alignment of LC-MS data.

IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1417-24. doi: 10.1109/TCBB.2010.88.

Cited by

Bayesian time-aligned factor analysis of paired multivariate time series.

J Mach Learn Res. 2021 Jan-Dec;22.

Mass Spectrometry-based Metabolomics in Translational Research.

Adv Exp Med Biol. 2021;1310:509-531. doi: 10.1007/978-981-33-6064-8_19.

Preprocessing and Analysis of LC-MS-Based Proteomic Data.

Methods Mol Biol. 2016;1362:63-76. doi: 10.1007/978-1-4939-3106-4_3.

Bayesian Normalization Model for Label-Free Quantitative Analysis by LC-MS.

IEEE/ACM Trans Comput Biol Bioinform. 2015 Jul-Aug;12(4):914-27. doi: 10.1109/TCBB.2014.2377723.

Multi-profile Bayesian alignment model for LC-MS data analysis with integration of internal standards.

Bioinformatics. 2013 Nov 1;29(21):2774-80. doi: 10.1093/bioinformatics/btt461. Epub 2013 Sep 6.
