整合峰分组信息以对齐多个液相色谱-质谱数据集。

Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets.

机构信息

School of Computing Science, University of Glasgow, Glasgow, UK, School of Computing and Mathematical Sciences, Liverpool John Moores University, Merseyside, UK and Manchester Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology, University of Manchester, Manchester, UK.

出版信息

Bioinformatics. 2015 Jun 15;31(12):1999-2006. doi: 10.1093/bioinformatics/btv072. Epub 2015 Feb 2.

DOI:10.1093/bioinformatics/btv072

PMID:25649621

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4760236/

Abstract

MOTIVATION

The combination of liquid chromatography and mass spectrometry (LC/MS) has been widely used for large-scale comparative studies in systems biology, including proteomics, glycomics and metabolomics. In almost all experimental design, it is necessary to compare chromatograms across biological or technical replicates and across sample groups. Central to this is the peak alignment step, which is one of the most important but challenging preprocessing steps. Existing alignment tools do not take into account the structural dependencies between related peaks that coelute and are derived from the same metabolite or peptide. We propose a direct matching peak alignment method for LC/MS data that incorporates related peaks information (within each LC/MS run) and investigate its effect on alignment performance (across runs). The groupings of related peaks necessary for our method can be obtained from any peak clustering method and are built into a pair-wise peak similarity score function. The similarity score matrix produced is used by an approximation algorithm for the weighted matching problem to produce the actual alignment result.

RESULTS

We demonstrate that related peak information can improve alignment performance. The performance is evaluated on a set of benchmark datasets, where our method performs competitively compared to other popular alignment tools.

AVAILABILITY

The proposed alignment method has been implemented as a stand-alone application in Python, available for download at http://github.com/joewandy/peak-grouping-alignment.

摘要

动机

液相色谱和质谱联用（LC/MS）已广泛应用于系统生物学的大规模比较研究，包括蛋白质组学、糖组学和代谢组学。在几乎所有的实验设计中，都需要比较跨生物学或技术重复以及跨样本组的色谱图。这其中的核心是峰对齐步骤，这是最重要但最具挑战性的预处理步骤之一。现有的对齐工具没有考虑到共洗脱且源自同一代谢物或肽的相关峰之间的结构依赖性。我们提出了一种用于 LC/MS 数据的直接匹配峰对齐方法，该方法纳入了相关峰信息（在每个 LC/MS 运行中），并研究了其对对齐性能（跨运行）的影响。我们方法所需的相关峰分组可以从任何峰聚类方法获得，并构建成两两峰相似性得分函数。生成的相似性得分矩阵可由加权匹配问题的近似算法使用，以生成实际的对齐结果。

结果

我们证明了相关峰信息可以提高对齐性能。该方法在一组基准数据集上进行了评估，与其他流行的对齐工具相比，我们的方法具有竞争力。

可用性

所提出的对齐方法已作为独立的 Python 应用程序实现，可在 http://github.com/joewandy/peak-grouping-alignment 下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daa5/4760236/d08a33f29286/btv072f1p.jpg

相似文献

Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets.

Bioinformatics. 2015 Jun 15;31(12):1999-2006. doi: 10.1093/bioinformatics/btv072. Epub 2015 Feb 2.

Multi-profile Bayesian alignment model for LC-MS data analysis with integration of internal standards.

Bioinformatics. 2013 Nov 1;29(21):2774-80. doi: 10.1093/bioinformatics/btt461. Epub 2013 Sep 6.

G-Aligner: a graph-based feature alignment method for untargeted LC-MS-based metabolomics.

BMC Bioinformatics. 2023 Nov 14;24(1):431. doi: 10.1186/s12859-023-05525-4.

MultiAlign: a multiple LC-MS analysis tool for targeted omics analysis.

BMC Bioinformatics. 2013 Feb 12;14:49. doi: 10.1186/1471-2105-14-49.

Time alignment algorithms based on selected mass traces for complex LC-MS data.

J Proteome Res. 2010 Mar 5;9(3):1483-95. doi: 10.1021/pr9010124.

Combining peak- and chromatogram-based retention time alignment algorithms for multiple chromatography-mass spectrometry datasets.

BMC Bioinformatics. 2012 Aug 27;13:214. doi: 10.1186/1471-2105-13-214.

PeakLink: a new peptide peak linking method in LC-MS/MS using wavelet and SVM.

Bioinformatics. 2014 Sep 1;30(17):2464-70. doi: 10.1093/bioinformatics/btu299. Epub 2014 May 9.

Graph-based peak alignment algorithms for multiple liquid chromatography-mass spectrometry datasets.

Bioinformatics. 2013 Oct 1;29(19):2469-76. doi: 10.1093/bioinformatics/btt435. Epub 2013 Jul 30.

Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements.

BMC Bioinformatics. 2008 Sep 15;9:375. doi: 10.1186/1471-2105-9-375.

Fast parametric time warping of peak lists.

Bioinformatics. 2015 Sep 15;31(18):3063-5. doi: 10.1093/bioinformatics/btv299. Epub 2015 May 13.

引用本文的文献

Alignstein: Optimal transport for improved LC-MS retention time alignment.

Gigascience. 2022 Nov 3;11. doi: 10.1093/gigascience/giac101.

Peptidomic Approach for the Identification of Peptides with Potential Antioxidant and Anti-Hyperthensive Effects Derived From Asparagus By-Products.

Molecules. 2019 Oct 8;24(19):3627. doi: 10.3390/molecules24193627.

From chromatogram to analyte to metabolite. How to pick horses for courses from the massive web resources for mass spectral plant metabolomics.

Gigascience. 2017 Jul 1;6(7):1-20. doi: 10.1093/gigascience/gix037.

本文引用的文献

MetAssign: probabilistic annotation of metabolites from LC-MS data using a Bayesian clustering approach.

Bioinformatics. 2014 Oct;30(19):2764-71. doi: 10.1093/bioinformatics/btu370. Epub 2014 Jun 9.

LC-MS alignment in theory and practice: a comprehensive algorithmic review.

Brief Bioinform. 2015 Jan;16(1):104-17. doi: 10.1093/bib/bbt080. Epub 2013 Nov 21.

Multi-profile Bayesian alignment model for LC-MS data analysis with integration of internal standards.

Bioinformatics. 2013 Nov 1;29(21):2774-80. doi: 10.1093/bioinformatics/btt461. Epub 2013 Sep 6.

Graph-based peak alignment algorithms for multiple liquid chromatography-mass spectrometry datasets.

Bioinformatics. 2013 Oct 1;29(19):2469-76. doi: 10.1093/bioinformatics/btt435. Epub 2013 Jul 30.

A combinatorial approach to the peptide feature matching problem for label-free quantification.

Bioinformatics. 2013 Jul 15;29(14):1768-75. doi: 10.1093/bioinformatics/btt274. Epub 2013 May 10.

Combining peak- and chromatogram-based retention time alignment algorithms for multiple chromatography-mass spectrometry datasets.

BMC Bioinformatics. 2012 Aug 27;13:214. doi: 10.1186/1471-2105-13-214.

Toward global metabolomics analysis with hydrophilic interaction liquid chromatography-mass spectrometry: improved metabolite identification by retention time prediction.

Anal Chem. 2011 Nov 15;83(22):8703-10. doi: 10.1021/ac2021823. Epub 2011 Oct 21.

MassUntangler: a novel alignment tool for label-free liquid chromatography-mass spectrometry proteomic data.

J Chromatogr A. 2011 Dec 9;1218(49):8859-68. doi: 10.1016/j.chroma.2011.06.062. Epub 2011 Jun 22.

SIMA: simultaneous multiple alignment of LC/MS peak lists.

Bioinformatics. 2011 Apr 1;27(7):987-93. doi: 10.1093/bioinformatics/btr051. Epub 2011 Feb 3.

Simple data-reduction method for high-resolution LC-MS data in metabolomics.

Bioanalysis. 2009 Dec;1(9):1551-7. doi: 10.4155/bio.09.146.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

整合峰分组信息以对齐多个液相色谱-质谱数据集。

Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets.

机构信息