液相色谱-质谱联用蛋白质组学和代谢组学测量校准程序的批判性评估。

Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements.

作者信息

Lange Eva, Tautenhahn Ralf, Neumann Steffen, Gröpl Clemens

机构信息

Beatson Institute for Cancer Research, Proteomics and Mass Spectrometry Group, Scotland, UK.

出版信息

BMC Bioinformatics. 2008 Sep 15;9:375. doi: 10.1186/1471-2105-9-375.

DOI:10.1186/1471-2105-9-375

PMID:18793413

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2570366/

Abstract

BACKGROUND

Liquid chromatography coupled to mass spectrometry (LC-MS) has become a prominent tool for the analysis of complex proteomics and metabolomics samples. In many applications multiple LC-MS measurements need to be compared, e. g. to improve reliability or to combine results from different samples in a statistical comparative analysis. As in all physical experiments, LC-MS data are affected by uncertainties, and variability of retention time is encountered in all data sets. It is therefore necessary to estimate and correct the underlying distortions of the retention time axis to search for corresponding compounds in different samples. To this end, a variety of so-called LC-MS map alignment algorithms have been developed during the last four years. Most of these approaches are well documented, but they are usually evaluated on very specific samples only. So far, no publication has been assessing different alignment algorithms using a standard LC-MS sample along with commonly used quality criteria.

RESULTS

We propose two LC-MS proteomics as well as two LC-MS metabolomics data sets that represent typical alignment scenarios. Furthermore, we introduce a new quality measure for the evaluation of LC-MS alignment algorithms. Using the four data sets to compare six freely available alignment algorithms proposed for the alignment of metabolomics and proteomics LC-MS measurements, we found significant differences with respect to alignment quality, running time, and usability in general.

CONCLUSION

The multitude of available alignment methods necessitates the generation of standard data sets and quality measures that allow users as well as developers to benchmark and compare their map alignment tools on a fair basis. Our study represents a first step in this direction. Currently, the installation and evaluation of the "correct" parameter settings can be quite a time-consuming task, and the success of a particular method is still highly dependent on the experience of the user. Therefore, we propose to continue and extend this type of study to a community-wide competition. All data as well as our evaluation scripts are available at http://msbi.ipb-halle.de/msbi/caap.

摘要

背景

液相色谱-质谱联用（LC-MS）已成为分析复杂蛋白质组学和代谢组学样品的重要工具。在许多应用中，需要比较多次LC-MS测量结果，例如提高可靠性或在统计比较分析中合并来自不同样品的结果。与所有物理实验一样，LC-MS数据会受到不确定性的影响，并且在所有数据集中都会遇到保留时间的变异性。因此，有必要估计并校正保留时间轴的潜在偏差，以便在不同样品中寻找相应的化合物。为此，在过去四年中开发了各种所谓的LC-MS图谱比对算法。这些方法大多有详细记录，但通常仅在非常特定的样品上进行评估。到目前为止，还没有出版物使用标准LC-MS样品以及常用的质量标准来评估不同的比对算法。

结果

我们提出了两个LC-MS蛋白质组学数据集和两个LC-MS代谢组学数据集，它们代表了典型的比对场景。此外，我们引入了一种新的质量指标来评估LC-MS比对算法。使用这四个数据集来比较为代谢组学和蛋白质组学LC-MS测量比对而提出的六种免费可用的比对算法，我们发现总体上在比对质量、运行时间和可用性方面存在显著差异。

结论

众多可用的比对方法需要生成标准数据集和质量指标，以便用户和开发者能够在公平的基础上对他们的图谱比对工具进行基准测试和比较。我们的研究代表了朝这个方向迈出的第一步。目前，安装和评估“正确的”参数设置可能是一项相当耗时的任务，并且特定方法的成功仍然高度依赖于用户的经验。因此，我们建议继续并将这类研究扩展为全社区范围的竞赛。所有数据以及我们的评估脚本可在http://msbi.ipb-halle.de/msbi/caap获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9122/2570366/69c5c3171f02/1471-2105-9-375-1.jpg

相似文献

Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements.

BMC Bioinformatics. 2008 Sep 15;9:375. doi: 10.1186/1471-2105-9-375.

A geometric approach for the alignment of liquid chromatography-mass spectrometry data.

Bioinformatics. 2007 Jul 1;23(13):i273-81. doi: 10.1093/bioinformatics/btm209.

Semi-supervised LC/MS alignment for differential proteomics.

Bioinformatics. 2006 Jul 15;22(14):e132-40. doi: 10.1093/bioinformatics/btl219.

Design and analysis of quantitative differential proteomics investigations using LC-MS technology.

J Bioinform Comput Biol. 2008 Feb;6(1):107-23. doi: 10.1142/s0219720008003321.

Data reduction of isotope-resolved LC-MS spectra.

Bioinformatics. 2007 Jun 1;23(11):1394-400. doi: 10.1093/bioinformatics/btm083. Epub 2007 May 11.

MASPECTRAS: a platform for management and analysis of proteomics LC-MS/MS data.

BMC Bioinformatics. 2007 Jun 13;8:197. doi: 10.1186/1471-2105-8-197.

Data pre-processing in liquid chromatography-mass spectrometry-based proteomics.

Bioinformatics. 2005 Nov 1;21(21):4054-9. doi: 10.1093/bioinformatics/bti660. Epub 2005 Sep 8.

Time alignment algorithms based on selected mass traces for complex LC-MS data.

J Proteome Res. 2010 Mar 5;9(3):1483-95. doi: 10.1021/pr9010124.

Graph-based peak alignment algorithms for multiple liquid chromatography-mass spectrometry datasets.

Bioinformatics. 2013 Oct 1;29(19):2469-76. doi: 10.1093/bioinformatics/btt435. Epub 2013 Jul 30.

Installation and use of the Computational Proteomics Analysis System (CPAS).

Curr Protoc Bioinformatics. 2007 Jun;Chapter 13:Unit 13.5. doi: 10.1002/0471250953.bi1305s18.

引用本文的文献

Data Treatment for LC-MS Untargeted Analysis.

Methods Mol Biol. 2025;2891:91-108. doi: 10.1007/978-1-0716-4334-1_5.

Antimicrobial Activity of -Mediated Gold Nanoparticles against : A Metabolomic and Docking Study.

Int J Mol Sci. 2024 Sep 19;25(18):10090. doi: 10.3390/ijms251810090.

A stochastic approach for parameter optimization of feature detection algorithms for non-target screening in mass spectrometry.

Anal Bioanal Chem. 2024 Jul 12. doi: 10.1007/s00216-024-05425-3.

DeepRTAlign: toward accurate retention time alignment for large cohort mass spectrometry data analysis.

Nat Commun. 2023 Dec 11;14(1):8188. doi: 10.1038/s41467-023-43909-5.

Plasma glycoproteomics delivers high-specificity disease biomarkers by detecting site-specific glycosylation abnormalities.

J Adv Res. 2024 Jul;61:179-192. doi: 10.1016/j.jare.2023.09.002. Epub 2023 Sep 6.

Retention Time Alignment for Protein Turnover Studies Using Heavy Water Metabolic Labeling.

J Proteome Res. 2023 Feb 3;22(2):410-419. doi: 10.1021/acs.jproteome.2c00592. Epub 2023 Jan 24.

An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics.

Front Mol Biosci. 2022 Nov 9;9:930204. doi: 10.3389/fmolb.2022.930204. eCollection 2022.

A New Strategy Based on LC-Q TRAP-MS for Determining the Distribution of Polyphenols in Different Apple Varieties.

Foods. 2022 Oct 27;11(21):3390. doi: 10.3390/foods11213390.

Alignstein: Optimal transport for improved LC-MS retention time alignment.

Gigascience. 2022 Nov 3;11. doi: 10.1093/gigascience/giac101.

A matching algorithm with isotope distribution pattern in LC-MS based on support vector machine (SVM) learning model.

RSC Adv. 2019 Sep 4;9(48):27874-27882. doi: 10.1039/c9ra03789f. eCollection 2019 Sep 3.

本文引用的文献

Metabolome analysis of biosynthetic mutants reveals a diversity of metabolic changes and allows identification of a large number of new compounds in Arabidopsis.

Plant Physiol. 2008 Aug;147(4):2107-20. doi: 10.1104/pp.108.117754. Epub 2008 Jun 13.

Current trends and future requirements for the mass spectrometric investigation of microbial, mammalian and plant metabolomes.

Phys Biol. 2008 Feb 20;5(1):011001. doi: 10.1088/1478-3975/5/1/011001.

OpenMS - an open-source software framework for mass spectrometry.

BMC Bioinformatics. 2008 Mar 26;9:163. doi: 10.1186/1471-2105-9-163.

Comparative LC-MS: a landscape of peaks and valleys.

Proteomics. 2008 Feb;8(4):731-49. doi: 10.1002/pmic.200700694.

Alignment of LC-MS images, with applications to biomarker discovery and protein identification.

Proteomics. 2008 Feb;8(4):650-72. doi: 10.1002/pmic.200700791.

Critical assessment of methods of protein structure prediction-Round VII.

Proteins. 2007;69 Suppl 8(S8):3-9. doi: 10.1002/prot.21767.

Introduction to computational proteomics.

PLoS Comput Biol. 2007 Jul;3(7):e114. doi: 10.1371/journal.pcbi.0030114.

A geometric approach for the alignment of liquid chromatography-mass spectrometry data.

Bioinformatics. 2007 Jul 1;23(13):i273-81. doi: 10.1093/bioinformatics/btm209.

Temporal analysis of sucrose-induced phosphorylation changes in plasma membrane proteins of Arabidopsis.

Mol Cell Proteomics. 2007 Oct;6(10):1711-26. doi: 10.1074/mcp.M700164-MCP200. Epub 2007 Jun 23.

Analysis and quantification of diagnostic serum markers and protein signatures for Gaucher disease.

Mol Cell Proteomics. 2007 May;6(5):755-66. doi: 10.1074/mcp.M600303-MCP200. Epub 2007 Feb 9.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

液相色谱-质谱联用蛋白质组学和代谢组学测量校准程序的批判性评估。

Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements.

作者信息

Lange Eva, Tautenhahn Ralf, Neumann Steffen, Gröpl Clemens

机构信息

Beatson Institute for Cancer Research, Proteomics and Mass Spectrometry Group, Scotland, UK.