Genome Informatics, Faculty of Technology and CeBiTec, Algae Biotechnology & Bioenergy, Faculty of Biology and CeBiTec, Proteomics and Metabolomics Research, Bielefeld University, 33501 Bielefeld, Germany.
Bioinformatics. 2014 Apr 1;30(7):988-95. doi: 10.1093/bioinformatics/btt738. Epub 2013 Dec 20.
Comprehensive 2D gas chromatography-mass spectrometry is an established method for the analysis of complex mixtures in analytical chemistry and metabolomics. It produces large amounts of data that require semiautomatic, but preferably automatic handling. This involves the location of significant signals (peaks) and their matching and alignment across different measurements. To date, there exist only a few openly available algorithms for the retention time alignment of peaks originating from such experiments that scale well with increasing sample and peak numbers, while providing reliable alignment results.
We describe BiPACE 2D, an automated algorithm for retention time alignment of peaks from 2D gas chromatography-mass spectrometry experiments and evaluate it on three previously published datasets against the mSPA, SWPA and Guineu algorithms. We also provide a fourth dataset from an experiment studying the H2 production of two different strains of Chlamydomonas reinhardtii that is available from the MetaboLights database together with the experimental protocol, peak-detection results and manually curated multiple peak alignment for future comparability with newly developed algorithms.
BiPACE 2D is contained in the freely available Maltcms framework, version 1.3, hosted at http://maltcms.sf.net, under the terms of the L-GPL v3 or Eclipse Open Source licenses. The software used for the evaluation along with the underlying datasets is available at the same location. The C.reinhardtii dataset is freely available at http://www.ebi.ac.uk/metabolights/MTBLS37.
全面的二维气相色谱-质谱联用分析是分析化学和代谢组学中分析复杂混合物的一种既定方法。它产生了大量的数据,需要半自动,最好是自动处理。这涉及到显著信号(峰)的定位,以及它们在不同测量中的匹配和对齐。迄今为止,对于源自此类实验的峰的保留时间对齐,仅存在少数几个公开可用的算法,这些算法可以很好地扩展到增加的样本和峰数量,同时提供可靠的对齐结果。
我们描述了 BiPACE 2D,这是一种用于二维气相色谱-质谱实验中峰的保留时间对齐的自动算法,并在三个先前发布的数据集上对其进行了评估,与 mSPA、SWPA 和 Guineu 算法进行了比较。我们还提供了第四个数据集,该数据集来自一项研究两种不同的莱茵衣藻(Chlamydomonas reinhardtii)菌株的 H2 产生的实验,该数据集可从 MetaboLights 数据库中获得,同时还提供了实验方案、峰检测结果和手动整理的多个峰对齐,以便将来与新开发的算法进行比较。
BiPACE 2D 包含在免费提供的 Maltcms 框架版本 1.3 中,该框架托管在 http://maltcms.sf.net 上,根据 L-GPL v3 或 Eclipse 开源许可证的规定。用于评估的软件以及基础数据集都可在同一位置获得。C.reinhardtii 数据集可在 http://www.ebi.ac.uk/metabolights/MTBLS37 上免费获得。