An integrated workflow for robust alignment and simplified quantitative analysis of NMR spectrometry data.

Author information

Department of Mathematics and Computer Science, University of Antwerp, Belgium.

Publication information

BMC Bioinformatics. 2011 Oct 20;12:405. doi: 10.1186/1471-2105-12-405.

Abstract

BACKGROUND

Nuclear magnetic resonance spectroscopy (NMR) is a powerful technique to reveal and compare quantitative metabolic profiles of biological tissues. However, chemical and physical sample variations make the analysis of the data challenging, and typically require the application of a number of preprocessing steps prior to data interpretation. For example, noise reduction, normalization, baseline correction, peak picking, spectrum alignment and statistical analysis are indispensable components in any NMR analysis pipeline.

RESULTS

We introduce a novel suite of informatics tools for the quantitative analysis of NMR metabolomic profile data. The core of the processing cascade is a novel peak alignment algorithm, called hierarchical Cluster-based Peak Alignment (CluPA). The algorithm aligns a target spectrum to the reference spectrum in a top-down fashion by building a hierarchical cluster tree from peak lists of reference and target spectra and then dividing the spectra into smaller segments based on the most distant clusters of the tree. To reduce the computational time needed to estimate the spectral misalignment, the method uses fast Fourier transform (FFT) cross-correlation. Since the method returns a high-quality alignment, we can propose a simple methodology to study the variability of the NMR spectra. For each aligned NMR data point, the ratio of the between-group and within-group sums of squares (BW-ratio) is calculated to quantify the difference in variability between and within predefined groups of NMR spectra. This differential analysis is related to the F-statistic of a one-way ANOVA, but makes no distributional assumptions. Statistical inference based on the BW-ratio is achieved by bootstrapping the null distribution from the experimental data.
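The FFT cross-correlation step can be illustrated with a minimal sketch. It is not the speaq implementation: the function names `fft` and `estimate_shift`, the `max_shift` parameter, and the zero-padding choice are assumptions made for this example, and only the Python standard library is used. The sketch estimates the single integer shift that best aligns one target segment to a reference segment of the same length; the hierarchical segmentation of CluPA is not shown.

```python
import cmath


def fft(x, inverse=False):
    """Recursive radix-2 FFT (unnormalized); len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def estimate_shift(reference, target, max_shift):
    """Return the integer shift s (|s| <= max_shift) to apply to `target`
    (positive = move right) that maximizes its cross-correlation with
    `reference`. Both sequences must have the same length."""
    n = len(reference)
    # Zero-pad to a power of two >= 2n so circular lags do not wrap around.
    size = 1
    while size < 2 * n:
        size *= 2
    ref = [complex(v) for v in reference] + [0j] * (size - n)
    tgt = [complex(v) for v in target] + [0j] * (size - n)
    # Cross-correlation theorem: corr[s] = sum_i target[i] * reference[i + s]
    # equals the inverse DFT of FFT(reference) * conj(FFT(target)).
    product = [r * t.conjugate() for r, t in zip(fft(ref), fft(tgt))]
    corr = [v.real / size for v in fft(product, inverse=True)]
    # Negative lags live at the end of the circular result (index size + s).
    return max(range(-max_shift, max_shift + 1), key=lambda s: corr[s % size])


# Toy data: a triangular peak at index 20 (reference) and at index 25 (target).
reference = [max(0, 5 - abs(i - 20)) for i in range(64)]
target = [max(0, 5 - abs(i - 25)) for i in range(64)]
shift = estimate_shift(reference, target, max_shift=10)
```

In this toy case the target peak sits five points to the right of the reference peak, so the estimated shift is -5 (move the target five points to the left). Computing all lags through the FFT costs O(N log N) rather than the O(N^2) of evaluating each lag directly, which is the reduction in computational time the abstract refers to.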

CONCLUSIONS

The workflow performance was evaluated using a previously published dataset. Correlation maps, spectral and grey scale plots show clear improvements in comparison to other methods, and the down-to-earth quantitative analysis works well for the CluPA-aligned spectra. The whole workflow is embedded into a modular and statistically sound framework that is implemented as an R package called "speaq" ("spectrum alignment and quantitation"), which is freely available from http://code.google.com/p/speaq/.
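The BW-ratio analysis described in the Results can be sketched for a single aligned data point as follows. This is an illustrative sketch rather than the speaq code: the function names, the `n_resamples` and `seed` parameters, and the specific resampling scheme (drawing with replacement from the pooled intensities into groups of the original sizes) are assumptions, since the abstract does not specify how the null distribution is bootstrapped.

```python
import random


def bw_ratio(groups):
    """Between-group over within-group sum of squares at one data point.

    `groups` is a list of lists, one list of intensities per group."""
    pooled = [v for g in groups for v in g]
    grand_mean = sum(pooled) / len(pooled)
    between = 0.0
    within = 0.0
    for g in groups:
        mean = sum(g) / len(g)
        between += len(g) * (mean - grand_mean) ** 2
        within += sum((v - mean) ** 2 for v in g)
    if within == 0.0:
        # Degenerate resamples: no within-group spread at all.
        return float("inf") if between > 0.0 else 0.0
    return between / within


def bootstrap_p_value(groups, n_resamples=2000, seed=0):
    """Estimate how often a BW-ratio at least as large as the observed one
    arises when group membership carries no information (assumed scheme)."""
    rng = random.Random(seed)
    observed = bw_ratio(groups)
    pooled = [v for g in groups for v in g]
    exceed = 0
    for _ in range(n_resamples):
        # Under the null, every group is drawn from the same pooled values.
        resampled = [[rng.choice(pooled) for _ in g] for g in groups]
        if bw_ratio(resampled) >= observed:
            exceed += 1
    # Add-one correction keeps the estimate strictly positive.
    return (exceed + 1) / (n_resamples + 1)


# Toy data: two groups of four spectra, intensities at one aligned point.
control = [1.0, 1.1, 0.9, 1.05]
treated = [3.0, 3.1, 2.9, 3.05]
ratio = bw_ratio([control, treated])
p_value = bootstrap_p_value([control, treated])
```

For k groups and N spectra in total, the one-way ANOVA F-statistic equals the BW-ratio multiplied by (N - k)/(k - 1), so the two rank data points identically; the resampled null simply replaces the F-distribution and its normality assumption.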
