使用方差稳定广义对数变换提高一维和二维核磁共振代谢组学数据的分类准确率。

Improved classification accuracy in 1- and 2-dimensional NMR metabolomics data using the variance stabilising generalised logarithm transformation.

作者信息

Parsons Helen M, Ludwig Christian, Günther Ulrich L, Viant Mark R

机构信息

Centre for Systems Biology, The University of Birmingham, Edgbaston, Birmingham, UK.

出版信息

BMC Bioinformatics. 2007 Jul 2;8:234. doi: 10.1186/1471-2105-8-234.

DOI:10.1186/1471-2105-8-234

PMID:17605789

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1965488/

Abstract

BACKGROUND

Classifying nuclear magnetic resonance (NMR) spectra is a crucial step in many metabolomics experiments. Since several multivariate classification techniques depend upon the variance of the data, it is important to first minimise any contribution from unwanted technical variance arising from sample preparation and analytical measurements, and thereby maximise any contribution from wanted biological variance between different classes. The generalised logarithm (glog) transform was developed to stabilise the variance in DNA microarray datasets, but has rarely been applied to metabolomics data. In particular, it has not been rigorously evaluated against other scaling techniques used in metabolomics, nor tested on all forms of NMR spectra including 1-dimensional (1D) 1H, projections of 2D 1H, 1H J-resolved (pJRES), and intact 2D J-resolved (JRES).

RESULTS

Here, the effects of the glog transform are compared against two commonly used variance stabilising techniques, autoscaling and Pareto scaling, as well as unscaled data. The four methods are evaluated in terms of the effects on the variance of NMR metabolomics data and on the classification accuracy following multivariate analysis, the latter achieved using principal component analysis followed by linear discriminant analysis. For two of three datasets analysed, classification accuracies were highest following glog transformation: 100% accuracy for discriminating 1D NMR spectra of hypoxic and normoxic invertebrate muscle, and 100% accuracy for discriminating 2D JRES spectra of fish livers sampled from two rivers. For the third dataset, pJRES spectra of urine from two breeds of dog, the glog transform and autoscaling achieved equal highest accuracies. Additionally we extended the glog algorithm to effectively suppress noise, which proved critical for the analysis of 2D JRES spectra.

CONCLUSION

We have demonstrated that the glog and extended glog transforms stabilise the technical variance in NMR metabolomics datasets. This significantly improves the discrimination between sample classes and has resulted in higher classification accuracies compared to unscaled, autoscaled or Pareto scaled data. Additionally we have confirmed the broad applicability of the glog approach using three disparate datasets from different biological samples using 1D NMR spectra, 1D projections of 2D JRES spectra, and intact 2D JRES spectra.

摘要

背景

在许多代谢组学实验中，对核磁共振（NMR）光谱进行分类是关键步骤。由于多种多元分类技术依赖于数据的方差，因此首先将样本制备和分析测量中产生的不必要技术方差的贡献降至最低，并从而将不同类别之间所需生物方差的贡献最大化，这一点很重要。广义对数（glog）变换是为了稳定DNA微阵列数据集中的方差而开发的，但很少应用于代谢组学数据。特别是，它尚未与代谢组学中使用的其他缩放技术进行严格评估，也未在包括一维（1D）1H、二维1H投影、1H J分辨（pJRES）和完整二维J分辨（JRES）在内的所有形式的NMR光谱上进行测试。

结果

在此，将glog变换的效果与两种常用的方差稳定技术（自动缩放和帕累托缩放）以及未缩放数据进行了比较。根据对NMR代谢组学数据方差的影响以及多变量分析后的分类准确性对这四种方法进行了评估，后者通过主成分分析然后进行线性判别分析来实现。对于分析的三个数据集中的两个，glog变换后的分类准确率最高：区分缺氧和常氧无脊椎动物肌肉的1D NMR光谱的准确率为100%，区分从两条河流采集的鱼肝的二维JRES光谱的准确率为100%。对于第三个数据集，两种犬类尿液的pJRES光谱，glog变换和自动缩放达到了相同的最高准确率。此外，我们扩展了glog算法以有效抑制噪声，这被证明对二维JRES光谱的分析至关重要。

结论

我们已经证明，glog和扩展的glog变换稳定了NMR代谢组学数据集中的技术方差。与未缩放、自动缩放或帕累托缩放的数据相比，这显著提高了样本类别之间的区分度，并导致了更高的分类准确率。此外，我们使用来自不同生物样本的三个不同数据集（使用1D NMR光谱、二维JRES光谱的1D投影和完整的二维JRES光谱）证实了glog方法的广泛适用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6f1/1965488/79d418e7bd0c/1471-2105-8-234-1.jpg

相似文献

Improved classification accuracy in 1- and 2-dimensional NMR metabolomics data using the variance stabilising generalised logarithm transformation.

BMC Bioinformatics. 2007 Jul 2;8:234. doi: 10.1186/1471-2105-8-234.

Effects of the application of different window functions and projection methods on processing of 1H J-resolved nuclear magnetic resonance spectra for metabolomics.

Anal Chim Acta. 2008 Mar 3;610(1):80-8. doi: 10.1016/j.aca.2008.01.030. Epub 2008 Jan 18.

Line-shape analysis of J-resolved NMR spectra: application to metabolomics and quantification of intensity errors from signal processing and high signal congestion.

Magn Reson Chem. 2009 Dec;47 Suppl 1:S86-95. doi: 10.1002/mrc.2501.

Tackling the Peak Overlap Issue in NMR Metabolomics Studies: 1D Projected Correlation Traces from Statistical Correlation Analysis on Nontilted 2D H NMR J-Resolved Spectra.

J Proteome Res. 2019 May 3;18(5):2241-2253. doi: 10.1021/acs.jproteome.9b00093. Epub 2019 Apr 8.

Consecutive Queries to Assess Biological Correlation in NMR Metabolomics: Performance of Comprehensive Search of Multiplets over Typical 1D H NMR Database Search.

J Proteome Res. 2020 Aug 7;19(8):2977-2988. doi: 10.1021/acs.jproteome.9b00872. Epub 2020 Jun 8.

Evaluation of full-resolution J-resolved 1H NMR projections of biofluids for metabonomics information retrieval and biomarker identification.

Anal Chem. 2010 Mar 1;82(5):1811-21. doi: 10.1021/ac902443k.

Improved methods for the acquisition and interpretation of NMR metabolomic data.

Biochem Biophys Res Commun. 2003 Oct 24;310(3):943-8. doi: 10.1016/j.bbrc.2003.09.092.

J-Resolved H NMR 1D-Projections for Large-Scale Metabolic Phenotyping Studies: Application to Blood Plasma Analysis.

Anal Chem. 2017 Nov 7;89(21):11405-11412. doi: 10.1021/acs.analchem.7b02374. Epub 2017 Oct 13.

Comparing normalization methods and the impact of noise.

Metabolomics. 2018 Aug 10;14(8):108. doi: 10.1007/s11306-018-1400-6.

Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification

引用本文的文献

MetaboLabPy-An Open-Source Software Package for Metabolomics NMR Data Processing and Metabolic Tracer Data Analysis.

Metabolites. 2025 Jan 14;15(1):48. doi: 10.3390/metabo15010048.

Metabolomics Simultaneously Derives Benchmark Dose Estimates and Discovers Metabolic Biotransformations in a Rat Bioassay.

Chem Res Toxicol. 2024 Jun 17;37(6):923-934. doi: 10.1021/acs.chemrestox.4c00002. Epub 2024 Jun 6.

Label-Free Quantitation of Endogenous Peptides.

Methods Mol Biol. 2024;2758:125-150. doi: 10.1007/978-1-0716-3646-6_7.

Data processing solutions to render metabolomics more quantitative: case studies in food and clinical metabolomics using Metabox 2.0.

Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae005.

Pretreating and normalizing metabolomics data for statistical analysis.

Genes Dis. 2023 Jul 7;11(3):100979. doi: 10.1016/j.gendis.2023.04.018. eCollection 2024 May.

Statistical normalization methods in microbiome data with application to microbiome cancer research.

Gut Microbes. 2023 Dec;15(2):2244139. doi: 10.1080/19490976.2023.2244139.

Integrated NMR and MS Analysis of the Plasma Metabolome Reveals Major Changes in One-Carbon, Lipid, and Amino Acid Metabolism in Severe and Fatal Cases of COVID-19.

Metabolites. 2023 Jul 24;13(7):879. doi: 10.3390/metabo13070879.

Performance comparison of three scaling algorithms in NMR-based metabolomics analysis.

Open Life Sci. 2023 Mar 27;18(1):20220556. doi: 10.1515/biol-2022-0556. eCollection 2023.

GC-MS Techniques Investigating Potential Biomarkers of Dying in the Last Weeks with Lung Cancer.

Int J Mol Sci. 2023 Jan 13;24(2):1591. doi: 10.3390/ijms24021591.

Preanalytical Pitfalls in Untargeted Plasma Nuclear Magnetic Resonance Metabolomics of Endocrine Hypertension.

Metabolites. 2022 Jul 24;12(8):679. doi: 10.3390/metabo12080679.

本文引用的文献

Direct sampling of organisms from the field and knowledge of their phenotype: key recommendations for environmental metabolomics.

Environ Sci Technol. 2007 May 1;41(9):3375-81. doi: 10.1021/es062745w.

Metabolomic differentiation of Brassica rapa following herbivory by different insect instars using two-dimensional nuclear magnetic resonance spectroscopy.

J Chem Ecol. 2006 Nov;32(11):2417-28. doi: 10.1007/s10886-006-9152-6.

Metabolomic analysis of methyl jasmonate treated Brassica rapa leaves by 2-dimensional NMR spectroscopy.

Phytochemistry. 2006 Nov;67(22):2503-11. doi: 10.1016/j.phytochem.2006.08.018.

Centering, scaling, and transformations: improving the biological information content of metabolomics data.

BMC Genomics. 2006 Jun 8;7:142. doi: 10.1186/1471-2164-7-142.

Scaling and normalization effects in NMR spectroscopic metabonomic data sets.

Anal Chem. 2006 Apr 1;78(7):2262-7. doi: 10.1021/ac0519312.

A functional analysis of mouse models of cardiac disease through metabolic profiling.

J Biol Chem. 2005 Mar 4;280(9):7530-9. doi: 10.1074/jbc.M410200200. Epub 2004 Nov 16.

Discrimination models using variance-stabilizing transformation of metabolomic NMR data.

OMICS. 2004 Summer;8(2):118-30. doi: 10.1089/1536231041388348.

Metabolomics by numbers: acquiring and understanding global metabolite data.

Trends Biotechnol. 2004 May;22(5):245-52. doi: 10.1016/j.tibtech.2004.03.007.

Spectral editing and pattern recognition methods applied to high-resolution magic-angle spinning 1H nuclear magnetic resonance spectroscopy of liver tissues.

Anal Biochem. 2003 Dec 1;323(1):26-32. doi: 10.1016/j.ab.2003.07.026.

Improved methods for the acquisition and interpretation of NMR metabolomic data.

Biochem Biophys Res Commun. 2003 Oct 24;310(3):943-8. doi: 10.1016/j.bbrc.2003.09.092.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用方差稳定广义对数变换提高一维和二维核磁共振代谢组学数据的分类准确率。

Improved classification accuracy in 1- and 2-dimensional NMR metabolomics data using the variance stabilising generalised logarithm transformation.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献