文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

A tutorial review: Metabolomics and partial least squares-discriminant analysis--a marriage of convenience or a shotgun wedding.

作者信息

Gromski Piotr S, Muhamadali Howbeer, Ellis David I, Xu Yun, Correa Elon, Turner Michael L, Goodacre Royston

机构信息

School of Chemistry, Manchester Institute of Biotechnology, The University of Manchester, 131 Princess Street, Manchester M1 7DN, UK.

School of Chemistry, Brunswick Street, The University of Manchester, Manchester M13 9PL, UK.

出版信息

Anal Chim Acta. 2015 Jun 16;879:10-23. doi: 10.1016/j.aca.2015.02.012. Epub 2015 Feb 11.


DOI:10.1016/j.aca.2015.02.012
PMID:26002472
Abstract

The predominance of partial least squares-discriminant analysis (PLS-DA) used to analyze metabolomics datasets (indeed, it is the most well-known tool to perform classification and regression in metabolomics), can be said to have led to the point that not all researchers are fully aware of alternative multivariate classification algorithms. This may in part be due to the widespread availability of PLS-DA in most of the well-known statistical software packages, where its implementation is very easy if the default settings are used. In addition, one of the perceived advantages of PLS-DA is that it has the ability to analyze highly collinear and noisy data. Furthermore, the calibration model is known to provide a variety of useful statistics, such as prediction accuracy as well as scores and loadings plots. However, this method may provide misleading results, largely due to a lack of suitable statistical validation, when used by non-experts who are not aware of its potential limitations when used in conjunction with metabolomics. This tutorial review aims to provide an introductory overview to several straightforward statistical methods such as principal component-discriminant function analysis (PC-DFA), support vector machines (SVM) and random forests (RF), which could very easily be used either to augment PLS or as alternative supervised learning methods to PLS-DA. These methods can be said to be particularly appropriate for the analysis of large, highly-complex data sets which are common output(s) in metabolomics studies where the numbers of variables often far exceed the number of samples. In addition, these alternative techniques may be useful tools for generating parsimonious models through feature selection and data reduction, as well as providing more propitious results. We sincerely hope that the general reader is left with little doubt that there are several promising and readily available alternatives to PLS-DA, to analyze large and highly complex data sets.

摘要

相似文献

[1]
A tutorial review: Metabolomics and partial least squares-discriminant analysis--a marriage of convenience or a shotgun wedding.

Anal Chim Acta. 2015-6-16

[2]
A comparative evaluation of the generalised predictive ability of eight machine learning algorithms across ten clinical metabolomics data sets for binary classification.

Metabolomics. 2019-11-15

[3]
A comparative investigation of modern feature selection and classification approaches for the analysis of mass spectrometry data.

Anal Chim Acta. 2014-6-4

[4]
Classification of structurally related commercial contrast media by near infrared spectroscopy.

J Pharm Biomed Anal. 2014-3

[5]
Primal-dual for classification with rejection (PD-CR): a novel method for classification and feature selection-an application in metabolomics studies.

BMC Bioinformatics. 2021-12-15

[6]
Evaluation of Multivariate Classification Models for Analyzing NMR Metabolomics Data.

J Proteome Res. 2019-8-22

[7]
Chemometrics-assisted simultaneous voltammetric determination of ascorbic acid, uric acid, dopamine and nitrite: application of non-bilinear voltammetric data for exploiting first-order advantage.

Talanta. 2014-2

[8]
Processing and analysis of GC/LC-MS-based metabolomics data.

Methods Mol Biol. 2011

[9]
Comparative analysis of targeted metabolomics: dominance-based rough set approach versus orthogonal partial least square-discriminant analysis.

J Biomed Inform. 2015-2

[10]
Evaluation of Classifier Performance for Multiclass Phenotype Discrimination in Untargeted Metabolomics.

Metabolites. 2017-6-21

引用本文的文献

[1]
Combining clinical chemistry with metabolomics for metabolic phenotyping at population levels.

Metabolomics. 2025-8-29

[2]
Machine Learning-Driven Insights in Cancer Metabolomics: From Subtyping to Biomarker Discovery and Prognostic Modeling.

Metabolites. 2025-8-1

[3]
Integrated Analysis of Serum and Fecal Metabolites Reveals the Role of Bile Acid Metabolism in Drug-induced Liver Injury: Implications for Diagnostic and Prognostic Biomarkers.

J Clin Transl Hepatol. 2025-8-28

[4]
Integrated Spectroscopic Analysis of Wild Beers: Molecular Composition and Antioxidant Properties.

Int J Mol Sci. 2025-7-21

[5]
Unveiling unique metabolomic and transcriptomic profiles in three Brassicaceae crops.

Front Plant Sci. 2025-7-3

[6]
Chronic stress is associated with altered gut microbiota profile and relevant metabolites in adolescents.

BMC Microbiol. 2025-7-8

[7]
On Selecting Robust Approaches for Learning Predictive Biomarkers in Metabolomics Data Sets.

Anal Chem. 2025-6-24

[8]
Metabolite microextraction on surface-enhanced Raman scattering nanofibres and DO probing accelerate antibiotic susceptibility testing.

NPJ Biosens. 2025

[9]
A Data-Driven Approach to Link GC-MS and LC-MS with Sensory Attributes of Chicken Bouillon with Added Yeast-Derived Flavor Products in a Combined Prediction Model.

Metabolites. 2025-5-8

[10]
MMEASE: enhanced analytical workflow for single-cell metabolomics.

Nucleic Acids Res. 2025-7-7

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索