Suppr超能文献

结合多元统计工具提高代谢组学功能分析的洞察力。

Improving insights from metabolomic functional analysis combining multivariate tools.

机构信息

Neonatal Research Group, Health Research Institute La Fe (IISLAFE), Avda Fernando Abril Martorell 106, 46026, Valencia, Spain; Primary Care Interventions to Prevent Maternal and Child Chronic Diseases of Perinatal and Developmental Origin Network (RICORS-SAMID), Instituto de Salud Carlos III, Madrid, Spain; Servicio de Análisis de Vesículas Extracelulares (SAVE), Health Research Institute La Fe (IISLAFE), Avda Fernando Abril Martorell 106, 46026, Valencia, Spain.

Departamento de Bioquímica y Biología Molecular, Universidad de Valencia, Valencia, Spain; Unit for Experimental Hepatology, Health Research Institute La Fe (IISLAFE), Valencia, Spain; Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Madrid, Spain.

出版信息

Anal Chim Acta. 2024 Sep 22;1323:343062. doi: 10.1016/j.aca.2024.343062. Epub 2024 Aug 5.

Abstract

BACKGROUND

Metabolomics is a scientific field that relies on the comprehensive analysis of metabolites to provide direct insights into functional processes in biological systems. Metabolomic data provides valuable insights into the functional processes of biological systems, often analyzed through univariate and multivariate approaches, and well as with functional or pathway analysis using different methods such as mummichog. Yet, the integration of results from these sources to aid the interpretation of their biological significance remains challenging. This represents a significant bottleneck limiting the applicability of multivariate analysis of metabolomic data, despite its potential for providing deep biological insights.

RESULTS

In this work we propose two straightforward methods to facilitate the interpretation of results from multivariate analysis and functional metabolic analysis using: i) p-values from multivariate tests as input in functional analysis, and ii) cluster-CV to assess the impact on the predictive performance of a multivariate model at the pathway level. Four simulated data sets were analyzed including a data set with no class separation, and three data sets with a statistically significant discrimination between classes by including either univariate, multivariate, or both types of discriminant effects. The data sets were analyzed using univariate tests and OPLS-DA. Furthermore, p-values for each feature estimated by univariate analysis and OPLS-DA were used as input for functional analysis in mummichog. Cluster-CV was then used to assess the effect of detected metabolic pathways on the class separation observed by OPLS-DA.

SIGNIFICANCE

Through simulated data, we show how these approaches enhance the interpretation of biological effects driving multivariate models and support the identification of altered pathways not detected by univariate analysis. By providing a deeper understanding of metabolic phenotypes, these methods might improve the biological insights derived from statistical and functional analysis of future or previous studies.

摘要

背景

代谢组学是一个依赖于对代谢物的全面分析来提供对生物系统功能过程的直接洞察的科学领域。代谢组学数据为生物系统的功能过程提供了有价值的见解,通常通过单变量和多变量方法进行分析,并且使用不同的方法(如 mummichog)进行功能或途径分析。然而,整合这些来源的结果以帮助解释其生物学意义仍然具有挑战性。这是一个重大的瓶颈,限制了代谢组学数据的多变量分析的适用性,尽管它有提供深入的生物学见解的潜力。

结果

在这项工作中,我们提出了两种简单的方法来促进多变量分析和功能代谢分析结果的解释:i)将多变量检验的 p 值作为功能分析的输入,ii)使用聚类-CV 评估多变量模型在途径水平上对预测性能的影响。分析了四个模拟数据集,包括一个没有类分离的数据集,以及三个通过包含单变量、多变量或这两种判别效应来在类之间进行统计上显著区分的数据集。使用单变量检验和 OPLS-DA 分析数据集。此外,还使用单变量分析和 OPLS-DA 估计的每个特征的 p 值作为 mummichog 中功能分析的输入。然后使用聚类-CV 评估检测到的代谢途径对 OPLS-DA 观察到的类分离的影响。

意义

通过模拟数据,我们展示了这些方法如何增强对驱动多变量模型的生物学效应的解释,并支持识别未通过单变量分析检测到的改变途径。通过提供对代谢表型的更深入了解,这些方法可能会改善从未来或以前研究的统计和功能分析中得出的生物学见解。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验