PathwayMultiomics：一个用于对具有匹配或不匹配样本的多组学数据集进行高效综合分析的R包。

PathwayMultiomics: An R Package for Efficient Integrative Analysis of Multi-Omics Datasets With Matched or Un-matched Samples.

作者信息

Odom Gabriel J, Colaprico Antonio, Silva Tiago C, Chen X Steven, Wang Lily

机构信息

Department of Biostatistics, Stempel College of Public Health, Florida International University, Miami, FL, United States.

Department of Public Health Sciences, Miller School of Medicine, University of Miami, Miami, FL, United States.

出版信息

Front Genet. 2021 Dec 22;12:783713. doi: 10.3389/fgene.2021.783713. eCollection 2021.

DOI:10.3389/fgene.2021.783713

PMID:35003218

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8729182/

Abstract

Recent advances in technology have made multi-omics datasets increasingly available to researchers. To leverage the wealth of information in multi-omics data, a number of integrative analysis strategies have been proposed recently. However, effectively extracting biological insights from these large, complex datasets remains challenging. In particular, matched samples with multiple types of omics data measured on each sample are often required for multi-omics analysis tools, which can significantly reduce the sample size. Another challenge is that analysis techniques such as dimension reductions, which extract association signals in high dimensional datasets by estimating a few variables that explain most of the variations in the samples, are typically applied to whole-genome data, which can be computationally demanding. Here we present pathwayMultiomics, a pathway-based approach for integrative analysis of multi-omics data with categorical, continuous, or survival outcome variables. The input of pathwayMultiomics is pathway values for individual omics data types, which are then integrated using a novel statistic, the MiniMax statistic, to prioritize pathways dysregulated in multiple types of omics datasets. Importantly, pathwayMultiomics is computationally efficient and does not require matched samples in multi-omics data. We performed a comprehensive simulation study to show that pathwayMultiomics significantly outperformed currently available multi-omics tools with improved power and well-controlled false-positive rates. In addition, we also analyzed real multi-omics datasets to show that pathwayMultiomics was able to recover known biology by nominating biologically meaningful pathways in complex diseases such as Alzheimer's disease.

摘要

技术的最新进展使多组学数据集越来越多地可供研究人员使用。为了利用多组学数据中的丰富信息，最近提出了一些综合分析策略。然而，从这些庞大而复杂的数据集中有效提取生物学见解仍然具有挑战性。特别是，多组学分析工具通常需要对每个样本测量多种类型组学数据的匹配样本，这可能会显著减少样本量。另一个挑战是，诸如降维之类的分析技术，通过估计少数几个解释样本中大部分变异的变量来提取高维数据集中的关联信号，通常应用于全基因组数据，这在计算上要求很高。在这里，我们介绍pathwayMultiomics，这是一种基于通路的方法，用于对具有分类、连续或生存结果变量的多组学数据进行综合分析。pathwayMultiomics的输入是各个组学数据类型的通路值，然后使用一种新颖的统计量——极小极大统计量进行整合，以对在多种组学数据集中失调的通路进行优先级排序。重要的是，pathwayMultiomics在计算上效率很高，并且不需要多组学数据中的匹配样本。我们进行了一项全面的模拟研究，以表明pathwayMultiomics在提高功效和良好控制假阳性率方面显著优于目前可用的多组学工具。此外，我们还分析了真实的多组学数据集，以表明pathwayMultiomics能够通过在阿尔茨海默病等复杂疾病中提名具有生物学意义的通路来恢复已知的生物学信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b78b/8729182/4d088e7f4be2/fgene-12-783713-g001.jpg

相似文献

PathwayMultiomics: An R Package for Efficient Integrative Analysis of Multi-Omics Datasets With Matched or Un-matched Samples.

Front Genet. 2021 Dec 22;12:783713. doi: 10.3389/fgene.2021.783713. eCollection 2021.

Clustering and variable selection evaluation of 13 unsupervised methods for multi-omics data integration.

Brief Bioinform. 2020 Dec 1;21(6):2011-2030. doi: 10.1093/bib/bbz138.

Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification

An integrative imputation method based on multi-omics datasets.

BMC Bioinformatics. 2016 Jun 21;17:247. doi: 10.1186/s12859-016-1122-6.

A Review of Integrative Imputation for Multi-Omics Datasets.

Front Genet. 2020 Oct 15;11:570255. doi: 10.3389/fgene.2020.570255. eCollection 2020.

A network embedding based method for partial multi-omics integration in cancer subtyping.

Methods. 2021 Aug;192:67-76. doi: 10.1016/j.ymeth.2020.08.001. Epub 2020 Aug 14.

Fast dimension reduction and integrative clustering of multi-omics data using low-rank approximation: application to cancer molecular classification.

BMC Genomics. 2015 Dec 1;16:1022. doi: 10.1186/s12864-015-2223-8.

Integrative Biology Approaches Applied to Human Diseases

mosGraphGen: a novel tool to generate multi-omics signaling graphs to facilitate integrative and interpretable graph AI model development.

bioRxiv. 2024 Aug 27:2024.05.15.594360. doi: 10.1101/2024.05.15.594360.

PathwayPCA: an R/Bioconductor Package for Pathway Based Integrative Analysis of Multi-Omics Data.

Proteomics. 2020 Nov;20(21-22):e1900409. doi: 10.1002/pmic.201900409. Epub 2020 Jul 2.

引用本文的文献

Brain high-throughput multi-omics data reveal molecular heterogeneity in Alzheimer's disease.

PLoS Biol. 2024 Apr 30;22(4):e3002607. doi: 10.1371/journal.pbio.3002607. eCollection 2024 Apr.

PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration.

PLoS Comput Biol. 2024 Mar 25;20(3):e1011814. doi: 10.1371/journal.pcbi.1011814. eCollection 2024 Mar.

Multi-omics in stress and health research: study designs that will drive the field forward.

Stress. 2024 Jan;27(1):2321610. doi: 10.1080/10253890.2024.2321610. Epub 2024 Feb 29.

PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration.

bioRxiv. 2024 Jan 9:2024.01.09.574780. doi: 10.1101/2024.01.09.574780.

The promise of multi-omics approaches to discover biological alterations with clinical relevance in Alzheimer's disease.

Front Aging Neurosci. 2022 Dec 7;14:1065904. doi: 10.3389/fnagi.2022.1065904. eCollection 2022.

A comprehensive survey of the approaches for pathway analysis using multi-omics data integration.

Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac435.

Cross-tissue analysis of blood and brain epigenome-wide association studies in Alzheimer's disease.

Nat Commun. 2022 Aug 18;13(1):4852. doi: 10.1038/s41467-022-32475-x.

Applications of Omics Technology for Livestock Selection and Improvement.

Front Genet. 2022 Jun 2;13:774113. doi: 10.3389/fgene.2022.774113. eCollection 2022.

本文引用的文献

Sex-specific DNA methylation differences in Alzheimer's disease pathology.

Acta Neuropathol Commun. 2021 Apr 26;9(1):77. doi: 10.1186/s40478-021-01177-8.

Epigenome-wide meta-analysis of DNA methylation differences in prefrontal cortex implicates the immune processes in Alzheimer's disease.

Nat Commun. 2020 Nov 30;11(1):6114. doi: 10.1038/s41467-020-19791-w.

mitch: multi-contrast pathway enrichment for multi-omics and single-cell profiling data.

BMC Genomics. 2020 Jun 29;21(1):447. doi: 10.1186/s12864-020-06856-9.

PathwayPCA: an R/Bioconductor Package for Pathway Based Integrative Analysis of Multi-Omics Data.

Proteomics. 2020 Nov;20(21-22):e1900409. doi: 10.1002/pmic.201900409. Epub 2020 Jul 2.

Histone Deacetylases Inhibitors in Neurodegenerative Diseases, Neuroprotection and Neuronal Differentiation.

Front Pharmacol. 2020 Apr 24;11:537. doi: 10.3389/fphar.2020.00537. eCollection 2020.

The Role of Chemokines in Alzheimer's Disease.

Endocr Metab Immune Disord Drug Targets. 2020;20(9):1383-1390. doi: 10.2174/1871530320666200131110744.

Clonally expanded CD8 T cells patrol the cerebrospinal fluid in Alzheimer's disease.

Nature. 2020 Jan;577(7790):399-404. doi: 10.1038/s41586-019-1895-7. Epub 2020 Jan 8.

Insights into Impact of DNA Copy Number Alteration and Methylation on the Proteogenomic Landscape of Human Ovarian Cancer via a Multi-omics Integrative Analysis.

Mol Cell Proteomics. 2019 Aug 9;18(8 suppl 1):S52-S65. doi: 10.1074/mcp.RA118.001220. Epub 2019 Jun 21.

Genetic meta-analysis of diagnosed Alzheimer's disease identifies new risk loci and implicates Aβ, tau, immunity and lipid processing.

Nat Genet. 2019 Mar;51(3):414-430. doi: 10.1038/s41588-019-0358-2. Epub 2019 Feb 28.

A focus on CXCR4 in Alzheimer's disease.

Brain Circ. 2017 Oct-Dec;3(4):199-203. doi: 10.4103/bc.bc_13_17. Epub 2017 Dec 29.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

PathwayMultiomics：一个用于对具有匹配或不匹配样本的多组学数据集进行高效综合分析的R包。

PathwayMultiomics: An R Package for Efficient Integrative Analysis of Multi-Omics Datasets With Matched or Un-matched Samples.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献