
PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration.

Affiliations

Section of Bioinformatics, Division of Systems Medicine, Department of Metabolism, Digestion, and Reproduction, Faculty of Medicine, Imperial College London, London, United Kingdom.

Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse, France.

Publication information

PLoS Comput Biol. 2024 Mar 25;20(3):e1011814. doi: 10.1371/journal.pcbi.1011814. eCollection 2024 Mar.

DOI:10.1371/journal.pcbi.1011814
PMID:38527092
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10994553/
Abstract

As terabytes of multi-omics data are being generated, there is an ever-increasing need for methods facilitating the integration and interpretation of such data. Current multi-omics integration methods typically output lists, clusters, or subnetworks of molecules related to an outcome. Even with expert domain knowledge, discerning the biological processes involved is a time-consuming activity. Here we propose PathIntegrate, a method for integrating multi-omics datasets based on pathways, designed to exploit knowledge of biological systems and thus provide interpretable models for such studies. PathIntegrate employs single-sample pathway analysis to transform multi-omics datasets from the molecular to the pathway-level, and applies a predictive single-view or multi-view model to integrate the data. Model outputs include multi-omics pathways ranked by their contribution to the outcome prediction, the contribution of each omics layer, and the importance of each molecule in a pathway. Using semi-synthetic data we demonstrate the benefit of grouping molecules into pathways to detect signals in low signal-to-noise scenarios, as well as the ability of PathIntegrate to precisely identify important pathways at low effect sizes. Finally, using COPD and COVID-19 data we showcase how PathIntegrate enables convenient integration and interpretation of complex high-dimensional multi-omics datasets. PathIntegrate is available as an open-source Python package.
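
The two-step idea in the abstract (transform molecule-level data to pathway-level scores, then relate those scores to an outcome) can be illustrated with a minimal sketch. This is not the PathIntegrate package or its API. The function names, toy data, and pathway definitions below are hypothetical, the pathway score is a simple mean z-score standing in for a single-sample pathway analysis method, and the ranking step is a plain group-mean difference standing in for the predictive single-view or multi-view model.

```python
from statistics import mean, pstdev


def zscore_columns(matrix):
    """Standardise each molecule (column) across samples (rows)."""
    scaled_cols = []
    for col in zip(*matrix):
        mu, sd = mean(col), pstdev(col)
        scaled_cols.append([(v - mu) / sd if sd > 0 else 0.0 for v in col])
    return [list(row) for row in zip(*scaled_cols)]


def pathway_scores(matrix, molecule_names, pathways):
    """Per-sample pathway score: mean z-score of the pathway's measured members."""
    z = zscore_columns(matrix)
    index = {name: i for i, name in enumerate(molecule_names)}
    scores = {}
    for pathway, members in pathways.items():
        cols = [index[m] for m in members if m in index]
        if not cols:
            continue  # no member of this pathway was measured
        scores[pathway] = [mean(row[c] for c in cols) for row in z]
    return scores


def rank_pathways(scores, outcome):
    """Rank pathways by absolute difference in mean score between outcome groups.

    Assumption: a stand-in for model-based importance, not the paper's method.
    """
    ranking = []
    for pathway, values in scores.items():
        cases = [v for v, y in zip(values, outcome) if y == 1]
        controls = [v for v, y in zip(values, outcome) if y == 0]
        ranking.append((pathway, abs(mean(cases) - mean(controls))))
    return sorted(ranking, key=lambda item: item[1], reverse=True)


# Hypothetical toy data: 4 samples, 3 metabolites (m1-m3) and 2 proteins (p1, p2),
# concatenated into one matrix so that pathways can span both omics layers.
molecule_names = ["m1", "m2", "m3", "p1", "p2"]
data = [
    [1.0, 2.0, 5.0, 0.5, 3.0],
    [1.2, 2.1, 4.0, 0.6, 2.0],
    [3.0, 4.0, 4.0, 1.5, 3.0],
    [3.2, 4.1, 5.0, 1.6, 2.0],
]
outcome = [0, 0, 1, 1]
pathways = {
    "P_energy": ["m1", "m2", "p1"],  # multi-omics pathway that tracks the outcome
    "P_other": ["m3", "p2"],         # multi-omics pathway unrelated to the outcome
}

scores = pathway_scores(data, molecule_names, pathways)
ranking = rank_pathways(scores, outcome)
for name, importance in ranking:
    print(f"{name}: {importance:.3f}")
```

Averaging over pathway members is what lets weak but consistent molecule-level shifts add up to a detectable pathway-level signal, which is the intuition behind the low signal-to-noise results described in the abstract.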


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f44c/10994553/14212068a730/pcbi.1011814.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f44c/10994553/37254d076842/pcbi.1011814.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f44c/10994553/ad9408d809df/pcbi.1011814.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f44c/10994553/873bb01d13b0/pcbi.1011814.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f44c/10994553/eb022bbe6a20/pcbi.1011814.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f44c/10994553/196a8aa6e0c3/pcbi.1011814.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f44c/10994553/117d8e77aed9/pcbi.1011814.g007.jpg

Similar articles

1
PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration.
PLoS Comput Biol. 2024 Mar 25;20(3):e1011814. doi: 10.1371/journal.pcbi.1011814. eCollection 2024 Mar.
2
PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration.
bioRxiv. 2024 Jan 9:2024.01.09.574780. doi: 10.1101/2024.01.09.574780.
3
Amogel: a multi-omics classification framework using associative graph neural networks with prior knowledge for biomarker identification.
BMC Bioinformatics. 2025 Mar 28;26(1):94. doi: 10.1186/s12859-025-06111-6.
4
scMFG: a single-cell multi-omics integration method based on feature grouping.
BMC Genomics. 2025 Feb 11;26(1):132. doi: 10.1186/s12864-025-11319-0.
5
DeepKEGG: a multi-omics data integration framework with biological insights for cancer recurrence prediction and biomarker discovery.
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae185.
6
A primer on correlation-based dimension reduction methods for multi-omics analysis.
J R Soc Interface. 2023 Oct;20(207):20230344. doi: 10.1098/rsif.2023.0344. Epub 2023 Oct 11.
7
A supervised Bayesian factor model for the identification of multi-omics signatures.
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae202.
8
timeOmics: an R package for longitudinal multi-omics data integration.
Bioinformatics. 2022 Jan 3;38(2):577-579. doi: 10.1093/bioinformatics/btab664.
9
GSTRPCA: irregular tensor singular value decomposition for single-cell multi-omics data clustering.
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae649.
10
Interpretation of network-based integration from multi-omics longitudinal data.
Nucleic Acids Res. 2022 Mar 21;50(5):e27. doi: 10.1093/nar/gkab1200.

Cited by

1
Pathway Analysis Interpretation in the Multi-Omic Era.
BioTech (Basel). 2025 Jul 29;14(3):58. doi: 10.3390/biotech14030058.
2
MAMSI: Integration of Multiassay Liquid Chromatography-Mass Spectrometry Metabolomics Data Using Multiview Machine Learning.
Anal Chem. 2025 Jul 22;97(28):15138-15145. doi: 10.1021/acs.analchem.5c01327. Epub 2025 Jul 10.
3
The Future of a Myriad of Accelerated Biodiscoveries Lies in AI-Powered Mass Spectrometry and Multiomics Integration.
J Mass Spectrom. 2025 Aug;60(8):e5157. doi: 10.1002/jms.5157.
4
Phoenics: a novel statistical approach for longitudinal metabolomic pathway analysis.
BMC Bioinformatics. 2025 Apr 16;26(1):105. doi: 10.1186/s12859-025-06118-z.
5
Omics Approaches in Understanding Insecticide Resistance in Mosquito Vectors.
Int J Mol Sci. 2025 Feb 21;26(5):1854. doi: 10.3390/ijms26051854.
6
Pathway level metabolomics analysis identifies carbon metabolism as a key factor of incident hypertension in the Estonian Biobank.
Sci Rep. 2025 Mar 12;15(1):8470. doi: 10.1038/s41598-025-92840-w.
7
PathX-CNN: An Enhanced Explainable Convolutional Neural Network for Survival Prediction and Pathway Analysis in Glioblastoma.
bioRxiv. 2025 Jan 27:2025.01.24.634827. doi: 10.1101/2025.01.24.634827.
8
Deciphering the molecular heterogeneity of intermediate- and (very-)high-risk non-muscle-invasive bladder cancer using multi-layered studies.
Front Oncol. 2024 Oct 21;14:1424293. doi: 10.3389/fonc.2024.1424293. eCollection 2024.
9
Synthetic data generation methods in healthcare: A review on open-source tools and methods.
Comput Struct Biotechnol J. 2024 Jul 9;23:2892-2910. doi: 10.1016/j.csbj.2024.07.005. eCollection 2024 Dec.

References

1
Missing data in multi-omics integration: Recent advances through artificial intelligence.
Front Artif Intell. 2023 Feb 9;6:1098308. doi: 10.3389/frai.2023.1098308. eCollection 2023.
2
Single sample pathway analysis in metabolomics: performance evaluation and application.
BMC Bioinformatics. 2022 Nov 14;23(1):481. doi: 10.1186/s12859-022-05005-1.
3
A comprehensive survey of the approaches for pathway analysis using multi-omics data integration.
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac435.
4
Integrated Machine Learning and Single-Sample Gene Set Enrichment Analysis Identifies a TGF-Beta Signaling Pathway Derived Score in Headneck Squamous Cell Carcinoma.
J Oncol. 2022 Sep 1;2022:3140263. doi: 10.1155/2022/3140263. eCollection 2022.
5
OmicsNet 2.0: a web-based platform for multi-omics integration and network visual analytics.
Nucleic Acids Res. 2022 Jul 5;50(W1):W527-W533. doi: 10.1093/nar/gkac376.
6
PaintOmics 4: new tools for the integrative analysis of multi-omics datasets supported by multiple pathway databases.
Nucleic Acids Res. 2022 Jul 5;50(W1):W551-W559. doi: 10.1093/nar/gkac352.
7
On the influence of several factors on pathway enrichment analysis.
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac143.
8
Unsupervised Multi-Omics Data Integration Methods: A Comprehensive Review.
Front Genet. 2022 Mar 22;13:854752. doi: 10.3389/fgene.2022.854752. eCollection 2022.
9
Programmed cell death: the pathways to severe COVID-19?
Biochem J. 2022 Mar 18;479(5):609-628. doi: 10.1042/BCJ20210602.
10
PathwayMultiomics: An R Package for Efficient Integrative Analysis of Multi-Omics Datasets With Matched or Un-matched Samples.
Front Genet. 2021 Dec 22;12:783713. doi: 10.3389/fgene.2021.783713. eCollection 2021.