Dimensionality reduction of longitudinal 'omics data using modern tensor factorizations.

Affiliations

Systems Immunology Department, Weizmann Institute of Science, Rehovot, Israel.

School of Mathematical Sciences, Tel Aviv University, Tel Aviv, Israel.

Publication information

PLoS Comput Biol. 2022 Jul 15;18(7):e1010212. doi: 10.1371/journal.pcbi.1010212. eCollection 2022 Jul.

DOI:10.1371/journal.pcbi.1010212

PMID:35839259

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9328521/

Abstract

Longitudinal 'omics analytical methods are extensively used in the evolving field of precision medicine. They enable 'big data' recording and high-resolution interpretation of complex datasets, driven by individual variations in response to perturbations such as disease pathogenesis, medical treatment or changes in lifestyle. However, inherent technical limitations in biomedical studies often result in feature-rich, sample-limited datasets. Analyzing such data with conventional methods is often challenging, since the repeated, high-dimensional measurements bury the analysis in inconsequential variation that must be filtered out in order to find the true, biologically relevant signal. Tensor methods for the analysis and meaningful representation of multiway data may prove useful to the biological research community because of their advertised ability to tackle this challenge. In this study, we present tcam, a new unsupervised tensor factorization method for the analysis of multiway data. Building on cutting-edge developments in the field of tensor-tensor algebra, we characterize the unique mathematical properties of our method, namely: 1) preservation of geometric and statistical traits of the data, which enables uncovering information beyond the inter-individual variation that often takes over the focus, especially in human studies; and 2) a natural and straightforward out-of-sample extension, making tcam amenable to integration in machine learning workflows. A series of re-analyses of real-world human experimental datasets showcases these theoretical properties, while providing empirical confirmation of tcam's utility in the analysis of longitudinal 'omics data.
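The abstract names two properties (preservation of geometry and an out-of-sample extension) without spelling out the algorithm. The sketch below is a rough illustration of how a transform-domain tensor factorization can have both. It is not the authors' implementation and not the API of any tcam package. It assumes a data tensor of shape (subjects, features, timepoints), mean-centering across subjects, an orthonormal DCT along the time mode, and a per-slice SVD whose components are ranked globally by singular value. The names `dct_matrix` and `TensorPCASketch` are hypothetical.

```python
import numpy as np


def dct_matrix(t):
    """Orthonormal DCT-II matrix of size t x t (assumed time-mode transform)."""
    k = np.arange(t)[:, None]
    n = np.arange(t)[None, :]
    M = np.sqrt(2.0 / t) * np.cos(np.pi * (2 * n + 1) * k / (2 * t))
    M[0] /= np.sqrt(2.0)
    return M


class TensorPCASketch:
    """Illustrative transform-domain factorization; hypothetical, not the tcam API."""

    def __init__(self, n_components):
        self.n_components = n_components

    def fit(self, X):
        X = np.asarray(X, dtype=float)
        # Center across subjects so components describe variation, not the mean.
        self.mean_ = X.mean(axis=0, keepdims=True)
        self.M_ = dct_matrix(X.shape[2])
        # Apply the orthonormal transform along the time mode.
        Xh = (X - self.mean_) @ self.M_.T
        candidates = []
        for k in range(Xh.shape[2]):
            # SVD of each (subjects x features) slice in the transform domain.
            _, s, Vt = np.linalg.svd(Xh[:, :, k], full_matrices=False)
            candidates += [(s_j, k, v) for s_j, v in zip(s, Vt)]
        # Rank components from all slices together by singular value.
        candidates.sort(key=lambda c: c[0], reverse=True)
        self.components_ = candidates[: self.n_components]
        return self

    def transform(self, X):
        # Out-of-sample extension: reuse the training mean, transform and loadings.
        Xh = (np.asarray(X, dtype=float) - self.mean_) @ self.M_.T
        return np.column_stack([Xh[:, :, k] @ v for _, k, v in self.components_])


# Synthetic data for illustration only.
rng = np.random.default_rng(0)
X_train = rng.normal(size=(6, 10, 4))  # 6 subjects, 10 features, 4 timepoints
X_new = rng.normal(size=(2, 10, 4))    # 2 unseen subjects

model = TensorPCASketch(n_components=3).fit(X_train)
scores_train = model.transform(X_train)  # shape (6, 3)
scores_new = model.transform(X_new)      # shape (2, 3)
```

Because every step here is an orthonormal transform followed by projection onto orthonormal loadings, keeping all components leaves the pairwise distances between training subjects unchanged, and unseen subjects are scored by the same linear map. This is one way to read the two properties listed in the abstract.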

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/448f/9328521/e583dbe3f1ee/pcbi.1010212.g001.jpg

Similar articles

Dimensionality reduction of longitudinal 'omics data using modern tensor factorizations.

PLoS Comput Biol. 2022 Jul 15;18(7):e1010212. doi: 10.1371/journal.pcbi.1010212. eCollection 2022 Jul.

Tensor Factorization for Precision Medicine in Heart Failure with Preserved Ejection Fraction.

J Cardiovasc Transl Res. 2017 Jun;10(3):305-312. doi: 10.1007/s12265-016-9727-8. Epub 2017 Jan 23.

Tensor factorization toward precision medicine.

Brief Bioinform. 2017 May 1;18(3):511-514. doi: 10.1093/bib/bbw026.

Revisit of Machine Learning Supported Biological and Biomedical Studies.

Methods Mol Biol. 2018;1754:183-204. doi: 10.1007/978-1-4939-7717-8_11.

Explainable biology for improved therapies in precision medicine: AI is not enough.

Best Pract Res Clin Rheumatol. 2024 Dec;38(4):102006. doi: 10.1016/j.berh.2024.102006. Epub 2024 Sep 26.

Detecting time-evolving phenotypic topics via tensor factorization on electronic health records: Cardiovascular disease case study.

J Biomed Inform. 2019 Oct;98:103270. doi: 10.1016/j.jbi.2019.103270. Epub 2019 Aug 22.

Multi-omics data integration approaches for precision oncology.

Mol Omics. 2022 Jul 11;18(6):469-479. doi: 10.1039/d1mo00411e.

Integrate multi-omics data with biological interaction networks using Multi-view Factorization AutoEncoder (MAE).

BMC Genomics. 2019 Dec 20;20(Suppl 11):944. doi: 10.1186/s12864-019-6285-x.

Integrative Analysis of Omics Big Data.

Methods Mol Biol. 2018;1754:109-135. doi: 10.1007/978-1-4939-7717-8_7.

Tensor-tensor algebra for optimal representation and compression of multiway data.

Proc Natl Acad Sci U S A. 2021 Jul 13;118(28). doi: 10.1073/pnas.2015851118.

Cited by

LorDist: a novel method for calculating the distance based on functional data analysis with application to longitudinal microbial data.

Microbiol Spectr. 2025 Aug 5;13(8):e0154225. doi: 10.1128/spectrum.01542-25. Epub 2025 Jul 11.

Targeting CD38 immunometabolic checkpoint improves metabolic fitness and cognition in a mouse model of Alzheimer's disease.

Nat Commun. 2025 Apr 20;16(1):3736. doi: 10.1038/s41467-025-58494-y.

TEMPTED: time-informed dimensionality reduction for longitudinal microbiome studies.

Genome Biol. 2024 Dec 19;25(1):317. doi: 10.1186/s13059-024-03453-x.

MeTEor: an R Shiny app for exploring longitudinal metabolomics data.

Bioinform Adv. 2024 Nov 14;4(1):vbae178. doi: 10.1093/bioadv/vbae178. eCollection 2024.

Bioinformatics approaches for studying molecular sex differences in complex diseases.

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae499.

Recent advances in precision nutrition and cardiometabolic diseases.

Rev Esp Cardiol (Engl Ed). 2025 Mar;78(3):263-271. doi: 10.1016/j.rec.2024.09.003. Epub 2024 Sep 30.

The structure is the message: Preserving experimental context through tensor decomposition.

Cell Syst. 2024 Aug 21;15(8):679-693. doi: 10.1016/j.cels.2024.07.004.

Maternal antibiotic prophylaxis during cesarean section has a limited impact on the infant gut microbiome.

Cell Host Microbe. 2024 Aug 14;32(8):1444-1454.e6. doi: 10.1016/j.chom.2024.07.010.

Longitudinal single-cell data informs deterministic modelling of inflammatory bowel disease.

NPJ Syst Biol Appl. 2024 Jun 24;10(1):69. doi: 10.1038/s41540-024-00395-9.

Stable tensor neural networks for efficient deep learning.

Front Big Data. 2024 May 30;7:1363978. doi: 10.3389/fdata.2024.1363978. eCollection 2024.

References

Machine learning in clinical decision making.

Med. 2021 Jun 11;2(6):642-665. doi: 10.1016/j.medj.2021.04.006. Epub 2021 Apr 30.

Low-Rank High-Order Tensor Completion With Applications in Visual Data.

IEEE Trans Image Process. 2022;31:2433-2448. doi: 10.1109/TIP.2022.3155949. Epub 2022 Mar 15.

Tensor-tensor algebra for optimal representation and compression of multiway data.

Proc Natl Acad Sci U S A. 2021 Jul 13;118(28). doi: 10.1073/pnas.2015851118.

Evaluating microbiome-directed fibre snacks in gnotobiotic mice and humans.

Nature. 2021 Jul;595(7865):91-95. doi: 10.1038/s41586-021-03671-4. Epub 2021 Jun 23.

Deep longitudinal multiomics profiling reveals two biological seasonal patterns in California.

Nat Commun. 2020 Oct 1;11(1):4933. doi: 10.1038/s41467-020-18758-1.

Context-aware dimensionality reduction deconvolutes gut microbial community dynamics.

Nat Biotechnol. 2021 Feb;39(2):165-168. doi: 10.1038/s41587-020-0660-7. Epub 2020 Aug 31.

Avocado: a multi-scale deep tensor factorization method learns a latent representation of the human epigenome.

Genome Biol. 2020 Mar 30;21(1):81. doi: 10.1186/s13059-020-01977-6.

Precision Microbiome Modulation with Discrete Dietary Fiber Structures Directs Short-Chain Fatty Acid Production.

Cell Host Microbe. 2020 Mar 11;27(3):389-404.e6. doi: 10.1016/j.chom.2020.01.006. Epub 2020 Jan 30.

A longitudinal big data approach for precision health.

Nat Med. 2019 May;25(5):792-804. doi: 10.1038/s41591-019-0414-6. Epub 2019 May 8.

A Novel Sparse Compositional Technique Reveals Microbial Perturbations.

mSystems. 2019 Feb 12;4(1). doi: 10.1128/mSystems.00016-19. eCollection 2019 Jan-Feb.
