Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan.
Department of Neurosurgery, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.
PLoS One. 2023 Feb 15;18(2):e0281594. doi: 10.1371/journal.pone.0281594. eCollection 2023.
High-throughput omics technologies have enabled the profiling of entire biological systems. For the biological interpretation of such omics data, two analyses, hypothesis- and data-driven analyses including tensor decomposition, have been used. Both analyses have their own advantages and disadvantages and are mutually complementary; however, a direct comparison of these two analyses for omics data is poorly examined.We applied tensor decomposition (TD) to a dataset representing changes in the concentrations of 562 blood molecules at 14 time points in 20 healthy human subjects after ingestion of 75 g oral glucose. We characterized each molecule by individual dependence (constant or variable) and time dependence (later peak or early peak). Three of the four features extracted by TD were characterized by our previous hypothesis-driven study, indicating that TD can extract some of the same features obtained by hypothesis-driven analysis in a non-biased manner. In contrast to the years taken for our previous hypothesis-driven analysis, the data-driven analysis in this study took days, indicating that TD can extract biological features in a non-biased manner without the time-consuming process of hypothesis generation.
高通量组学技术已经能够对整个生物系统进行分析。为了对这些组学数据进行生物学解释,已经使用了两种分析方法,即基于假设和基于数据的分析,包括张量分解。这两种分析方法各有优缺点,相互补充;然而,对这两种分析方法在组学数据上的直接比较研究甚少。我们将张量分解(TD)应用于一组数据集,该数据集代表了 20 名健康人体受试者在摄入 75 克口服葡萄糖后 14 个时间点的 562 种血液分子浓度的变化。我们通过个体依赖性(恒定或可变)和时间依赖性(后期峰值或早期峰值)来描述每个分子。TD 提取的四个特征中的三个特征与我们之前的基于假设的研究相吻合,这表明 TD 可以以无偏的方式提取一些与基于假设的分析相同的特征。与我们之前基于假设的分析所花费的数年时间相比,本研究中的基于数据的分析仅花费了数天时间,这表明 TD 可以以无偏的方式提取生物学特征,而无需进行假设生成这一耗时的过程。