Yamamoto Hiroyuki, Nakayama Yasumune, Tsugawa Hiroshi
Human Metabolome Technologies, Inc., 246-2 Mizukami, Kakuganji, Tsuruoka, Yamagata 997-0052, Japan.
Department of Applied Microbial Technology, Sojo University, 4-22-1 Ikeda, Kumamoto 860-0082, Japan.
Metabolites. 2021 Mar 5;11(3):149. doi: 10.3390/metabo11030149.
Principal component analysis (PCA) has been widely used in metabolomics. However, it is not always possible to detect phenotype-associated principal component (PC) scores. Previously, we proposed a smoothed PCA for samples acquired with a time course or rank order, but hypothesis testing to select significant metabolite candidates was not possible. Here, we modified the smoothed PCA as an orthogonal smoothed PCA (OS-PCA) so that statistical hypothesis testing in OS-PC loadings could be performed with the same PC projections provided by the smoothed PCA. Statistical hypothesis testing is especially useful in metabolomics because biological interpretations are made based on statistically significant metabolites. We applied the OS-PCA method to two real metabolome datasets, one for metabolic turnover analysis and the other for evaluating the taste of Japanese green tea. The OS-PCA successfully extracted similar PC scores as the smoothed PCA; these scores reflected the expected phenotypes. The significant metabolites that were selected using statistical hypothesis testing of OS-PC loading facilitated biological interpretations that were consistent with the results of our previous study. Our results suggest that OS-PCA combined with statistical hypothesis testing of OS-PC loading is a useful method for the analysis of metabolome data.
主成分分析(PCA)已在代谢组学中广泛应用。然而,检测与表型相关的主成分(PC)得分并非总是可行的。此前,我们针对按时间进程或秩次获取的样本提出了一种平滑主成分分析方法,但无法进行假设检验以选择显著的代谢物候选物。在此,我们将平滑主成分分析修改为正交平滑主成分分析(OS - PCA),以便能在与平滑主成分分析提供的相同PC投影下,对OS - PC载荷进行统计假设检验。统计假设检验在代谢组学中特别有用,因为生物学解释是基于具有统计学显著性的代谢物得出的。我们将OS - PCA方法应用于两个真实的代谢组数据集,一个用于代谢周转分析,另一个用于评估日本绿茶的口感。OS - PCA成功提取了与平滑主成分分析相似的PC得分;这些得分反映了预期的表型。通过对OS - PC载荷进行统计假设检验所选择的显著代谢物有助于得出与我们先前研究结果一致的生物学解释。我们的结果表明,OS - PCA结合对OS - PC载荷的统计假设检验是一种用于分析代谢组数据的有用方法。