School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, India.
Department of Physics, Chuo University, Tokyo, 112-8551, Japan.
Sci Rep. 2021 Apr 26;11(1):8909. doi: 10.1038/s41598-021-87779-7.
Although hypoxia is a critical factor that can drive the progression of various diseases, the mechanism underlying hypoxia itself remains unclear. Recently, m6A has been proposed as an important factor driving hypoxia. Despite successful analyses, potential genes were not selected with statistical significance but were selected based solely on fold changes. Because the number of genes is large while the number of samples is small, it was impossible to select genes using conventional feature selection methods with statistical significance. In this study, we applied the recently proposed principal component analysis (PCA), tensor decomposition (TD), and kernel tensor decomposition (KTD)-based unsupervised feature extraction (FE) to a hypoxia data set. We found that PCA, TD, and KTD-based unsupervised FE could successfully identify a limited number of genes associated with altered gene expression and m6A profiles, as well as the enrichment of hypoxia-related biological terms, with improved statistical significance.
尽管缺氧是驱动各种疾病进展的关键因素,但缺氧本身的机制尚不清楚。最近,m6A 被认为是驱动缺氧的一个重要因素。尽管分析取得了成功,但潜在的基因没有以统计学意义选择,而是仅根据倍数变化选择。由于基因数量大而样本数量小,因此不可能使用具有统计学意义的常规特征选择方法选择基因。在这项研究中,我们将最近提出的主成分分析 (PCA)、张量分解 (TD) 和基于核张量分解 (KTD) 的无监督特征提取 (FE) 应用于缺氧数据集。我们发现,基于 PCA、TD 和 KTD 的无监督 FE 可以成功识别与改变基因表达和 m6A 图谱相关的少数基因,以及与缺氧相关的生物学术语的富集,并且具有提高的统计学意义。