School of Information Science and Engineering, Central South University, Changsha 410083, China.
School of Information Science and Engineering, Qufu Normal University, Rizhao 276826, China.
Molecules. 2017 Dec 2;22(12):2131. doi: 10.3390/molecules22122131.
Detecting genomes with similar expression patterns using clustering techniques plays an important role in gene expression data analysis. Non-negative matrix factorization (NMF) is an effective method for clustering the analysis of gene expression data. However, the NMF-based method is performed within the Euclidean space, and it is usually inappropriate for revealing the intrinsic geometric structure of data space. In order to overcome this shortcoming, Cai et al. proposed a novel algorithm, called graph regularized non-negative matrices factorization (GNMF). Motivated by the topological structure of the GNMF-based method, we propose improved graph regularized non-negative matrix factorization (GNMF) to facilitate the display of geometric structure of data space. Robust manifold non-negative matrix factorization (RM-GNMF) is designed for cancer gene clustering, leading to an enhancement of the GNMF-based algorithm in terms of robustness. We combine the l 2 , 1 -norm NMF with spectral clustering to conduct the wide-ranging experiments on the three known datasets. Clustering results indicate that the proposed method outperforms the previous methods, which displays the latest application of the RM-GNMF-based method in cancer gene clustering.
使用聚类技术检测具有相似表达模式的基因组在基因表达数据分析中起着重要作用。非负矩阵分解 (NMF) 是分析基因表达数据的聚类的有效方法。然而,基于 NMF 的方法是在欧几里得空间中执行的,通常不适合揭示数据空间的内在几何结构。为了克服这一缺点,Cai 等人提出了一种新算法,称为图正则化非负矩阵分解 (GNMF)。受基于 GNMF 的方法的拓扑结构的启发,我们提出了改进的图正则化非负矩阵分解 (GNMF),以促进数据空间的几何结构的显示。稳健流形非负矩阵分解 (RM-GNMF) 被设计用于癌症基因聚类,从而提高了基于 GNMF 的算法的稳健性。我们将 l 2 , 1 -范数 NMF 与谱聚类相结合,在三个已知数据集上进行了广泛的实验。聚类结果表明,所提出的方法优于以前的方法,这展示了 RM-GNMF 方法在癌症基因聚类中的最新应用。