Doubly Nonparametric Sparse Nonnegative Matrix Factorization Based on Dependent Indian Buffet Processes.

Publication information

IEEE Trans Neural Netw Learn Syst. 2018 May;29(5):1835-1849. doi: 10.1109/TNNLS.2017.2676817. Epub 2017 Apr 11.

DOI:10.1109/TNNLS.2017.2676817

Abstract

Sparse nonnegative matrix factorization (SNMF) aims to factorize a data matrix into two optimized nonnegative sparse factor matrices, which can benefit many tasks, such as document-word co-clustering. However, traditional SNMF typically assumes that the number of latent factors (i.e., the dimensionality of the factor matrices) is fixed, which makes it inflexible in practice. In this paper, we propose a doubly sparse nonparametric NMF framework that mitigates this issue by using dependent Indian buffet processes (dIBP). We apply a correlation function to the generation of the two stick weights associated with each column pair of the factor matrices, while still maintaining their respective marginal distributions as specified by the IBP. As a consequence, the generation of the two factor matrices is columnwise correlated. Under this framework, two classes of correlation function are proposed: 1) a bivariate Beta distribution and 2) a copula function. Compared with single IBP-based NMF, the proposed framework jointly makes the two factor matrices nonparametric and sparse, so it can be applied to broader scenarios, such as co-clustering. It is also much more flexible than Gaussian process-based and hierarchical Beta process-based dIBPs, in that it allows the two corresponding binary matrix columns to have greater variations in their nonzero entries. Our experiments on synthetic data show the merits of the proposed framework compared with state-of-the-art models with respect to factorization efficiency, sparsity, and flexibility. Experiments on real-world data sets demonstrate its efficiency in document-word co-clustering tasks.
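The idea of correlated stick weights with unchanged IBP marginals can be illustrated with a minimal sketch. This is not the authors' construction or inference procedure; it assumes the standard truncated stick-breaking representation of the IBP (each stick fraction drawn from Beta(alpha, 1), with feature probabilities given by the running product) and couples the two fractions of each column pair through a Gaussian copula. The function names, the truncation level, and the correlation parameter `rho` are all hypothetical choices made for illustration.

```python
import math
import random


def normal_cdf(z):
    """Standard normal CDF, used to map Gaussian draws to uniforms."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def gaussian_copula_pair(rho, rng):
    """Draw (u, v) with Uniform(0, 1) marginals coupled by a Gaussian copula."""
    z1 = rng.gauss(0.0, 1.0)
    z2 = rho * z1 + math.sqrt(1.0 - rho * rho) * rng.gauss(0.0, 1.0)
    return normal_cdf(z1), normal_cdf(z2)


def sample_dependent_sticks(alpha_a, alpha_b, rho, truncation, rng):
    """Return two truncated stick-weight sequences, correlated column by column."""
    pi_a, pi_b = [], []
    prod_a = prod_b = 1.0
    for _ in range(truncation):
        u, v = gaussian_copula_pair(rho, rng)
        # The inverse CDF of Beta(alpha, 1) is u ** (1 / alpha), so each
        # fraction keeps the Beta(alpha, 1) marginal of IBP stick-breaking.
        prod_a *= u ** (1.0 / alpha_a)
        prod_b *= v ** (1.0 / alpha_b)
        pi_a.append(prod_a)
        pi_b.append(prod_b)
    return pi_a, pi_b


def sample_binary_matrix(n_rows, pi, rng):
    """Sample a binary matrix whose entry [n][k] is Bernoulli(pi[k])."""
    return [[1 if rng.random() < p else 0 for p in pi] for _ in range(n_rows)]


# Hypothetical usage: 5 documents, 7 words, at most 10 shared latent factors.
rng = random.Random(0)
pi_a, pi_b = sample_dependent_sticks(2.0, 3.0, 0.8, 10, rng)
Z_a = sample_binary_matrix(5, pi_a, rng)
Z_b = sample_binary_matrix(7, pi_b, rng)
```

In this sketch, setting `rho` to 0 gives two independent IBP draws, while values near 1 make a factor that is active in many rows of one matrix likely to be active in many rows of the other. In an NMF setting, the binary matrices would act as sparsity masks on nonnegative factor values, which are omitted here.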

Similar articles

Doubly Nonparametric Sparse Nonnegative Matrix Factorization Based on Dependent Indian Buffet Processes.

IEEE Trans Neural Netw Learn Syst. 2018 May;29(5):1835-1849. doi: 10.1109/TNNLS.2017.2676817. Epub 2017 Apr 11.

A Fast Gradient Method for Nonnegative Sparse Regression With Self-Dictionary.

IEEE Trans Image Process. 2018;27(1):24-37. doi: 10.1109/TIP.2017.2753400.

Hessian regularization based symmetric nonnegative matrix factorization for clustering gene expression and microbiome data.

Methods. 2016 Dec 1;111:80-84. doi: 10.1016/j.ymeth.2016.06.017. Epub 2016 Jun 20.

Pairwise Constraint Propagation-Induced Symmetric Nonnegative Matrix Factorization.

IEEE Trans Neural Netw Learn Syst. 2018 Dec;29(12):6348-6361. doi: 10.1109/TNNLS.2018.2830761. Epub 2018 May 18.

Convex nonnegative matrix factorization with manifold regularization.

Neural Netw. 2015 Mar;63:94-103. doi: 10.1016/j.neunet.2014.11.007. Epub 2014 Dec 4.

Variational Bayesian Matrix Factorization for Bounded Support Data.

IEEE Trans Pattern Anal Mach Intell. 2015 Apr;37(4):876-89. doi: 10.1109/TPAMI.2014.2353639.

Bicriteria Sparse Nonnegative Matrix Factorization via Two-Timescale Duplex Neurodynamic Optimization.

IEEE Trans Neural Netw Learn Syst. 2023 Aug;34(8):4881-4891. doi: 10.1109/TNNLS.2021.3125457. Epub 2023 Aug 4.

Large-Cone Nonnegative Matrix Factorization.

IEEE Trans Neural Netw Learn Syst. 2017 Sep;28(9):2129-2142. doi: 10.1109/TNNLS.2016.2574748. Epub 2016 Jun 15.

Generalized Separable Nonnegative Matrix Factorization.

IEEE Trans Pattern Anal Mach Intell. 2021 May;43(5):1546-1561. doi: 10.1109/TPAMI.2019.2956046. Epub 2021 Apr 1.

Symmetric nonnegative matrix factorization: algorithms and applications to probabilistic clustering.

IEEE Trans Neural Netw. 2011 Dec;22(12):2117-31. doi: 10.1109/TNN.2011.2172457. Epub 2011 Oct 26.

PMID:28422690
