Graduate School of Information Science and Technology, Hokkaido University, Kita 14, Nishi 9, Kita-ku, Sapporo, Hokkaido 060-0814, Japan.
Faculty of Information Science and Technology, Hokkaido University, Kita 14, Nishi 9, Kita-ku, Sapporo, Hokkaido 060-0814, Japan.
Neural Netw. 2021 Nov;143:500-514. doi: 10.1016/j.neunet.2021.06.024. Epub 2021 Jul 1.
Random feature maps are a promising tool for large-scale kernel methods. However, most random feature maps generate dense random features, causing memory explosion, so it is hard to apply them to very-large-scale sparse datasets. Factorization machines and related models, which use feature combinations efficiently, scale well to large-scale sparse datasets and have been used in many applications. However, their optimization problems are typically non-convex. Therefore, although they are optimized by gradient-based iterative methods, such methods cannot find globally optimal solutions in general and require a large number of iterations to converge. In this paper, we define the item-multiset kernel, which is a generalization of the itemset kernel and dot product kernels. Unfortunately, random feature maps for the itemset kernel and dot product kernels cannot approximate the item-multiset kernel. We therefore develop a method that converts an item-multiset kernel into an itemset kernel, enabling the item-multiset kernel to be approximated by a random feature map for the itemset kernel. We propose two random feature maps for the itemset kernel that run faster and are more memory efficient than the existing feature map for the itemset kernel. They also generate sparse random features when the original (input) feature vector is sparse, and thus linear models using the proposed maps are memory efficient. Experiments on real-world datasets demonstrated the effectiveness of the proposed methodology: linear models using the proposed random feature maps ran 10 to 100 times faster than those based on existing methods.
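To illustrate the general idea behind random feature maps (not the maps proposed in this paper), the following is a minimal sketch of classic random Fourier features (Rahimi and Recht) for the RBF kernel. It shows a randomized map z(·) with z(x)·z(y) ≈ k(x, y), and why dense random features are costly on sparse data: the output z(x) is dense even when x has very few nonzeros. All dimensions and parameter values are illustrative assumptions.

```python
import numpy as np

# Illustrative sketch only: random Fourier features for the RBF kernel
# k(x, y) = exp(-gamma * ||x - y||^2). This is NOT the paper's proposed
# itemset-kernel feature map; it demonstrates the general technique and
# the density problem the paper addresses.

rng = np.random.default_rng(0)
d, D = 1000, 2000            # input dimension, number of random features
gamma = 0.1                  # RBF bandwidth (assumed for illustration)

# Random projection and phase, fixed once for the whole dataset.
W = rng.normal(scale=np.sqrt(2 * gamma), size=(D, d))
b = rng.uniform(0.0, 2.0 * np.pi, size=D)

def feature_map(x):
    # Returns a dense D-dimensional feature vector, regardless of how
    # sparse x is -- this is the memory-explosion issue for sparse data.
    return np.sqrt(2.0 / D) * np.cos(W @ x + b)

# Sparse inputs: only 10 of 1000 coordinates are nonzero.
x = np.zeros(d); x[rng.choice(d, 10, replace=False)] = 1.0
y = np.zeros(d); y[rng.choice(d, 10, replace=False)] = 1.0

approx = feature_map(x) @ feature_map(y)     # inner product of features
exact = np.exp(-gamma * np.sum((x - y) ** 2))  # true kernel value
print(approx, exact)  # the two values should be close for large D
```

The approximation error shrinks at roughly O(1/sqrt(D)), so accurate approximation requires a large D, and every transformed example occupies D dense entries; maps that preserve input sparsity, as proposed in this paper, avoid that cost.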