Song Huan, J Thiagarajan Jayaraman, Sattigeri Prasanna, Spanias Andreas
IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5528-5540. doi: 10.1109/TNNLS.2018.2804895. Epub 2018 Mar 6.
Building highly nonlinear and nonparametric models is central to several state-of-the-art machine learning systems. Kernel methods form an important class of techniques that induce a reproducing kernel Hilbert space (RKHS) for inferring non-linear models through the construction of similarity functions from data. These methods are particularly preferred in cases where the training data sizes are limited and when prior knowledge of the data similarities is available. Despite their usefulness, they are limited by the computational complexity and their inability to support end-to-end learning with a task-specific objective. On the other hand, deep neural networks have become the de facto solution for end-to-end inference in several learning paradigms. In this paper, we explore the idea of using deep architectures to perform kernel machine optimization, for both computational efficiency and end-to-end inferencing. To this end, we develop the deep kernel machine optimization framework, that creates an ensemble of dense embeddings using Nyström kernel approximations and utilizes deep learning to generate task-specific representations through the fusion of the embeddings. Intuitively, the filters of the network are trained to fuse information from an ensemble of linear subspaces in the RKHS. Furthermore, we introduce the kernel dropout regularization to enable improved training convergence. Finally, we extend this framework to the multiple kernel case, by coupling a global fusion layer with pretrained deep kernel machines for each of the constituent kernels. Using case studies with limited training data, and lack of explicit feature sources, we demonstrate the effectiveness of our framework over conventional model inferencing techniques.
构建高度非线性和非参数模型是多个先进机器学习系统的核心。核方法构成了一类重要的技术,它通过从数据构建相似性函数来诱导一个再生核希尔伯特空间(RKHS)以推断非线性模型。在训练数据规模有限以及可获得数据相似性的先验知识的情况下,这些方法尤其受到青睐。尽管它们很有用,但受到计算复杂性的限制,并且无法支持具有特定任务目标的端到端学习。另一方面,深度神经网络已成为多种学习范式中端到端推理的实际解决方案。在本文中,为了提高计算效率和实现端到端推理,我们探索使用深度架构来执行核机器优化的想法。为此,我们开发了深度核机器优化框架,该框架使用Nyström核近似创建密集嵌入的集合,并通过嵌入融合利用深度学习生成特定任务的表示。直观地说,网络的滤波器经过训练以融合RKHS中线性子空间集合的信息。此外,我们引入核随机失活正则化以实现更好的训练收敛。最后,我们通过将全局融合层与每个组成核的预训练深度核机器相结合,将此框架扩展到多核情况。通过使用训练数据有限且缺乏明确特征源的案例研究,我们证明了我们的框架相对于传统模型推理技术的有效性。