Zhang Jiaming, Ning Hanwen, Jing Xingjian, Tian Tianhai
IEEE Trans Neural Netw Learn Syst. 2021 May;32(5):1920-1934. doi: 10.1109/TNNLS.2020.2995482. Epub 2021 May 3.
Online learning methods are designed to establish timely predictive models for machine learning problems. The methods for online learning of nonlinear systems are usually developed in the reproducing kernel Hilbert space (RKHS) associated with Gaussian kernel in which the kernel bandwidth is manually selected and remains steady during the entire modeling process in most cases. This setting may make the learning model rigid and inappropriate for complex data streams. Since the bandwidth appears in a nonlinear term of the kernel model, it raises substantial challenges in the development of learning methods with an adaptive bandwidth. In this article, we propose a novel approach to address this important open issue. By a carefully casted linearization scheme, the nonlinear learning problem is reasonably transformed into a state feedback control problem for a series of controllable systems. Then, by employing optimal control techniques, an effective algorithm is developed, and the parameters in the learning model including kernel bandwidth can be efficiently updated in a real-time manner. By taking advantage of the particular structure of the Gaussian kernel model, a theoretical analysis on the convergence and rationality of the proposed method is also provided. Compared with the kernel algorithms with a fixed bandwidth, our novel learning framework can not only achieve adaptive learning results with a better prediction accuracy but also show performance that is more robust with a faster convergence speed. Encouraging numerical results are provided to demonstrate the advantages of our new method.
在线学习方法旨在为机器学习问题建立及时的预测模型。非线性系统的在线学习方法通常是在与高斯核相关的再生核希尔伯特空间(RKHS)中开发的,其中核带宽是手动选择的,并且在大多数情况下在整个建模过程中保持稳定。这种设置可能会使学习模型变得僵化,不适用于复杂的数据流。由于带宽出现在核模型的非线性项中,这给具有自适应带宽的学习方法的开发带来了重大挑战。在本文中,我们提出了一种新颖的方法来解决这个重要的开放问题。通过精心设计的线性化方案,将非线性学习问题合理地转化为一系列可控系统的状态反馈控制问题。然后,通过采用最优控制技术,开发了一种有效的算法,并且可以实时有效地更新学习模型中的参数,包括核带宽。利用高斯核模型的特殊结构,还对所提出方法的收敛性和合理性进行了理论分析。与具有固定带宽的核算法相比,我们新颖的学习框架不仅可以实现具有更好预测精度的自适应学习结果,而且还表现出更强大的性能和更快的收敛速度。提供了令人鼓舞的数值结果来证明我们新方法的优势。