Belanche-Muñoz Lluís A, Wiejacha Małgorzata
Department of Computer Science, Universitat Politècnica de Catalunya, 08034 Barcelona, Catalonia, Spain.
Cien.ai, Ronda Carrer de Sagués, 45, 08021 Barcelona, Catalonia, Spain.
Entropy (Basel). 2023 Jan 12;25(1):154. doi: 10.3390/e25010154.
Kernel methods have played a major role in the last two decades in the modeling and visualization of complex problems in data science. The choice of kernel function remains an open research area and the reasons why some kernels perform better than others are not yet understood. Moreover, the high computational costs of kernel-based methods make it extremely inefficient to use standard model selection methods, such as cross-validation, creating a need for careful kernel design and parameter choice. These reasons justify the prior analyses of kernel matrices, i.e., mathematical objects generated by the kernel functions. This paper explores these topics from an entropic standpoint for the case of kernelized relevance vector machines (RVMs), pinpointing desirable properties of kernel matrices that increase the likelihood of obtaining good model performances in terms of generalization power, as well as relate these properties to the model's fitting ability. We also derive a heuristic for achieving close-to-optimal modeling results while keeping the computational costs low, thus providing a recipe for efficient analysis when processing resources are limited.
在过去二十年中,核方法在数据科学复杂问题的建模和可视化方面发挥了重要作用。核函数的选择仍然是一个开放的研究领域,一些核函数比其他核函数表现更好的原因尚未明确。此外,基于核的方法计算成本高昂,使得使用标准模型选择方法(如交叉验证)效率极低,因此需要精心设计核函数并选择参数。这些原因证明了对核矩阵进行先验分析的合理性,核矩阵即由核函数生成的数学对象。本文从熵的角度探讨这些主题,以核相关向量机(RVM)为例,明确核矩阵的理想属性,这些属性可提高模型在泛化能力方面获得良好性能的可能性,并将这些属性与模型的拟合能力联系起来。我们还推导了一种启发式方法,在保持计算成本较低的同时实现接近最优的建模结果,从而为处理资源有限时的高效分析提供了方法。