Zhou Shaohua Kevin, Chellappa Rama
Integrated Data Systems Department, Siemens Corporate Research, 755 College Road East, Princeton, NJ 08540, USA.
IEEE Trans Pattern Anal Mach Intell. 2006 Jun;28(6):917-29. doi: 10.1109/TPAMI.2006.120.
This paper addresses the problem of characterizing ensemble similarity from sample similarity in a principled manner. Using reproducing kernel as a characterization of sample similarity, we suggest a probabilistic distance measure in the reproducing kernel Hilbert space (RKHS) as the ensemble similarity. Assuming normality in the RKHS, we derive analytic expressions for probabilistic distance measures that are commonly used in many applications, such as Chernoff distance (or the Bhattacharyya distance as its special case), Kullback-Leibler divergence, etc. Since the reproducing kernel implicitly embeds a nonlinear mapping, our approach presents a new way to study these distances whose feasibility and efficiency is demonstrated using experiments with synthetic and real examples. Further, we extend the ensemble similarity to the reproducing kernel for ensemble and study the ensemble similarity for more general data representations.
本文以一种有原则的方式解决了从样本相似性表征总体相似性的问题。使用再生核作为样本相似性的一种表征,我们提出在再生核希尔伯特空间(RKHS)中使用概率距离度量作为总体相似性。假设RKHS中的正态性,我们推导了许多应用中常用的概率距离度量的解析表达式,如切尔诺夫距离(或其特殊情况巴塔恰里亚距离)、库尔贝克 - 莱布勒散度等。由于再生核隐含地嵌入了一个非线性映射,我们的方法提出了一种研究这些距离的新方法,通过合成和实际示例实验证明了其可行性和效率。此外,我们将总体相似性扩展到总体的再生核,并研究更一般数据表示的总体相似性。