Wei Jie
Department of Computer Science, City College of New York, New York, New York 10031, USA.
Int J Image Graph. 2014 Oct;14(4):1450016. doi: 10.1142/S0219467814500168.
In statistics, pattern recognition and signal processing, it is of utmost importance to have an effective and efficient distance to measure the similarity between two distributions and sequences. In statistics this is referred to as . Two leading goodness of fit methods are chi-square and Kolmogorov-Smirnov distances. The strictly localized nature of these two measures hinders their practical utilities in patterns and signals where the sample size is usually small. In view of this problem Rubner and colleagues developed the earth mover's distance (EMD) to allow for cross-bin moves in evaluating the distance between two patterns, which find a broad spectrum of applications. EMD-L1 was later proposed to reduce the time complexity of EMD from super-cubic by one order of magnitude by exploiting the special L1 metric. EMD-hat was developed to turn the global EMD to a localized one by discarding long-distance earth movements. In this work, we introduce a Markov EMD (MEMD) by treating the source and destination nodes absolutely symmetrically. In MEMD, like hat-EMD, the earth is only moved locally as dictated by the degree d of neighborhood system. Nodes that cannot be matched locally is handled by dummy source and destination nodes. By use of this localized network structure, a greedy algorithm that is linear to the degree d and number of nodes is then developed to evaluate the MEMD. Empirical studies on the use of MEMD on deterministic and statistical synthetic sequences and SIFT-based image retrieval suggested encouraging performances.
在统计学、模式识别和信号处理中,拥有一种有效且高效的距离度量方法来衡量两个分布和序列之间的相似性至关重要。在统计学中,这被称为 。两种主要的拟合优度方法是卡方距离和柯尔莫哥洛夫 - 斯米尔诺夫距离。这两种度量的严格局部化性质阻碍了它们在样本量通常较小的模式和信号中的实际应用。鉴于此问题,鲁布纳及其同事开发了推土机距离(EMD),以便在评估两个模式之间的距离时允许跨区间移动,该方法有广泛的应用。后来提出了EMD - L1,通过利用特殊的L1度量将EMD的时间复杂度从超立方降低一个数量级。开发了EMD - hat,通过舍弃长距离的土方移动将全局EMD转变为局部EMD。在这项工作中,我们通过绝对对称地处理源节点和目标节点引入了马尔可夫EMD(MEMD)。在MEMD中,与hat - EMD一样,土方仅在邻域系统的度d的规定下进行局部移动。无法在局部匹配的节点由虚拟源节点和目标节点处理。通过使用这种局部网络结构,然后开发了一种对度d和节点数量呈线性关系的贪心算法来评估MEMD。对MEMD在确定性和统计合成序列以及基于尺度不变特征变换(SIFT)的图像检索中的应用进行的实证研究表明其性能令人鼓舞。