Shokoohi-Yekta Mohammad, Hu Bing, Jin Hongxia, Wang Jun, Keogh Eamonn
Apple Inc., Cupertino, USA.
Facebook, Menlo Park, USA.
Data Min Knowl Discov. 2017 Jan;31(1):1-31. doi: 10.1007/s10618-016-0455-0. Epub 2016 Feb 15.
In recent years Dynamic Time Warping (DTW) has emerged as the distance measure of choice for virtually all time series data mining applications. For example, virtually all applications that process data from wearable devices use DTW as a core sub-routine. This is the result of significant progress in improving DTW's efficiency, together with multiple empirical studies showing that DTW-based classifiers at least equal (and generally surpass) the accuracy of all their rivals across dozens of datasets. Thus far, most of the research has considered only the one-dimensional case, with practitioners generalizing to the multi-dimensional case in one of two ways, or warping. In general, it appears the community believes either that the two ways are equivalent, or that the choice is irrelevant. In this work, we show that this is not the case. The two most commonly used multi-dimensional DTW methods can produce different classifications, and neither one dominates over the other. This seems to suggest that one should learn the best method for a particular application. However, we will show that this is not necessary; a simple, principled rule can be used on a case-by-case basis to predict which of the two methods we should trust at the time of classification. Our method allows us to ensure that classification results are at least as accurate as the better of the two rival methods, and, in many cases, our method is significantly more accurate. We demonstrate our ideas with the most extensive set of multi-dimensional time series classification experiments ever attempted.
近年来,动态时间规整(DTW)已成为几乎所有时间序列数据挖掘应用中首选的距离度量方法。例如,几乎所有处理可穿戴设备数据的应用都将DTW用作核心子程序。这是DTW效率显著提高的结果,同时多项实证研究表明,基于DTW的分类器在数十个数据集中至少与所有竞争对手的准确率相当(且通常超过)。到目前为止,大多数研究仅考虑一维情况,从业者通常通过两种方式之一将其推广到多维情况,即 或 规整。一般来说,该领域似乎认为这两种方式是等效的,或者认为这种选择无关紧要。在这项工作中,我们表明事实并非如此。两种最常用的多维DTW方法可能会产生不同的分类结果,且没有一种方法占主导地位。这似乎表明应该为特定应用学习最佳方法。然而,我们将表明这并非必要;可以根据具体情况使用一个简单、有原则的规则来预测在分类时应该信任这两种方法中的哪一种。我们的方法使我们能够确保分类结果至少与两种竞争方法中较好的方法一样准确,并且在许多情况下,我们的方法要准确得多。我们通过有史以来尝试过的最广泛的多维时间序列分类实验来证明我们的想法。