General Electric Global Research Center, 305 Connor Court, Willowbrook Apt, Niskayuna, NY 12309, USA.
IEEE Trans Pattern Anal Mach Intell. 2012 Apr;34(4):654-69. doi: 10.1109/TPAMI.2011.152.
The goal of this work is to learn a parsimonious and informative representation for high-dimensional time series. Conceptually, this comprises two distinct yet tightly coupled tasks: learning a low-dimensional manifold and modeling the dynamical process. These two tasks have a complementary relationship as the temporal constraints provide valuable neighborhood information for dimensionality reduction and, conversely, the low-dimensional space allows dynamics to be learned efficiently. Solving these two tasks simultaneously allows important information to be exchanged mutually. If nonlinear models are required to capture the rich complexity of time series, then the learning problem becomes harder as the nonlinearities in both tasks are coupled. A divide, conquer, and coordinate method is proposed. The solution approximates the nonlinear manifold and dynamics using simple piecewise linear models. The interactions and coordinations among the linear models are captured in a graphical model. The model structure setup and parameter learning are done using a variational Bayesian approach, which enables automatic Bayesian model structure selection, hence solving the problem of overfitting. By exploiting the model structure, efficient inference and learning algorithms are obtained without oversimplifying the model of the underlying dynamical process. Evaluation of the proposed framework with competing approaches is conducted in three sets of experiments: dimensionality reduction and reconstruction using synthetic time series, video synthesis using a dynamic texture database, and human motion synthesis, classification, and tracking on a benchmark data set. In all experiments, the proposed approach provides superior performance.
这项工作的目标是为高维时间序列学习一种简洁而有信息量的表示。从概念上讲,这包括两个截然不同但紧密相关的任务:学习低维流形和建模动态过程。这两个任务具有互补关系,因为时间约束为降维提供了有价值的邻域信息,反之,低维空间允许高效地学习动态。同时解决这两个任务可以相互交换重要信息。如果需要非线性模型来捕捉时间序列的丰富复杂性,那么由于两个任务中的非线性相互耦合,学习问题就会变得更加困难。提出了一种分而治之、协调的方法。该解决方案使用简单的分段线性模型来近似非线性流形和动态。线性模型之间的相互作用和协调在图形模型中捕获。模型结构设置和参数学习使用变分贝叶斯方法完成,这使得能够自动进行贝叶斯模型结构选择,从而解决过度拟合问题。通过利用模型结构,在不简化基础动态过程模型的情况下,获得了高效的推理和学习算法。通过在三组实验中与竞争方法进行评估:使用合成时间序列进行降维和重建、使用动态纹理数据库进行视频合成以及在基准数据集上进行人体运动合成、分类和跟踪。在所有实验中,所提出的方法都提供了卓越的性能。