IT Research Division, Konica Minolta Laboratory U.S.A. Inc., 2855 Campus Dr., San Mateo, CA 94403, USA.
IEEE Trans Pattern Anal Mach Intell. 2013 Oct;35(10):2468-83. doi: 10.1109/TPAMI.2013.33.
Complex activities typically consist of multiple primitive events happening in parallel or sequentially over a period of time. Understanding such activities requires recognizing not only each individual event but, more importantly, capturing their spatiotemporal dependencies over different time intervals. Most of the current graphical model-based approaches have several limitations. First, time--sliced graphical models such as hidden Markov models (HMMs) and dynamic Bayesian networks are typically based on points of time and they hence can only capture three temporal relations: precedes, follows, and equals. Second, HMMs are probabilistic finite-state machines that grow exponentially as the number of parallel events increases. Third, other approaches such as syntactic and description-based methods, while rich in modeling temporal relationships, do not have the expressive power to capture uncertainties. To address these issues, we introduce the interval temporal Bayesian network (ITBN), a novel graphical model that combines the Bayesian Network with the interval algebra to explicitly model the temporal dependencies over time intervals. Advanced machine learning methods are introduced to learn the ITBN model structure and parameters. Experimental results show that by reasoning with spatiotemporal dependencies, the proposed model leads to a significantly improved performance when modeling and recognizing complex activities involving both parallel and sequential events.
复杂活动通常由多个在一段时间内并行或顺序发生的基本事件组成。理解此类活动不仅需要识别每个单独的事件,更重要的是,需要捕获它们在不同时间间隔的时空依赖关系。目前大多数基于图形模型的方法都存在一些局限性。首先,基于时间切片的图形模型,如隐马尔可夫模型(HMM)和动态贝叶斯网络,通常基于时间点,因此只能捕捉三种时间关系:先行、后继和相等。其次,HMM 是概率有限状态机,随着并行事件数量的增加呈指数级增长。第三,其他方法,如基于句法和描述的方法,虽然在建模时间关系方面很丰富,但没有表达能力来捕获不确定性。为了解决这些问题,我们引入了区间时间贝叶斯网络(ITBN),这是一种将贝叶斯网络与区间代数相结合的新型图形模型,用于显式地对时间间隔上的时间依赖关系进行建模。引入了先进的机器学习方法来学习 ITBN 模型结构和参数。实验结果表明,通过推理时空依赖关系,所提出的模型在对涉及并行和顺序事件的复杂活动进行建模和识别时,性能得到了显著提高。