Chen Chu-Song, Hsieh Wen-Teng, Chen Jiun-Hung
Institute of Information Science, Academia Sinica, Taipei, Taiwan, ROC.
IEEE Trans Syst Man Cybern B Cybern. 2004 Feb;34(1):179-99. doi: 10.1109/tsmcb.2003.811770.
This paper proposes a general scheme for recognizing the contents of a video using a set of panoramas recorded in a database. In essence, a panorama inherently records the appearances of an omni-directional scene from its central point to arbitrary viewing directions and, thus, can serve as a compact representation of an environment. In particular, this paper emphasizes the use of a sequence of successive frames in a video taken with a video camera, instead of a single frame, for visual recognition. The associated recognition task is formulated as a shortest-path searching problem, and a dynamic-programming technique is used to solve it. Experimental results show that our method can effectively recognize a video.
本文提出了一种使用数据库中记录的一组全景图来识别视频内容的通用方案。本质上,全景图从其中心点到任意视角方向固有地记录了全方位场景的外观,因此可以作为环境的紧凑表示。特别地,本文强调使用摄像机拍摄的视频中的连续帧序列,而不是单个帧,来进行视觉识别。相关的识别任务被表述为一个最短路径搜索问题,并使用动态规划技术来解决它。实验结果表明,我们的方法能够有效地识别视频。