IEEE Trans Image Process. 2015 Nov;24(11):3574-85. doi: 10.1109/TIP.2015.2445631. Epub 2015 Jun 15.
While surveillance video is the biggest source of unstructured Big Data today, the emergence of high-efficiency video coding (HEVC) standard is poised to have a huge role in lowering the costs associated with transmission and storage. Among the benefits of HEVC over the legacy MPEG-4 Advanced Video Coding (AVC), is a staggering 40 percent or more bitrate reduction at the same visual quality. Given the bandwidth limitations, video data are compressed essentially by removing spatial and temporal correlations that exist in its uncompressed form. This causes compressed data, which are already de-correlated, to serve as a vital resource for machine learning with significantly fewer samples for training. In this paper, an efficient approach to foreground extraction/segmentation is proposed using novel spatio-temporal de-correlated block features extracted directly from the HEVC compressed video. Most related techniques, in contrast, work on uncompressed images claiming significant storage and computational resources not only for the decoding process prior to initialization but also for the feature selection/extraction and background modeling stage following it. The proposed approach has been qualitatively and quantitatively evaluated against several other state-of-the-art methods.
虽然监控视频是当今最大的非结构化大数据来源,但高效视频编码 (HEVC) 标准的出现有望在降低传输和存储相关成本方面发挥巨大作用。与传统的 MPEG-4 高级视频编码 (AVC) 相比,HEVC 的优势在于在相同的视觉质量下,比特率可惊人地降低 40%或更多。考虑到带宽限制,视频数据基本上通过去除其未压缩形式中存在的空间和时间相关性来进行压缩。这使得已经去相关的压缩数据成为机器学习的重要资源,其训练样本数量显著减少。在本文中,提出了一种使用直接从 HEVC 压缩视频中提取的新颖的时空去相关块特征进行前景提取/分割的有效方法。相比之下,大多数相关技术都针对未压缩的图像,不仅在初始化之前的解码过程中需要大量的存储和计算资源,而且在其后的特征选择/提取和背景建模阶段也需要大量的存储和计算资源。该方法已经针对其他几种最先进的方法进行了定性和定量评估。