用户行为驱动的视图和评分可扩展多视点视频编码。

User-action-driven view and rate scalable multiview video coding.

机构信息

Signal Processing Laboratory, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland.

出版信息

IEEE Trans Image Process. 2013 Sep;22(9):3473-84. doi: 10.1109/TIP.2013.2269801. Epub 2013 Jun 18.

DOI:10.1109/TIP.2013.2269801

Abstract

We derive an optimization framework for joint view and rate scalable coding of multi-view video content represented in the texture plus depth format. The optimization enables the sender to select the subset of coded views and their encoding rates such that the aggregate distortion over a continuum of synthesized views is minimized. We construct the view and rate embedded bitstream such that it delivers optimal performance simultaneously over a discrete set of transmission rates. In conjunction, we develop a user interaction model that characterizes the view selection actions of the client as a Markov chain over a discrete state-space. We exploit the model within the context of our optimization to compute user-action-driven coding strategies that aim at enhancing the client's performance in terms of latency and video quality. Our optimization outperforms the state-of-the-art H.264 SVC codec as well as a multi-view wavelet-based coder equipped with a uniform rate allocation strategy, across all scenarios studied in our experiments. Equally important, we can achieve an arbitrarily fine granularity of encoding bit rates, while providing a novel functionality of view embedded encoding, unlike the other encoding methods that we examined. Finally, we observe that the interactivity-aware coding delivers superior performance over conventional allocation techniques that do not anticipate the client's view selection actions in their operation.

摘要

我们提出了一种用于多视点视频内容的联合视图和率可分级编码的优化框架，该内容采用纹理加深度格式表示。该优化使发送方能够选择编码视图的子集及其编码速率，从而使在连续合成视图上的总失真最小化。我们构建了视图和速率嵌入式比特流，以便在离散的传输速率集上同时提供最佳性能。同时，我们开发了一种用户交互模型，该模型将客户端的视图选择操作描述为离散状态空间上的马尔可夫链。我们在优化的上下文中利用该模型来计算用户操作驱动的编码策略，旨在提高客户端的延迟和视频质量性能。我们的优化在我们的实验中研究的所有场景中都优于最先进的 H.264 SVC 编解码器以及配备统一速率分配策略的多视图小波编码器。同样重要的是，我们可以实现任意细粒度的编码比特率，而不像我们检查的其他编码方法那样提供视图嵌入式编码的新功能。最后，我们观察到，与不预测客户端视图选择操作的传统分配技术相比，交互感知编码提供了更好的性能。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

用户行为驱动的视图和评分可扩展多视点视频编码。

User-action-driven view and rate scalable multiview video coding.

机构信息

出版信息

相似文献

用户行为驱动的视图和评分可扩展多视点视频编码。

User-action-driven view and rate scalable multiview video coding.

机构信息

出版信息

相似文献