基于单目视频的立体视频合成

Stereoscopic video synthesis from a monocular video.

作者信息

Zhang Guofeng, Hua Wei, Qin Xueying, Wong Tien-Tsin, Bao Hujun

机构信息

State Key Lab of CAD & CG, Zhejiang University, Hangzhou, PR China.

出版信息

IEEE Trans Vis Comput Graph. 2007 Jul-Aug;13(4):686-96. doi: 10.1109/TVCG.2007.1032.

DOI:10.1109/TVCG.2007.1032

PMID:17495329

Abstract

This paper presents an automatic and robust approach to synthesize stereoscopic videos from ordinary monocular videos acquired by commodity video cameras. Instead of recovering the depth map, the proposed method synthesizes the binocular parallax in stereoscopic video directly from the motion parallax in monocular video. The synthesis is formulated as an optimization problem via introducing a cost function of the stereoscopic effects, the similarity, and the smoothness constraints. The optimization selects the most suitable frames in the input video for generating the stereoscopic video frames. With the optimized selection, convincing and smooth stereoscopic video can be synthesized even by simple constant-depth warping. No user interaction is required. We demonstrate the visually plausible results obtained given the input clips acquired by ordinary handheld video camera.

摘要

本文提出了一种自动且稳健的方法，可从商用摄像机获取的普通单目视频合成立体视频。该方法并非恢复深度图，而是直接从单目视频中的运动视差合成立体视频中的双目视差。通过引入立体效果、相似度和平滑度约束的代价函数，将合成过程表述为一个优化问题。该优化在输入视频中选择最合适的帧来生成立体视频帧。通过这种优化选择，即使采用简单的恒定深度扭曲，也能合成令人信服且平滑的立体视频，无需用户交互。我们展示了使用普通手持摄像机获取的输入片段所得到的视觉上合理的结果。

相似文献

Stereoscopic video synthesis from a monocular video.

IEEE Trans Vis Comput Graph. 2007 Jul-Aug;13(4):686-96. doi: 10.1109/TVCG.2007.1032.

Generalized parallel-perspective stereo mosaics from airborne video.

IEEE Trans Pattern Anal Mach Intell. 2004 Feb;26(2):226-37. doi: 10.1109/TPAMI.2004.1262190.

Interactive stereoscopic rendering of volumetric environments.

IEEE Trans Vis Comput Graph. 2004 Jan-Feb;10(1):15-28. doi: 10.1109/TVCG.2004.1260755.

Multiresolution and wide-scope depth estimation using a dual-PTZ-camera system.

IEEE Trans Image Process. 2009 Mar;18(3):677-82. doi: 10.1109/TIP.2008.2011178.

Stroke surfaces: temporally coherent artistic animations from video.

IEEE Trans Vis Comput Graph. 2005 Sep-Oct;11(5):540-9. doi: 10.1109/TVCG.2005.85.

MonoSLAM: real-time single camera SLAM.

IEEE Trans Pattern Anal Mach Intell. 2007 Jun;29(6):1052-67. doi: 10.1109/TPAMI.2007.1049.

Correcting interperspective aliasing in autostereoscopic displays.

IEEE Trans Vis Comput Graph. 2005 Mar-Apr;11(2):228-36. doi: 10.1109/TVCG.2005.28.

Full-frame video stabilization with motion inpainting.

IEEE Trans Pattern Anal Mach Intell. 2006 Jul;28(7):1150-63. doi: 10.1109/TPAMI.2006.141.

Detecting motion regions in presence of strong parallax from a moving camera by multi-view geometric constraints.

IEEE Trans Pattern Anal Mach Intell. 2007 Sep;29(9):1627-41. doi: 10.1109/TPAMI.2007.1084.

Shape deformation using a skeleton to drive simplex transformations.

IEEE Trans Vis Comput Graph. 2008 May-Jun;14(3):693-706. doi: 10.1109/TVCG.2008.28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于单目视频的立体视频合成

Stereoscopic video synthesis from a monocular video.

作者信息

机构信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献