IEEE Trans Pattern Anal Mach Intell. 2014 Nov;36(11):2144-58. doi: 10.1109/TPAMI.2014.2316835.
We describe a technique that automatically generates plausible depth maps from videos using non-parametric depth sampling. We demonstrate our technique in cases where past methods fail (non-translating cameras and dynamic scenes). Our technique is applicable to single images as well as videos. For videos, we use local motion cues to improve the inferred depth maps, while optical flow is used to ensure temporal depth consistency. For training and evaluation, we use a Kinect-based system to collect a large data set containing stereoscopic videos with known depths. We show that our depth estimation technique outperforms the state-of-the-art on benchmark databases. Our technique can be used to automatically convert a monoscopic video into stereo for 3D visualization, and we demonstrate this through a variety of visually pleasing results for indoor and outdoor scenes, including results from the feature film Charade.
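The abstract's final application, converting monocular video to stereo for 3D viewing, rests on depth-image-based rendering: each pixel is shifted horizontally in proportion to its inverse depth to synthesize a left/right view pair. The sketch below is a generic illustration of that idea, not the authors' implementation; the function names and the disparity scale `max_disp` are illustrative assumptions.

```python
import numpy as np

def depth_to_disparity(depth, max_disp=24.0, eps=1e-6):
    """Map depth to horizontal disparity: nearer pixels shift more (assumed scaling)."""
    inv = 1.0 / np.maximum(depth, eps)
    inv = (inv - inv.min()) / max(inv.max() - inv.min(), eps)
    return inv * max_disp  # disparity in pixels

def render_view(image, disparity, sign=1):
    """Forward-warp pixels horizontally; fill holes from the left neighbor."""
    h, w = disparity.shape
    out = np.zeros_like(image)
    filled = np.zeros((h, w), dtype=bool)
    xs = np.arange(w)
    for y in range(h):
        tx = np.clip(np.round(xs + sign * disparity[y]).astype(int), 0, w - 1)
        out[y, tx] = image[y, xs]
        filled[y, tx] = True
        for x in range(1, w):          # naive disocclusion filling
            if not filled[y, x]:
                out[y, x] = out[y, x - 1]
    return out

def mono_to_stereo(image, depth):
    """Split the total disparity between a synthesized left and right view."""
    disp = depth_to_disparity(depth)
    left = render_view(image, 0.5 * disp, sign=1)
    right = render_view(image, 0.5 * disp, sign=-1)
    return left, right
```

Given a frame and a depth map estimated by the paper's non-parametric sampling, `mono_to_stereo(frame, depth)` would return an anaglyph-ready view pair; real converters use more careful occlusion and hole handling than this sketch.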