• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于学习的 2D 到 3D 图像和视频自动转换。

Learning-based, automatic 2D-to-3D image and video conversion.

机构信息

Department of Electrical and Computer Engineering, Boston University, Boston, MA 02215, USA.

出版信息

IEEE Trans Image Process. 2013 Sep;22(9):3485-96. doi: 10.1109/TIP.2013.2270375. Epub 2013 Jun 20.

DOI:10.1109/TIP.2013.2270375
PMID:23799697
Abstract

Despite a significant growth in the last few years, the availability of 3D content is still dwarfed by that of its 2D counterpart. To close this gap, many 2D-to-3D image and video conversion methods have been proposed. Methods involving human operators have been most successful but also time-consuming and costly. Automatic methods, which typically make use of a deterministic 3D scene model, have not yet achieved the same level of quality for they rely on assumptions that are often violated in practice. In this paper, we propose a new class of methods that are based on the radically different approach of learning the 2D-to-3D conversion from examples. We develop two types of methods. The first is based on learning a point mapping from local image/video attributes, such as color, spatial position, and, in the case of video, motion at each pixel, to scene-depth at that pixel using a regression type idea. The second method is based on globally estimating the entire depth map of a query image directly from a repository of 3D images ( image+depth pairs or stereopairs) using a nearest-neighbor regression type idea. We demonstrate both the efficacy and the computational efficiency of our methods on numerous 2D images and discuss their drawbacks and benefits. Although far from perfect, our results demonstrate that repositories of 3D content can be used for effective 2D-to-3D image conversion. An extension to video is immediate by enforcing temporal continuity of computed depth maps.

摘要

尽管在过去几年中得到了显著发展,但 3D 内容的可用性仍然远远落后于其 2D 内容。为了缩小这一差距,已经提出了许多 2D 到 3D 的图像和视频转换方法。涉及人工操作员的方法最为成功,但也耗时且昂贵。自动方法通常利用确定性的 3D 场景模型,但尚未达到相同的质量水平,因为它们依赖于在实践中经常被违反的假设。在本文中,我们提出了一类新的方法,这些方法基于从示例中学习 2D 到 3D 转换的根本不同的方法。我们开发了两种类型的方法。第一种方法基于学习从局部图像/视频属性(例如颜色、空间位置,并且在视频的情况下,每个像素的运动)到该像素的场景深度的点映射,使用回归类型的想法。第二种方法基于使用最近邻回归类型的想法,直接从 3D 图像(图像+深度对或立体对)的存储库全局估计查询图像的整个深度图。我们在许多 2D 图像上展示了我们的方法的有效性和计算效率,并讨论了它们的优缺点。尽管还远非完美,但我们的结果表明,可以使用 3D 内容的存储库有效地进行 2D 到 3D 的图像转换。通过强制计算的深度图的时间连续性,很容易扩展到视频。

相似文献

1
Learning-based, automatic 2D-to-3D image and video conversion.基于学习的 2D 到 3D 图像和视频自动转换。
IEEE Trans Image Process. 2013 Sep;22(9):3485-96. doi: 10.1109/TIP.2013.2270375. Epub 2013 Jun 20.
2
A Novel 2D-to-3D Video Conversion Method Using Time-Coherent Depth Maps.一种使用时间相干深度图的新型二维到三维视频转换方法。
Sensors (Basel). 2015 Jun 29;15(7):15246-64. doi: 10.3390/s150715246.
3
Automatic Depth Extraction from 2D Images Using a Cluster-Based Learning Framework.基于聚类学习框架的二维图像自动深度提取。
IEEE Trans Image Process. 2018 Jul;27(7):3288-3299. doi: 10.1109/TIP.2018.2813093.
4
Adaptive image warping for hole prevention in 3D view synthesis.自适应图像变形防止 3D 视图合成中的空洞。
IEEE Trans Image Process. 2013 Sep;22(9):3420-32. doi: 10.1109/TIP.2013.2268940. Epub 2013 Jun 14.
5
Toward naturalistic 2D-to-3D conversion.朝向自然的 2D 到 3D 转换。
IEEE Trans Image Process. 2015 Feb;24(2):724-33. doi: 10.1109/TIP.2014.2385474. Epub 2014 Dec 23.
6
High-Quality Depth Estimation Using an Exemplar 3D Model for Stereo Conversion.使用示例3D模型进行立体转换的高质量深度估计
IEEE Trans Vis Comput Graph. 2015 Jul;21(7):835-47. doi: 10.1109/TVCG.2015.2398440.
7
Head pose estimation from a 2D face image using 3D face morphing with depth parameters.基于深度参数的 3D 人脸变形的 2D 人脸图像的头部姿势估计。
IEEE Trans Image Process. 2015 Jun;24(6):1801-8. doi: 10.1109/TIP.2015.2405483. Epub 2015 Feb 19.
8
3D PET image reconstruction including both motion correction and registration directly into an MR or stereotaxic spatial atlas.3D PET 图像重建,包括运动校正和直接到 MR 或立体定向空间图谱的配准。
Phys Med Biol. 2013 Jan 7;58(1):105-26. doi: 10.1088/0031-9155/58/1/105. Epub 2012 Dec 6.
9
First-order and second-order statistical analysis of 3D and 2D image structure.3D和2D图像结构的一阶和二阶统计分析。
Network. 2007 Jun;18(2):129-60. doi: 10.1080/09548980701580444.
10
An efficient depth map preprocessing method based on structure-aided domain transform smoothing for 3D view generation.一种基于结构辅助域变换平滑的高效深度图预处理方法,用于三维视图生成。
PLoS One. 2017 Apr 13;12(4):e0175910. doi: 10.1371/journal.pone.0175910. eCollection 2017.

引用本文的文献

1
Deep Ordinal Regression Network for Monocular Depth Estimation.用于单目深度估计的深度序数回归网络
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2018 Jun;2018:2002-2011. doi: 10.1109/CVPR.2018.00214. Epub 2018 Dec 17.
2
Degraded image enhancement by image dehazing and Directional Filter Banks using Depth Image based Rendering for future free-view 3D-TV.基于深度图像绘制的图像去雾和方向滤波器组对退化图像的增强,用于未来的自由视点 3D-TV。
PLoS One. 2019 May 23;14(5):e0217246. doi: 10.1371/journal.pone.0217246. eCollection 2019.