Novel Views of Objects from a Single Image.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2017 Aug;39(8):1576-1590. doi: 10.1109/TPAMI.2016.2601093. Epub 2016 Aug 17.

Abstract

Taking an image of an object is at its core a lossy process. The rich information about the three-dimensional structure of the world is flattened to an image plane and decisions such as viewpoint and camera parameters are final and not easily revertible. As a consequence, possibilities of changing viewpoint are limited. Given a single image depicting an object, novel-view synthesis is the task of generating new images that render the object from a different viewpoint than the one given. The main difficulty is to synthesize the parts that are disoccluded; disocclusion occurs when parts of an object are hidden by the object itself under a specific viewpoint. In this work, we show how to improve novel-view synthesis by making use of the correlations observed in 3D models and applying them to new image instances. We propose a technique to use the structural information extracted from a 3D model that matches the image object in terms of viewpoint and shape. For the latter part, we propose an efficient 2D-to-3D alignment method that associates precisely the image appearance with the 3D model geometry with minimal user interaction. Our technique is able to simulate plausible viewpoint changes for a variety of object classes within seconds. Additionally, we show that our synthesized images can be used as additional training data that improves the performance of standard object detectors.
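To make the notion of disocclusion concrete, here is a minimal sketch, not the paper's method: it takes a cube (a stand-in for an aligned 3D model), determines which faces are visible from a source and a target camera, and reports the faces that become visible only in the target view. The cube, the face names, and the functions `camera_position`, `visible_faces`, and `disoccluded_faces` are all assumptions made for illustration. The visibility test relies on the object being convex; a general 3D model would need a depth-buffer or ray-casting test instead.

```python
import math

# Outward unit normals of an axis-aligned cube centred at the origin.
# The cube is a hypothetical stand-in for a 3D model aligned to the image.
CUBE_FACE_NORMALS = {
    "front": (0.0, 0.0, 1.0),
    "back": (0.0, 0.0, -1.0),
    "left": (-1.0, 0.0, 0.0),
    "right": (1.0, 0.0, 0.0),
    "top": (0.0, 1.0, 0.0),
    "bottom": (0.0, -1.0, 0.0),
}


def camera_position(azimuth_deg, elevation_deg, distance=5.0):
    """Place a camera on a sphere around the origin (y is up, z faces the front)."""
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    return (
        distance * math.cos(el) * math.sin(az),
        distance * math.sin(el),
        distance * math.cos(el) * math.cos(az),
    )


def visible_faces(camera, half_size=1.0):
    """Return the cube faces that face the camera.

    For a convex object, a face is visible exactly when its outward normal
    points towards the camera, so no depth buffer is needed here.
    """
    visible = set()
    for name, normal in CUBE_FACE_NORMALS.items():
        face_centre = tuple(half_size * n for n in normal)
        to_camera = tuple(c - f for c, f in zip(camera, face_centre))
        facing = sum(n * t for n, t in zip(normal, to_camera))
        if facing > 1e-9:
            visible.add(name)
    return visible


def disoccluded_faces(source_camera, target_camera):
    """Faces hidden by the object itself in the source view but seen in the target view."""
    return visible_faces(target_camera) - visible_faces(source_camera)


if __name__ == "__main__":
    source = camera_position(azimuth_deg=0, elevation_deg=0)
    target = camera_position(azimuth_deg=60, elevation_deg=30)
    print("visible in source view:", sorted(visible_faces(source)))
    print("disoccluded in target view:", sorted(disoccluded_faces(source, target)))
```

In this toy setting the disoccluded faces are exactly the regions for which the input image contains no pixels, which is why a synthesis method has to draw on some other source of information, such as the structure of a matching 3D model, to fill them in.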
