Yang Xianghui, Lin Guosheng, Zhou Luping
IEEE Trans Image Process. 2023;32:3746-3758. doi: 10.1109/TIP.2023.3279661. Epub 2023 Jul 7.
Single-view 3D object reconstruction is a fundamental and challenging computer vision task that aims to recover 3D shapes from single-view RGB images. Most existing deep-learning-based reconstruction methods are trained and evaluated on the same categories, and they do not work well on objects from novel categories that were not seen during training. Focusing on this issue, this paper tackles Single-view 3D Mesh Reconstruction to study model generalization to unseen categories and to encourage models to reconstruct objects literally. Specifically, we propose an end-to-end two-stage network, GenMesh, to break the category boundaries in reconstruction. First, we factorize the complicated image-to-mesh mapping into two simpler mappings, i.e., an image-to-point mapping and a point-to-mesh mapping, where the latter is mainly a geometric problem and less dependent on object categories. Second, we devise a local feature sampling strategy in the 2D and 3D feature spaces to capture the local geometry shared across objects and thereby enhance model generalization. Third, apart from the traditional point-to-point supervision, we introduce a multi-view silhouette loss to supervise the surface generation process, which provides additional regularization and further relieves overfitting. Experimental results show that our method significantly outperforms existing works on ShapeNet and Pix3D under different scenarios and various metrics, especially for novel objects.
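The abstract names two forms of supervision: traditional point-to-point supervision and a multi-view silhouette loss. The paper's exact formulations appear in the full text; the sketch below is only a rough illustration of the two ideas under common simplifying assumptions — a symmetric Chamfer distance for point-to-point supervision, and an IoU-style loss comparing per-view soft silhouette masks. The function names and the specific silhouette formulation are assumptions, not the authors' implementation.

```python
import numpy as np

def chamfer_distance(p, q):
    """Symmetric Chamfer distance between two point clouds.

    p: (N, 3) predicted points, q: (M, 3) ground-truth points.
    A common stand-in for "point-to-point" supervision on point sets.
    """
    # Pairwise Euclidean distances, shape (N, M), via broadcasting.
    d = np.linalg.norm(p[:, None, :] - q[None, :, :], axis=-1)
    # Nearest-neighbour distance in each direction, averaged.
    return d.min(axis=1).mean() + d.min(axis=0).mean()

def silhouette_loss(pred_masks, gt_masks, eps=1e-8):
    """IoU-style multi-view silhouette loss (illustrative formulation).

    pred_masks, gt_masks: (V, H, W) soft silhouettes in [0, 1],
    one per view; in practice pred_masks would come from a
    differentiable renderer applied to the generated surface.
    """
    inter = (pred_masks * gt_masks).sum(axis=(1, 2))
    union = (pred_masks + gt_masks - pred_masks * gt_masks).sum(axis=(1, 2))
    # 1 - IoU per view, averaged over views: 0 for a perfect match.
    return float((1.0 - inter / (union + eps)).mean())
```

Both terms vanish for a perfect reconstruction, and the silhouette term regularizes the surface from multiple viewpoints rather than only at sampled points.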