IEEE Trans Image Process. 2023;32:3521-3535. doi: 10.1109/TIP.2023.3286708. Epub 2023 Jun 29.
Inspired by active learning and 2D-3D semantic fusion, we propose a novel framework for 3D scene semantic segmentation based on rendered 2D images, which can efficiently segment any large-scale 3D scene with only a few 2D image annotations. In our framework, we first render perspective images at selected positions in the 3D scene. We then iteratively fine-tune a pre-trained image semantic segmentation network and project all of its dense predictions onto the 3D model for fusion. In each iteration, we evaluate the 3D semantic model, re-render images in several representative areas where the 3D segmentation is unstable, and, after annotation, feed them to the network for training. Through this iterative rendering-segmentation-fusion process, the framework effectively generates hard-to-segment image samples in the scene while avoiding complex 3D annotation, thereby achieving label-efficient 3D scene segmentation. Experiments on three large-scale indoor and outdoor 3D datasets demonstrate the effectiveness of the proposed method compared with other state-of-the-art approaches.
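The abstract describes an iterative rendering-segmentation-fusion loop with uncertainty-driven view selection. The Python sketch below illustrates one plausible structure of that loop under our own assumptions; it is not the authors' implementation. The helpers render_views, annotate, project_to_3d, fuse_labels, and find_unstable_regions are hypothetical placeholders for the paper's rendering, annotation, projection/fusion, and stability-evaluation steps, and model stands for an arbitrary pre-trained 2D segmentation network exposing fine_tune and predict.

# Minimal sketch of the rendering-segmentation-fusion loop (assumptions only).
# All helpers below are hypothetical placeholders, not the paper's code.

def label_efficient_3d_segmentation(scene, model, num_iters=5, views_per_iter=20):
    """Iteratively segment a 3D scene from a few annotated rendered 2D views."""
    # 1. Render an initial batch of perspective views at selected camera poses.
    views = render_views(scene, num_views=views_per_iter)
    labeled = [(v, annotate(v)) for v in views]  # sparse 2D annotations

    semantic_3d = None
    for _ in range(num_iters):
        # 2. Fine-tune the pre-trained 2D segmentation network on labeled views.
        model.fine_tune(labeled)

        # 3. Predict dense 2D labels for all rendered views and project them
        #    onto the 3D model, fusing per-point class scores.
        predictions = [(v, model.predict(v)) for v in views]
        semantic_3d = fuse_labels(
            [project_to_3d(scene, v, p) for v, p in predictions]
        )

        # 4. Locate representative regions where the fused 3D labels are
        #    unstable (e.g. views disagree) and re-render images there.
        unstable_regions = find_unstable_regions(semantic_3d)
        new_views = [render_views(scene, region=r, num_views=1)[0]
                     for r in unstable_regions[:views_per_iter]]

        # 5. Annotate only these hard views and add them to the training pool.
        labeled += [(v, annotate(v)) for v in new_views]
        views += new_views

    return semantic_3d

In this reading, the annotation cost stays low because only the re-rendered views from unstable regions are labeled in each round, while the 3D model itself is never annotated directly.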