• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于少样本和零样本3D点云语义分割的原型适配与投影

Prototype Adaption and Projection for Few- and Zero-Shot 3D Point Cloud Semantic Segmentation.

作者信息

He Shuting, Jiang Xudong, Jiang Wei, Ding Henghui

出版信息

IEEE Trans Image Process. 2023;32:3199-3211. doi: 10.1109/TIP.2023.3279660. Epub 2023 Jun 7.

DOI:10.1109/TIP.2023.3279660
PMID:37252865
Abstract

In this work, we address the challenging task of few-shot and zero-shot 3D point cloud semantic segmentation. The success of few-shot semantic segmentation in 2D computer vision is mainly driven by the pre-training on large-scale datasets like imagenet. The feature extractor pre-trained on large-scale 2D datasets greatly helps the 2D few-shot learning. However, the development of 3D deep learning is hindered by the limited volume and instance modality of datasets due to the significant cost of 3D data collection and annotation. This results in less representative features and large intra-class feature variation for few-shot 3D point cloud segmentation. As a consequence, directly extending existing popular prototypical methods of 2D few-shot classification/segmentation into 3D point cloud segmentation won't work as well as in 2D domain. To address this issue, we propose a Query-Guided Prototype Adaption (QGPA) module to adapt the prototype from support point clouds feature space to query point clouds feature space. With such prototype adaption, we greatly alleviate the issue of large feature intra-class variation in point cloud and significantly improve the performance of few-shot 3D segmentation. Besides, to enhance the representation of prototypes, we introduce a Self-Reconstruction (SR) module that enables prototype to reconstruct the support mask as well as possible. Moreover, we further consider zero-shot 3D point cloud semantic segmentation where there is no support sample. To this end, we introduce category words as semantic information and propose a semantic-visual projection model to bridge the semantic and visual spaces. Our proposed method surpasses state-of-the-art algorithms by a considerable 7.90% and 14.82% under the 2-way 1-shot setting on S3DIS and ScanNet benchmarks, respectively.

摘要

在这项工作中,我们解决了少样本和零样本3D点云语义分割这一具有挑战性的任务。二维计算机视觉中少样本语义分割的成功主要得益于在大规模数据集(如图像网)上的预训练。在大规模二维数据集上预训练的特征提取器极大地有助于二维少样本学习。然而,由于三维数据采集和标注成本高昂,数据集的体积和实例模态有限,阻碍了三维深度学习的发展。这导致少样本三维点云分割的特征代表性较差且类内特征变化较大。因此,直接将现有的流行二维少样本分类/分割原型方法扩展到三维点云分割中,效果不如在二维领域。为了解决这个问题,我们提出了一个查询引导的原型适应(QGPA)模块,将原型从支持点云特征空间适应到查询点云特征空间。通过这种原型适应,我们大大缓解了点云类内特征变化大的问题,并显著提高了少样本三维分割的性能。此外,为了增强原型的表示能力,我们引入了一个自重建(SR)模块,使原型能够尽可能好地重建支持掩码。此外,我们还进一步考虑了没有支持样本的零样本三维点云语义分割。为此,我们引入类别词作为语义信息,并提出了一种语义-视觉投影模型来弥合语义和视觉空间。在S3DIS和ScanNet基准测试的双向单样本设置下,我们提出的方法分别比现有最先进的算法高出7.90%和14.82%。

相似文献

1
Prototype Adaption and Projection for Few- and Zero-Shot 3D Point Cloud Semantic Segmentation.用于少样本和零样本3D点云语义分割的原型适配与投影
IEEE Trans Image Process. 2023;32:3199-3211. doi: 10.1109/TIP.2023.3279660. Epub 2023 Jun 7.
2
Point Cloud Semantic Segmentation Network Based on Multi-Scale Feature Fusion.基于多尺度特征融合的点云语义分割网络
Sensors (Basel). 2021 Feb 26;21(5):1625. doi: 10.3390/s21051625.
3
LESA-Net: Semantic segmentation of multi-type road point clouds in complex agroforestry environment.LESA-Net:复杂农林环境中多类型道路点云的语义分割
Heliyon. 2024 Aug 28;10(17):e36814. doi: 10.1016/j.heliyon.2024.e36814. eCollection 2024 Sep 15.
4
An Efficient Ensemble Deep Learning Approach for Semantic Point Cloud Segmentation Based on 3D Geometric Features and Range Images.一种基于3D几何特征和距离图像的高效集成深度学习语义点云分割方法。
Sensors (Basel). 2022 Aug 18;22(16):6210. doi: 10.3390/s22166210.
5
CLIP-Driven Prototype Network for Few-Shot Semantic Segmentation.用于少样本语义分割的基于CLIP的原型网络
Entropy (Basel). 2023 Sep 18;25(9):1353. doi: 10.3390/e25091353.
6
FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning.FWNet:使用深度学习对全波形激光雷达数据进行语义分割
Sensors (Basel). 2020 Jun 24;20(12):3568. doi: 10.3390/s20123568.
7
PointRas: Uncertainty-Aware Multi-Resolution Learning for Point Cloud Segmentation.PointRas:用于点云分割的不确定性感知多分辨率学习
IEEE Trans Image Process. 2022;31:6002-6016. doi: 10.1109/TIP.2022.3205208. Epub 2022 Sep 19.
8
Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud.基于迁移学习的点云三维目标检测语义分割。
Sensors (Basel). 2021 Jun 8;21(12):3964. doi: 10.3390/s21123964.
9
Semantic Labeling and Instance Segmentation of 3D Point Clouds Using Patch Context Analysis and Multiscale Processing.基于面片上下文分析和多尺度处理的三维点云语义标注与实例分割
IEEE Trans Vis Comput Graph. 2020 Jul;26(7):2485-2498. doi: 10.1109/TVCG.2018.2889944. Epub 2018 Dec 27.
10
Learning Semantic Segmentation of Large-Scale Point Clouds With Random Sampling.通过随机采样学习大规模点云的语义分割
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):8338-8354. doi: 10.1109/TPAMI.2021.3083288. Epub 2022 Oct 4.

引用本文的文献

1
Few-Shot Segmentation of 3D Point Clouds Under Real-World Distributional Shifts in Railroad Infrastructure.铁路基础设施中实际分布变化下的三维点云少样本分割
Sensors (Basel). 2025 Feb 11;25(4):1072. doi: 10.3390/s25041072.