• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

面向 3D 点云目标检测和标注的弱监督框架研究

Towards a Weakly Supervised Framework for 3D Point Cloud Object Detection and Annotation.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Aug;44(8):4454-4468. doi: 10.1109/TPAMI.2021.3063611. Epub 2022 Jul 1.

DOI:10.1109/TPAMI.2021.3063611
PMID:33656990
Abstract

It is quite laborious and costly to manually label LiDAR point cloud data for training high-quality 3D object detectors. This work proposes a weakly supervised framework which allows learning 3D detection from a few weakly annotated examples. This is achieved by a two-stage architecture design. Stage-1 learns to generate cylindrical object proposals under inaccurate and inexact supervision, obtained by our proposed BEV center-click annotation strategy, where only the horizontal object centers are click-annotated in bird's view scenes. Stage-2 learns to predict cuboids and confidence scores in a coarse-to-fine, cascade manner, under incomplete supervision, i.e., only a small portion of object cuboids are precisely annotated. With KITTI dataset, using only 500 weakly annotated scenes and 534 precisely labeled vehicle instances, our method achieves 86-97 percent the performance of current top-leading, fully supervised detectors (which require 3,712 exhaustively annotated scenes with 15,654 instances). More importantly, with our elaborately designed network architecture, our trained model can be applied as a 3D object annotator, supporting both automatic and active (human-in-the-loop) working modes. The annotations generated by our model can be used to train 3D object detectors, achieving over 95 percent of their original performance (with manually labeled training data). Our experiments also show our model's potential in boosting performance when given more training data. The above designs make our approach highly practical and open-up opportunities for learning 3D detection at reduced annotation cost.

摘要

手动标注激光雷达点云数据来训练高质量的 3D 目标检测模型非常繁琐且成本高昂。本研究提出了一种弱监督框架,可通过少量弱标注示例学习 3D 检测。该框架采用两阶段架构设计。第一阶段在不准确和不精确的监督下学习生成圆柱目标建议,这种监督是通过我们提出的 BEV 中心点击标注策略获得的,其中仅在鸟瞰场景中点击标注水平方向的目标中心。第二阶段在不完全监督下,以粗到精、级联的方式学习预测长方体和置信度得分,即仅对一小部分目标长方体进行精确标注。在 KITTI 数据集上,使用仅 500 个弱标注场景和 534 个精确标注的车辆实例,我们的方法实现了与当前顶级、全监督检测器(需要 3712 个经过详尽标注的场景和 15654 个实例)相当的 86-97%的性能。更重要的是,通过精心设计的网络架构,我们的训练模型可作为 3D 目标标注器,支持自动和主动(人机交互)两种工作模式。我们的模型生成的标注可用于训练 3D 目标检测器,实现超过 95%的原始性能(使用手动标注的训练数据)。我们的实验还表明,在提供更多训练数据时,我们的模型具有提高性能的潜力。上述设计使我们的方法具有高度实用性,并为降低标注成本学习 3D 检测开辟了机会。

相似文献

1
Towards a Weakly Supervised Framework for 3D Point Cloud Object Detection and Annotation.面向 3D 点云目标检测和标注的弱监督框架研究
IEEE Trans Pattern Anal Mach Intell. 2022 Aug;44(8):4454-4468. doi: 10.1109/TPAMI.2021.3063611. Epub 2022 Jul 1.
2
Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud.基于迁移学习的点云三维目标检测语义分割。
Sensors (Basel). 2021 Jun 8;21(12):3964. doi: 10.3390/s21123964.
3
Back to Reality: Learning Data-Efficient 3D Object Detector With Shape Guidance.回归现实:借助形状引导学习数据高效的3D目标检测器。
IEEE Trans Pattern Anal Mach Intell. 2024 Feb;46(2):1165-1180. doi: 10.1109/TPAMI.2023.3328880. Epub 2024 Jan 8.
4
Semi-supervised training using cooperative labeling of weakly annotated data for nodule detection in chest CT.基于弱标注数据的协同标注的半监督训练在胸部 CT 结节检测中的应用。
Med Phys. 2023 Jul;50(7):4255-4268. doi: 10.1002/mp.16219. Epub 2023 Jan 27.
5
Synthesizing Supervision for Learning Deep Saliency Network without Human Annotation.无人工标注的深度学习显著图网络合成监督学习
IEEE Trans Pattern Anal Mach Intell. 2020 Jul;42(7):1755-1769. doi: 10.1109/TPAMI.2019.2900649. Epub 2019 Feb 20.
6
From Points to Parts: 3D Object Detection From Point Cloud With Part-Aware and Part-Aggregation Network.从点到部件:基于部件感知与部件聚合网络的点云三维目标检测
IEEE Trans Pattern Anal Mach Intell. 2021 Aug;43(8):2647-2664. doi: 10.1109/TPAMI.2020.2977026. Epub 2021 Jul 1.
7
Weakly Supervised Monocular 3D Object Detection by Spatial-Temporal View Consistency.基于时空视图一致性的弱监督单目3D目标检测
IEEE Trans Pattern Anal Mach Intell. 2025 Jan;47(1):84-98. doi: 10.1109/TPAMI.2024.3466915. Epub 2024 Dec 4.
8
Weakly Supervised RGB-D Salient Object Detection With Prediction Consistency Training and Active Scribble Boosting.基于预测一致性训练和主动标注增强的弱监督 RGB-D 显著目标检测
IEEE Trans Image Process. 2022;31:2148-2161. doi: 10.1109/TIP.2022.3151999. Epub 2022 Mar 8.
9
Weakly-Supervised Learning of Category-Specific 3D Object Shapes.特定类别3D物体形状的弱监督学习
IEEE Trans Pattern Anal Mach Intell. 2021 Apr;43(4):1423-1437. doi: 10.1109/TPAMI.2019.2949562. Epub 2021 Mar 9.
10
EPGNet: Enhanced Point Cloud Generation for 3D Object Detection.EPGNet:用于 3D 目标检测的增强点云生成。
Sensors (Basel). 2020 Dec 4;20(23):6927. doi: 10.3390/s20236927.

引用本文的文献

1
HSPC-Net: A hierarchical shape-preserving completion network for machine part point cloud completion.HSPC-Net:一种用于机器零件点云补全的分层保形补全网络。
PLoS One. 2025 Aug 11;20(8):e0330033. doi: 10.1371/journal.pone.0330033. eCollection 2025.
2
SPS-RCNN: Semantic-Guided Proposal Sampling for 3D Object Detection from LiDAR Point Clouds.SPS-RCNN:用于从激光雷达点云进行3D目标检测的语义引导提议采样
Sensors (Basel). 2025 Feb 11;25(4):1064. doi: 10.3390/s25041064.
3
Semantic Segmentation of Terrestrial Laser Scans of Railway Catenary Arches: A Use Case Perspective.
铁路接触网拱形结构地面激光扫描的语义分割:一个用例视角。
Sensors (Basel). 2022 Dec 26;23(1):222. doi: 10.3390/s23010222.
4
LiDAR Intensity Completion: Fully Exploiting the Message from LiDAR Sensors.激光雷达强度补全:充分利用激光雷达传感器的信息
Sensors (Basel). 2022 Oct 4;22(19):7533. doi: 10.3390/s22197533.