• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多视图融合的机器人室内场景感知三维目标检测。

Multi-View Fusion-Based 3D Object Detection for Robot Indoor Scene Perception.

机构信息

State Key Laboratory of Robotics and System, Harbin Institute of Technology, Harbin 150001, China.

School of Physical Sciences, University of Science and Technology of China, Hefei 230026, China.

出版信息

Sensors (Basel). 2019 Sep 21;19(19):4092. doi: 10.3390/s19194092.

DOI:10.3390/s19194092
PMID:31546674
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6806321/
Abstract

To autonomously move and operate objects in cluttered indoor environments, a service robot requires the ability of 3D scene perception. Though 3D object detection can provide an object-level environmental description to fill this gap, a robot always encounters incomplete object observation, recurring detections of the same object, error in detection, or intersection between objects when conducting detection continuously in a cluttered room. To solve these problems, we propose a two-stage 3D object detection algorithm which is to fuse multiple views of 3D object point clouds in the first stage and to eliminate unreasonable and intersection detections in the second stage. For each view, the robot performs a 2D object semantic segmentation and obtains 3D object point clouds. Then, an unsupervised segmentation method called Locally Convex Connected Patches (LCCP) is utilized to segment the object accurately from the background. Subsequently, the Manhattan Frame estimation is implemented to calculate the main orientation of the object and subsequently, the 3D object bounding box can be obtained. To deal with the detected objects in multiple views, we construct an object database and propose an object fusion criterion to maintain it automatically. Thus, the same object observed in multi-view is fused together and a more accurate bounding box can be calculated. Finally, we propose an object filtering approach based on prior knowledge to remove incorrect and intersecting objects in the object dataset. Experiments are carried out on both SceneNN dataset and a real indoor environment to verify the stability and accuracy of 3D semantic segmentation and bounding box detection of the object with multi-view fusion.

摘要

为了在杂乱的室内环境中自主移动和操作物体,服务机器人需要具备 3D 场景感知能力。虽然 3D 目标检测可以提供对象级别的环境描述来填补这一空白,但机器人在杂乱的房间中连续进行检测时,总会遇到对象观察不完整、同一对象重复检测、检测错误或对象之间的交叉等问题。为了解决这些问题,我们提出了一种两阶段 3D 目标检测算法,该算法在第一阶段融合多个 3D 目标点云视图,并在第二阶段消除不合理和交叉检测。对于每个视图,机器人执行 2D 目标语义分割并获得 3D 目标点云。然后,使用一种名为局部凸连接补丁(LCCP)的无监督分割方法从背景中准确地分割出对象。随后,实施曼哈顿框架估计以计算对象的主要方向,随后可以获得 3D 对象边界框。为了处理多个视图中的检测对象,我们构建了一个对象数据库并提出了一个对象融合标准来自动维护它。因此,多视图中观察到的相同对象被融合在一起,可以计算出更准确的边界框。最后,我们提出了一种基于先验知识的对象过滤方法,以去除对象数据集中的错误和交叉对象。在 SceneNN 数据集和真实室内环境中进行了实验,以验证多视图融合的 3D 语义分割和边界框检测的稳定性和准确性。

相似文献

1
Multi-View Fusion-Based 3D Object Detection for Robot Indoor Scene Perception.基于多视图融合的机器人室内场景感知三维目标检测。
Sensors (Basel). 2019 Sep 21;19(19):4092. doi: 10.3390/s19194092.
2
Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception.基于多通道卷积神经网络的室内机器人环境感知 3D 目标检测
Sensors (Basel). 2019 Feb 21;19(4):893. doi: 10.3390/s19040893.
3
Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds.点云中的 3D 目标检测的精细化投票和场景特征融合。
Comput Intell Neurosci. 2022 Dec 29;2022:3023934. doi: 10.1155/2022/3023934. eCollection 2022.
4
Data-Driven Indoor Scene Modeling from a Single Color Image with Iterative Object Segmentation and Model Retrieval.基于迭代目标分割和模型检索的单幅彩色图像数据驱动室内场景建模
IEEE Trans Vis Comput Graph. 2020 Apr;26(4):1702-1715. doi: 10.1109/TVCG.2018.2880737. Epub 2018 Nov 12.
5
Semantic Labeling and Instance Segmentation of 3D Point Clouds Using Patch Context Analysis and Multiscale Processing.基于面片上下文分析和多尺度处理的三维点云语义标注与实例分割
IEEE Trans Vis Comput Graph. 2020 Jul;26(7):2485-2498. doi: 10.1109/TVCG.2018.2889944. Epub 2018 Dec 27.
6
Stabilization and Validation of 3D Object Position Using Multimodal Sensor Fusion and Semantic Segmentation.使用多模态传感器融合和语义分割技术稳定和验证三维物体位置。
Sensors (Basel). 2020 Feb 18;20(4):1110. doi: 10.3390/s20041110.
7
Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud.基于迁移学习的点云三维目标检测语义分割。
Sensors (Basel). 2021 Jun 8;21(12):3964. doi: 10.3390/s21123964.
8
Efficient point cloud segmentation approach using energy optimization with geometric features for 3D scene understanding.一种基于能量优化并结合几何特征的高效点云分割方法,用于3D场景理解。
J Opt Soc Am A Opt Image Sci Vis. 2021 Jan 1;38(1):60-70. doi: 10.1364/JOSAA.410458.
9
Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility.基于深度学习的道路和铁路智能交通实时3D多目标检测与定位
J Imaging. 2021 Aug 12;7(8):145. doi: 10.3390/jimaging7080145.
10
Clouds of Oriented Gradients for 3D Detection of Objects, Surfaces, and Indoor Scene Layouts.用于三维物体、表面及室内场景布局检测的定向梯度云
IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2670-2683. doi: 10.1109/TPAMI.2019.2923201. Epub 2019 Jun 17.

引用本文的文献

1
, Enhancing Long-Term Consistency of Object-Oriented Semantic Maps in Robotics.面向机器人的目标导向语义图的长期一致性增强。
Sensors (Basel). 2022 Jul 15;22(14):5308. doi: 10.3390/s22145308.
2
Autofocus Entropy Repositioning Method Bioinspired in the Magnetic Field Memory of the Bees Applied to Pollination.自动对焦熵重定位方法仿生蜜蜂磁场记忆在授粉中的应用。
Sensors (Basel). 2021 Sep 16;21(18):6198. doi: 10.3390/s21186198.
3
Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud.基于迁移学习的点云三维目标检测语义分割。

本文引用的文献

1
Mask R-CNN.Mask R-CNN。
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):386-397. doi: 10.1109/TPAMI.2018.2844175. Epub 2018 Jun 5.
2
A Robust 3D-2D Interactive Tool for Scene Segmentation and Annotation.一种用于场景分割与标注的强大3D-2D交互式工具。
IEEE Trans Vis Comput Graph. 2018 Dec;24(12):3005-3018. doi: 10.1109/TVCG.2017.2772238. Epub 2017 Nov 20.
3
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
Sensors (Basel). 2021 Jun 8;21(12):3964. doi: 10.3390/s21123964.
4
Visual Saliency Detection for Over-Temperature Regions in 3D Space via Dual-Source Images.基于双源图像的三维空间过温区域视觉显著性检测
Sensors (Basel). 2020 Jun 17;20(12):3414. doi: 10.3390/s20123414.
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.