Multimedia Research Centre, Department of Computing Science, University of Alberta, Edmonton, AB T6G 2R3, Canada.
Sensors (Basel). 2020 Sep 7;20(18):5098. doi: 10.3390/s20185098.
The task of recognising an object and estimating its 6d pose in a scene has received considerable attention in recent years. The accessibility and low-cost of consumer RGB-D cameras, make object recognition and pose estimation feasible even for small industrial businesses. An example is the industrial assembly line, where a robotic arm should pick a small, textureless and mostly homogeneous object and place it in a designated location. Despite all the recent advancements of object recognition and pose estimation techniques in natural scenes, the problem remains challenging for industrial parts. In this paper, we present a framework to simultaneously recognise the object's class and estimate its 6d pose from RGB-D data. The proposed model adapts a global approach, where an object and the Region of Interest (ROI) are first recognised from RGB images. The object's pose is then estimated from the corresponding depth information. We train various classifiers based on extracted Histogram of Oriented Gradient (HOG) features to detect and recognize the objects. We then perform template matching on the point cloud based on surface normal and Fast Point Feature Histograms (FPFH) to estimate the pose of the object. Experimental results show that our system is quite efficient, accurate and robust to illumination and background changes, even for the challenging objects of Tless dataset.
近年来,识别场景中的物体并估计其 6d 姿态的任务受到了广泛关注。由于消费级 RGB-D 相机具有易用性和低成本的特点,即使是小型工业企业也能够实现物体识别和姿态估计。例如在工业装配线上,机械臂应该能够拾取一个小、无纹理且大多为同质的物体,并将其放置在指定的位置。尽管近年来在自然场景中的物体识别和姿态估计技术取得了诸多进展,但对于工业部件来说,这个问题仍然具有挑战性。在本文中,我们提出了一个从 RGB-D 数据中同时识别物体类别并估计其 6d 姿态的框架。所提出的模型采用了一种全局方法,首先从 RGB 图像中识别物体和感兴趣区域(ROI)。然后,根据相应的深度信息估计物体的姿态。我们基于提取的方向梯度直方图(HOG)特征训练各种分类器来检测和识别物体。然后,我们根据表面法向量和快速点特征直方图(FPFH)对点云中的模板进行匹配,以估计物体的姿态。实验结果表明,我们的系统对于光照和背景变化具有较高的效率、准确性和鲁棒性,即使是 Tless 数据集的挑战性物体也是如此。