• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RNNPose:通过循环对应场估计和姿态优化实现的6自由度物体姿态估计

RNNPose: 6-DoF Object Pose Estimation via Recurrent Correspondence Field Estimation and Pose Optimization.

作者信息

Xu Yan, Lin Kwan-Yee, Zhang Guofeng, Wang Xiaogang, Li Hongsheng

出版信息

IEEE Trans Pattern Anal Mach Intell. 2024 Jul;46(7):4669-4683. doi: 10.1109/TPAMI.2024.3360181. Epub 2024 Jun 5.

DOI:10.1109/TPAMI.2024.3360181
PMID:38289852
Abstract

6-DoF object pose estimation from a monocular image is a challenging problem, where a post-refinement procedure is generally needed for high-precision estimation. In this paper, we propose a framework, dubbed RNNPose, based on a recurrent neural network (RNN) for object pose refinement, which is robust to erroneous initial poses and occlusions. During the recurrent iterations, object pose refinement is formulated as a non-linear least squares problem based on the estimated correspondence field (between a rendered image and the observed image). The problem is then solved by a differentiable Levenberg-Marquardt (LM) algorithm enabling end-to-end training. The correspondence field estimation and pose refinement are conducted alternately in each iteration to improve the object poses. Furthermore, to improve the robustness against occlusion, we introduce a consistency-check mechanism based on the learned descriptors of the 3D model and observed 2D images, which downweights the unreliable correspondences during pose optimization. We evaluate RNNPose on several public datasets, including LINEMOD, Occlusion-LINEMOD, YCB-Video and TLESS. We demonstrate state-of-the-art performance and strong robustness against severe clutter and occlusion in the scenes. Extensive experiments validate the effectiveness of our proposed method. Besides, the extended system based on RNNPose successfully generalizes to multi-instance scenarios and achieves top-tier performance on the TLESS dataset.

摘要

从单目图像进行六自由度物体位姿估计是一个具有挑战性的问题,通常需要一个后处理步骤来进行高精度估计。在本文中,我们提出了一个名为RNNPose的框架,它基于循环神经网络(RNN)用于物体位姿优化,对错误的初始位姿和遮挡具有鲁棒性。在循环迭代过程中,物体位姿优化被表述为基于估计的对应场(在渲染图像和观察图像之间)的非线性最小二乘问题。然后通过可微的列文伯格-马夸尔特(LM)算法解决该问题,实现端到端训练。在每次迭代中,对应场估计和位姿优化交替进行以改善物体位姿。此外,为了提高对遮挡的鲁棒性,我们引入了一种基于3D模型和观察到的2D图像的学习描述符的一致性检查机制,该机制在位姿优化期间降低不可靠对应关系的权重。我们在几个公共数据集上评估了RNNPose,包括LINEMOD、Occlusion-LINEMOD、YCB-Video和TLESS。我们展示了在场景中针对严重 clutter 和遮挡的最优性能和强大鲁棒性。广泛的实验验证了我们提出方法的有效性。此外,基于RNNPose的扩展系统成功推广到多实例场景,并在TLESS数据集上取得了顶级性能。

相似文献

1
RNNPose: 6-DoF Object Pose Estimation via Recurrent Correspondence Field Estimation and Pose Optimization.RNNPose:通过循环对应场估计和姿态优化实现的6自由度物体姿态估计
IEEE Trans Pattern Anal Mach Intell. 2024 Jul;46(7):4669-4683. doi: 10.1109/TPAMI.2024.3360181. Epub 2024 Jun 5.
2
DPODv2: Dense Correspondence-Based 6 DoF Pose Estimation.DPODv2:基于密集对应关系的 6 自由度位姿估计。
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7417-7435. doi: 10.1109/TPAMI.2021.3118833. Epub 2022 Oct 4.
3
PVNet: Pixel-Wise Voting Network for 6DoF Object Pose Estimation.PVNet:用于 6DoF 对象位姿估计的像素级投票网络。
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):3212-3223. doi: 10.1109/TPAMI.2020.3047388. Epub 2022 May 5.
4
Iterative Pose Refinement for Object Pose Estimation Based on RGBD Data.基于 RGBD 数据的目标位姿估计的迭代姿态精化。
Sensors (Basel). 2020 Jul 24;20(15):4114. doi: 10.3390/s20154114.
5
Robust 6-DoF Pose Estimation under Hybrid Constraints.混合约束下的稳健 6-DoF 位姿估计。
Sensors (Basel). 2022 Nov 12;22(22):8758. doi: 10.3390/s22228758.
6
EPro-PnP: Generalized End-to-End Probabilistic Perspective-N-Points for Monocular Object Pose Estimation.EPro-PnP:用于单目物体位姿估计的广义端到端概率视角N点法
IEEE Trans Pattern Anal Mach Intell. 2024 Jan 16;PP. doi: 10.1109/TPAMI.2024.3354997.
7
Multi-level feature fusion and joint refinement for simultaneous object pose estimation and camera localization.用于同时进行目标位姿估计和相机定位的多层次特征融合和联合细化。
Neural Netw. 2024 Jun;174:106238. doi: 10.1016/j.neunet.2024.106238. Epub 2024 Mar 16.
8
DeepHMap++: Combined Projection Grouping and Correspondence Learning for Full DoF Pose Estimation.DeepHMap++:用于全自由度姿态估计的组合投影分组与对应学习
Sensors (Basel). 2019 Feb 28;19(5):1032. doi: 10.3390/s19051032.
9
Line-Based 6-DoF Object Pose Estimation and Tracking With an Event Camera.基于线的六自由度物体姿态估计与事件相机跟踪
IEEE Trans Image Process. 2024;33:4765-4780. doi: 10.1109/TIP.2024.3445736. Epub 2024 Aug 30.
10
6-D Object Pose Estimation Based on Point Pair Matching for Robotic Grasp Detection.基于点对匹配的6D物体姿态估计用于机器人抓取检测
IEEE Trans Neural Netw Learn Syst. 2025 Jul;36(7):11902-11916. doi: 10.1109/TNNLS.2024.3442433.

引用本文的文献

1
ScaleFormer architecture for scale invariant human pose estimation with enhanced mixed features.用于具有增强混合特征的尺度不变人体姿态估计的ScaleFormer架构。
Sci Rep. 2025 Jul 30;15(1):27754. doi: 10.1038/s41598-025-12620-4.
2
Recurrent multi-view 6DoF pose estimation for marker-less surgical tool tracking.用于无标记手术工具跟踪的循环多视图6自由度姿态估计
Int J Comput Assist Radiol Surg. 2025 Aug;20(8):1589-1599. doi: 10.1007/s11548-025-03436-8. Epub 2025 Jun 17.
3
MagicCubePose, A more comprehensive 6D pose estimation network.
魔方姿态估计网络 MagicCubePose:一种更全面的 6D 姿态估计网络。
Sci Rep. 2023 Apr 28;13(1):6923. doi: 10.1038/s41598-023-32936-3.