• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MSSPA-GC:基于 3D 图卷积的多尺度形状先验自适应的类别级物体位姿估计。

MSSPA-GC: Multi-Scale Shape Prior Adaptation with 3D Graph Convolutions for Category-Level Object Pose Estimation.

机构信息

University of Science and Technology of China, Hefei, 230027, Anhui, China.

University of Science and Technology of China, Hefei, 230027, Anhui, China; Anhui Province Key Laboratory of Software in Computing and Communication, Hefei, 230027, Anhui, China; USTC-Deqing Alpha Innovation Research Institute, Huzhou, 313299, Zhejiang, China.

出版信息

Neural Netw. 2023 Sep;166:609-621. doi: 10.1016/j.neunet.2023.07.037. Epub 2023 Jul 31.

DOI:10.1016/j.neunet.2023.07.037
PMID:37597505
Abstract

Category-level object pose estimation aims to predict the 6D object pose and size of arbitrary objects from known categories. It remains a challenge due to the large intra-class shape variation. Recently, the introduction of the shape prior adaptation mechanism into the normalized canonical coordinates (i.e., NOCS) reconstruction process has been shown to be effective in mitigating the intra-class shape variation. However, existing shape prior adaptation methods simply map the observed point cloud to the normalized object space, and the extracted object descriptors are not sufficient for the perception of the object pose. As a result, they fail to predict the pose of objects with complex geometric structures (e.g., cameras). To this end, this paper proposes a novel shape prior adaption method named MSSPA-GC for category-level object pose estimation. Specifically, our main network takes the observed instance point cloud converted from the RGB-D image and the prior shape point cloud pre-trained on the object CAD models as inputs. Then, a novel 3D graph convolution network and a PointNet-like MLP network are designed to extract pose-aware object features and shape-aware object features from these two inputs, respectively. After that, the two-stream object features are aggregated through a multi-scale feature propagation mechanism to generate comprehensive 3D object descriptors that maintain both pose-sensitive geometric stability and intra-class shape consistency. Finally, by leveraging object descriptors aware of both object pose and shape when reconstructing the NOCS coordinates, our approach elegantly achieves state-of-the-art performance on the widely used REAL275 and CAMERA25 datasets using only 25% of the parameters compared with existing shape prior adaptation models. Moreover, our method also exhibits decent generalization ability on the unconstrained REDWOOD75 dataset.

摘要

类别级目标位姿估计旨在从已知类别中预测任意目标的 6D 目标位姿和大小。由于类内形状变化较大,这仍然是一个挑战。最近,将形状先验自适应机制引入归一化标准坐标(即 NOCS)重建过程中,已被证明可有效减轻类内形状变化。然而,现有的形状先验自适应方法只是将观测点云映射到归一化物体空间,并且提取的物体描述符不足以感知物体位姿。因此,它们无法预测具有复杂几何结构(例如相机)的物体的位姿。为此,本文提出了一种新的类别级目标位姿估计的形状先验自适应方法,名为 MSSPA-GC。具体来说,我们的主网络以从 RGB-D 图像转换的观测实例点云和在物体 CAD 模型上预先训练的先验形状点云作为输入。然后,设计了一种新颖的 3D 图卷积网络和一种类似于 PointNet 的 MLP 网络,分别从这两个输入中提取位姿感知物体特征和形状感知物体特征。之后,通过多尺度特征传播机制将两流物体特征聚合起来,生成同时保持位姿敏感几何稳定性和类内形状一致性的综合 3D 物体描述符。最后,通过在重建 NOCS 坐标时利用同时感知物体位姿和形状的物体描述符,我们的方法在仅使用现有形状先验自适应模型 25%参数的情况下,在广泛使用的 REAL275 和 CAMERA25 数据集上实现了最先进的性能。此外,我们的方法在不受约束的 REDWOOD75 数据集上也表现出了相当不错的泛化能力。

相似文献

1
MSSPA-GC: Multi-Scale Shape Prior Adaptation with 3D Graph Convolutions for Category-Level Object Pose Estimation.MSSPA-GC:基于 3D 图卷积的多尺度形状先验自适应的类别级物体位姿估计。
Neural Netw. 2023 Sep;166:609-621. doi: 10.1016/j.neunet.2023.07.037. Epub 2023 Jul 31.
2
6D-ViT: Category-Level 6D Object Pose Estimation via Transformer-Based Instance Representation Learning.6D-ViT:基于Transformer的实例表示学习的类别级6D物体姿态估计
IEEE Trans Image Process. 2022;31:6907-6921. doi: 10.1109/TIP.2022.3216980. Epub 2022 Nov 3.
3
Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud.图卷积网络在点云中进行 3D 物体位姿估计。
Sensors (Basel). 2022 Oct 25;22(21):8166. doi: 10.3390/s22218166.
4
Category-Level 6-D Object Pose Estimation With Shape Deformation for Robotic Grasp Detection.用于机器人抓取检测的基于形状变形的6-D类别级物体位姿估计
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):1857-1871. doi: 10.1109/TNNLS.2023.3330011. Epub 2025 Jan 7.
5
Instance-level 6D pose estimation based on multi-task parameter sharing for robotic grasping.基于多任务参数共享的实例级6D姿态估计用于机器人抓取。
Sci Rep. 2024 Apr 2;14(1):7801. doi: 10.1038/s41598-024-58590-x.
6
Category-Level Object Pose Estimation with Statistic Attention.基于统计注意力的类别级目标姿态估计
Sensors (Basel). 2024 Aug 19;24(16):5347. doi: 10.3390/s24165347.
7
Corr-Track: Category-Level 6D Pose Tracking with Soft-Correspondence Matrix Estimation.Corr-Track:基于软对应矩阵估计的类别级6D姿态跟踪
IEEE Trans Vis Comput Graph. 2024 May;30(5):2173-2183. doi: 10.1109/TVCG.2024.3372111. Epub 2024 Apr 19.
8
GC-MLP: Graph Convolution MLP for Point Cloud Analysis.GC-MLP:用于点云分析的图卷积 MLP。
Sensors (Basel). 2022 Dec 5;22(23):9488. doi: 10.3390/s22239488.
9
Multi-level feature fusion and joint refinement for simultaneous object pose estimation and camera localization.用于同时进行目标位姿估计和相机定位的多层次特征融合和联合细化。
Neural Netw. 2024 Jun;174:106238. doi: 10.1016/j.neunet.2024.106238. Epub 2024 Mar 16.
10
Marker-Less 3d Object Recognition and 6d Pose Estimation for Homogeneous Textureless Objects: An RGB-D Approach.无标记三维物体识别和同质无纹理物体六自由度位姿估计:RGB-D 方法。
Sensors (Basel). 2020 Sep 7;20(18):5098. doi: 10.3390/s20185098.

引用本文的文献

1
Explainable artificial intelligence to quantify adenoid hypertrophy-related upper airway obstruction using 3D Shape Analysis.利用3D形状分析的可解释人工智能来量化腺样体肥大相关的上气道阻塞。
J Dent. 2025 May;156:105689. doi: 10.1016/j.jdent.2025.105689. Epub 2025 Mar 14.