NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding.

Authors

Zhai Hongjia, Huang Gan, Hu Qirui, Li Guanglin, Bao Hujun, Zhang Guofeng

Publication information

IEEE Trans Vis Comput Graph. 2024 Nov;30(11):7129-7139. doi: 10.1109/TVCG.2024.3456201. Epub 2024 Oct 10.

DOI: 10.1109/TVCG.2024.3456201
PMID: 39255118
Abstract

In recent years, neural implicit representations have gained substantial attention in Simultaneous Localization and Mapping (SLAM). However, existing approaches leave a notable gap in scene understanding. In this paper, we introduce NIS-SLAM, an efficient neural implicit semantic RGB-D SLAM system that leverages a pre-trained 2D segmentation network to learn consistent semantic representations. Specifically, for high-fidelity surface reconstruction and spatially consistent scene understanding, we combine high-frequency multi-resolution tetrahedron-based features and low-frequency positional encoding as the implicit scene representation. To address the inconsistency of 2D segmentation results across multiple views, we propose a fusion strategy that integrates the semantic probabilities from previous non-keyframes into keyframes to achieve consistent semantic learning. Furthermore, we implement confidence-based pixel sampling and a progressive optimization weight function for robust camera tracking. Extensive experiments on various datasets show that our system performs better than or competitively with existing neural dense implicit RGB-D SLAM approaches. Finally, we show that our approach can be used in augmented reality applications. Project page: https://zju3dv.github.io/nis_slam.
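
The abstract does not say how the semantic probabilities from non-keyframes are integrated into keyframes. As a rough illustration only, the sketch below fuses per-class probabilities for a single pixel by multiplying the distributions and renormalizing, a naive Bayesian update that is a common choice in semantic mapping and not necessarily what NIS-SLAM does. The function name `fuse_semantic_probs`, the helper `argmax`, the two-class example values, and the premise that the pixel has already been associated across frames (for example via depth and camera poses) are all assumptions made for this example.

```python
from typing import List, Sequence


def fuse_semantic_probs(
    observations: Sequence[Sequence[float]], eps: float = 1e-8
) -> List[float]:
    """Fuse class probabilities observed for one pixel in several frames.

    Hypothetical helper: multiplies the distributions class by class and
    renormalizes after each step (assumes independent observations).
    """
    if not observations:
        raise ValueError("need at least one observation")
    num_classes = len(observations[0])
    fused = [1.0] * num_classes
    for probs in observations:
        if len(probs) != num_classes:
            raise ValueError("all observations must have the same length")
        # Clamp to eps so a single zero does not permanently veto a class.
        fused = [f * max(p, eps) for f, p in zip(fused, probs)]
        total = sum(fused)
        fused = [f / total for f in fused]
    return fused


def argmax(values: Sequence[float]) -> int:
    """Index of the largest value."""
    return max(range(len(values)), key=values.__getitem__)


# Illustrative values: the keyframe alone weakly prefers class 1,
# while two earlier non-keyframes prefer class 0.
keyframe = [0.45, 0.55]
non_keyframes = [[0.8, 0.2], [0.7, 0.3]]

fused = fuse_semantic_probs([keyframe] + non_keyframes)
label = argmax(fused)  # class 0 wins once the earlier views are fused in
```

In this toy case the fused distribution overrides the keyframe's noisy single-view prediction, which is the kind of multi-view consistency the fusion strategy is meant to provide.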

Similar articles

1. NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding.
   IEEE Trans Vis Comput Graph. 2024 Nov;30(11):7129-7139. doi: 10.1109/TVCG.2024.3456201. Epub 2024 Oct 10.
2. Robust and Efficient CPU-Based RGB-D Scene Reconstruction.
   Sensors (Basel). 2018 Oct 28;18(11):3652. doi: 10.3390/s18113652.
3. Dense RGB-D Semantic Mapping with Pixel-Voxel Neural Network.
   Sensors (Basel). 2018 Sep 14;18(9):3099. doi: 10.3390/s18093099.
4. SLAM-based dense surface reconstruction in monocular Minimally Invasive Surgery and its application to Augmented Reality.
   Comput Methods Programs Biomed. 2018 May;158:135-146. doi: 10.1016/j.cmpb.2018.02.006. Epub 2018 Feb 8.
5. DGFlow-SLAM: A Novel Dynamic Environment RGB-D SLAM without Prior Semantic Knowledge Based on Grid Segmentation of Scene Flow.
   Biomimetics (Basel). 2022 Oct 13;7(4):163. doi: 10.3390/biomimetics7040163.
6. DiT-SLAM: Real-Time Dense Visual-Inertial SLAM with Implicit Depth Representation and Tightly-Coupled Graph Optimization.
   Sensors (Basel). 2022 Apr 28;22(9):3389. doi: 10.3390/s22093389.
7. Robust RGB-D SLAM Using Point and Line Features for Low Textured Scene.
   Sensors (Basel). 2020 Sep 2;20(17):4984. doi: 10.3390/s20174984.
8. CVIDS: A Collaborative Localization and Dense Mapping Framework for Multi-Agent Based Visual-Inertial SLAM.
   IEEE Trans Image Process. 2022;31:6562-6576. doi: 10.1109/TIP.2022.3213189. Epub 2022 Oct 21.
9. Linear RGB-D SLAM for Structured Environments.
   IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):8403-8419. doi: 10.1109/TPAMI.2021.3106820. Epub 2022 Oct 4.
10. Dense RGB-D SLAM with Multiple Cameras.
    Sensors (Basel). 2018 Jul 2;18(7):2118. doi: 10.3390/s18072118.