Li Kunmo, Ou Yongsheng, Ning Jian, Kong Fanchang, Cai Haiyang, Li Haoyang
School of Control Science and Engineering, Dalian University of Technology, Dalian 116024, China.
School of Computer Science, Wuhan University, Wuhan 430072, China.
Sensors (Basel). 2025 Jun 29;25(13):4056. doi: 10.3390/s25134056.
Visual Place Recognition (VPR) is a pivotal task in computer vision and robotics. Prevailing VPR methods predominantly rely on RGB features for query-image retrieval and correspondence establishment. However, such unimodal visual representations are inherently susceptible to environmental variations, which inevitably degrades recognition accuracy. To address this problem, we propose a robust VPR framework that integrates the RGB and depth modalities. The architecture follows a coarse-to-fine paradigm: global retrieval first selects the top-N candidate images using fused multimodal features, and geometric verification of these candidates then leverages depth information. A Discrete Wavelet Transform Fusion (DWTF) module generates robust multimodal global descriptors by combining RGB and depth data via the discrete wavelet transform. Furthermore, we introduce a Spiking Neuron Graph Matching (SNGM) module, which extracts geometric structure and spatial distances from the depth data and employs graph matching to establish accurate depth-feature correspondences. Extensive experiments on several VPR benchmarks demonstrate that our method achieves state-of-the-art performance while maintaining the best accuracy-efficiency trade-off.
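The abstract does not specify the internals of the DWTF module; the sketch below illustrates one plausible reading under stated assumptions: a single-level 2-D Haar transform per modality, averaging of the low-frequency (LL) subbands, and max-energy selection on the high-frequency subbands, flattened into one global descriptor. The function names (`haar_dwt2`, `dwt_fuse`), the fusion weight `alpha`, and the coefficient-selection rule are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def haar_dwt2(x):
    """Single-level 2-D Haar DWT.

    Splits an array with even height/width into the four standard
    subbands (LL, LH, HL, HH), each half the spatial resolution.
    """
    a = x[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # low-frequency approximation
    lh = (a + b - c - d) / 2.0  # horizontal detail
    hl = (a - b + c - d) / 2.0  # vertical detail
    hh = (a - b - c + d) / 2.0  # diagonal detail
    return ll, lh, hl, hh

def dwt_fuse(rgb_feat, depth_feat, alpha=0.5):
    """Fuse two single-channel feature maps in the wavelet domain.

    Low-frequency bands are blended with weight `alpha`; for each
    high-frequency band the coefficient with larger magnitude
    (higher local energy) wins. The fused subbands are flattened
    into one global descriptor vector.
    """
    subs_r = haar_dwt2(rgb_feat)
    subs_d = haar_dwt2(depth_feat)
    ll = alpha * subs_r[0] + (1.0 - alpha) * subs_d[0]
    highs = [np.where(np.abs(hr) >= np.abs(hd), hr, hd)
             for hr, hd in zip(subs_r[1:], subs_d[1:])]
    return np.concatenate([ll.ravel()] + [h.ravel() for h in highs])
```

Candidate retrieval would then rank database descriptors by, e.g., cosine similarity to the query's fused descriptor before the depth-based geometric verification stage.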