Chai Xinghua, Yang Jianyong, Yan Xiangming, Di Chengliang, Ye Tao
54th Research Institute of China Electronics Technology Group Corporation, Shijiazhuang 050081, China.
School of Mechanical Electronic and Information Engineering, China University of Mining and Technology, Beijing 100083, China.
Sensors (Basel). 2023 Nov 18;23(22):9261. doi: 10.3390/s23229261.
An autonomous place recognition system is essential for scenarios where GPS is useless, such as underground tunnels. However, it is difficult to use existing algorithms to fully utilize the small number of effective features in underground tunnel data, and recognition accuracy is difficult to guarantee. In order to solve this challenge, an efficient point cloud position recognition algorithm, named Dual-Attention Transformer Network (DAT-Net), is proposed in this paper. The algorithm firstly adopts the farthest point downsampling module to eliminate the invalid redundant points in the point cloud data and retain the basic shape of the point cloud, which reduces the size of the point cloud and, at the same time, reduces the influence of the invalid point cloud on the data analysis. After that, this paper proposes the dual-attention Transformer module to facilitate local information exchange by utilizing the multi-head self-attention mechanism. It extracts local descriptors and integrates highly discriminative global descriptors based on global context with the help of a feature fusion layer to obtain a more accurate and robust global feature representation. Experimental results show that the method proposed in this paper achieves an average F1 score of 0.841 on the SubT-Tunnel dataset and outperforms many existing state-of-the-art algorithms in recognition accuracy and robustness tests.
对于全球定位系统(GPS)无用的场景,如地下隧道,自主位置识别系统至关重要。然而,利用现有算法充分利用地下隧道数据中少量的有效特征存在困难,且识别精度难以保证。为解决这一挑战,本文提出了一种高效的点云位置识别算法,即双注意力Transformer网络(DAT-Net)。该算法首先采用最远点下采样模块,消除点云数据中的无效冗余点,保留点云的基本形状,这既减小了点云的大小,同时也降低了无效点云对数据分析的影响。之后,本文提出双注意力Transformer模块,通过利用多头自注意力机制促进局部信息交换。它提取局部描述符,并借助特征融合层基于全局上下文整合具有高度判别力的全局描述符,以获得更准确、更鲁棒的全局特征表示。实验结果表明,本文提出的方法在SubT-Tunnel数据集上的平均F1分数达到0.841,在识别精度和鲁棒性测试中优于许多现有的先进算法。