基于广义霍夫变换和自适应n移位洗牌注意力机制的三维实例分割

Three-Dimensional Instance Segmentation Using the Generalized Hough Transform and the Adaptive n-Shifted Shuffle Attention.

作者信息

Mulindwa Desire Burume, Du Shengzhi, Liu Qingxue

机构信息

Department of Electrical Engineering, Tshwane University of Technology, Pretoria 0001, South Africa.

School of Mechanical and Electrical Engineering, Kunming University, Kunming 650214, China.

出版信息

Sensors (Basel). 2024 Nov 12;24(22):7215. doi: 10.3390/s24227215.

DOI:10.3390/s24227215

PMID:39598992

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11598058/

Abstract

The progress of 3D instance segmentation techniques has made it essential for several applications, such as augmented reality, autonomous driving, and robotics. Traditional methods usually have challenges with complex indoor scenes made of multiple objects with different occlusions and orientations. In this work, the authors present an innovative model that integrates a new adaptive n-shifted shuffle (ANSS) attention mechanism with the Generalized Hough Transform (GHT) for robust 3D instance segmentation of indoor scenes. The proposed technique leverages the n-shifted sigmoid activation function, which improves the adaptive shuffle attention mechanism, permitting the network to dynamically focus on relevant features across various regions. A learnable shuffling pattern is produced through the proposed ANSS attention mechanism to spatially rearrange the relevant features, thus augmenting the model's ability to capture the object boundaries and their fine-grained details. The integration of GHT furnishes a vigorous framework to localize and detect objects in the 3D space, even when heavy noise and partial occlusions are present. The authors evaluate the proposed method on the challenging Stanford 3D Indoor Spaces Dataset (S3DIS), where it establishes its superiority over existing methods. The proposed approach achieves state-of-the-art performance in both mean Intersection over Union (IoU) and overall accuracy, showcasing its potential for practical deployment in real-world scenarios. These results illustrate that the integration of the ANSS and the GHT yields a robust solution for 3D instance segmentation tasks.

摘要

3D实例分割技术的进步使其在增强现实、自动驾驶和机器人技术等多个应用领域变得至关重要。传统方法在处理由多个具有不同遮挡和方向的物体组成的复杂室内场景时通常面临挑战。在这项工作中，作者提出了一种创新模型，该模型将一种新的自适应n移位混洗（ANSS）注意力机制与广义霍夫变换（GHT）相结合，用于室内场景的稳健3D实例分割。所提出的技术利用n移位Sigmoid激活函数，改进了自适应混洗注意力机制，使网络能够动态地关注各个区域的相关特征。通过所提出的ANSS注意力机制产生一个可学习的混洗模式，以便在空间上重新排列相关特征，从而增强模型捕捉物体边界及其细粒度细节的能力。即使存在大量噪声和部分遮挡，GHT的集成也为在3D空间中定位和检测物体提供了一个强大的框架。作者在具有挑战性的斯坦福3D室内空间数据集（S3DIS）上评估了所提出的方法，该方法在该数据集上显示出优于现有方法的优势。所提出的方法在平均交并比（IoU）和总体准确率方面均达到了当前的最优性能，展示了其在实际场景中实际部署的潜力。这些结果表明，ANSS和GHT的集成产生了一种用于3D实例分割任务的稳健解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b181/11598058/8db95a3604df/sensors-24-07215-g001.jpg

相似文献

Three-Dimensional Instance Segmentation Using the Generalized Hough Transform and the Adaptive n-Shifted Shuffle Attention.基于广义霍夫变换和自适应n移位洗牌注意力机制的三维实例分割

Sensors (Basel). 2024 Nov 12;24(22):7215. doi: 10.3390/s24227215.

Transformer guided self-adaptive network for multi-scale skin lesion image segmentation.Transformer 引导的自适网络用于多尺度皮肤病变图像分割。

Comput Biol Med. 2024 Feb;169:107846. doi: 10.1016/j.compbiomed.2023.107846. Epub 2023 Dec 23.

Detection, segmentation, and 3D pose estimation of surgical tools using convolutional neural networks and algebraic geometry.使用卷积神经网络和代数几何进行手术工具的检测、分割和三维姿态估计。

Med Image Anal. 2021 May;70:101994. doi: 10.1016/j.media.2021.101994. Epub 2021 Feb 7.

Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像（MRI）中进行脑肿瘤分割与检测

Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.

Generalized Hough transform for 3D object recognition and visualization in integral imaging.整体成像中的 3D 目标识别和可视化的广义霍夫变换。

J Opt Soc Am A Opt Image Sci Vis. 2023 Apr 1;40(4):C37-C45. doi: 10.1364/JOSAA.482640.

Semantic segmentation of autonomous driving scenes based on multi-scale adaptive attention mechanism.基于多尺度自适应注意力机制的自动驾驶场景语义分割

Front Neurosci. 2023 Oct 19;17:1291674. doi: 10.3389/fnins.2023.1291674. eCollection 2023.

Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding.Lowis3D：语言驱动的开放世界实例级3D场景理解

IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):8517-8533. doi: 10.1109/TPAMI.2024.3410324. Epub 2024 Nov 6.

A Dual-Channel and Frequency-Aware Approach for Lightweight Video Instance Segmentation.一种用于轻量级视频实例分割的双通道和频率感知方法。

Sensors (Basel). 2025 Jan 14;25(2):459. doi: 10.3390/s25020459.

SwinDAF3D: Pyramid Swin Transformers with Deep Attentive Features for Automated Finger Joint Segmentation in 3D Ultrasound Images for Rheumatoid Arthritis Assessment.SwinDAF3D：用于类风湿性关节炎评估的3D超声图像中自动手指关节分割的具有深度注意力特征的金字塔Swin变换器

Bioengineering (Basel). 2025 Apr 5;12(4):390. doi: 10.3390/bioengineering12040390.

Guided Depth Completion with Instance Segmentation Fusion in Autonomous Driving Applications.自动驾驶应用中的实例分割融合引导深度补全。

Sensors (Basel). 2022 Dec 7;22(24):9578. doi: 10.3390/s22249578.

引用本文的文献

Stand Structure Extraction and Analysis of Communities in Qianjiazhai, Ailao Mountain, China, Based on Backpack Laser Scanning.基于背包激光扫描的中国哀牢山千家寨林分结构提取与群落分析

Plants (Basel). 2025 Aug 11;14(16):2485. doi: 10.3390/plants14162485.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于广义霍夫变换和自适应n移位洗牌注意力机制的三维实例分割

Three-Dimensional Instance Segmentation Using the Generalized Hough Transform and the Adaptive n-Shifted Shuffle Attention.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献