具有多方向选择机制的多分辨率视觉曼巴用于视网膜疾病检测

Multi-resolution visual Mamba with multi-directional selective mechanism for retinal disease detection.

作者信息

Zuo Qiankun, Shi Zhengkun, Liu Bo, Ping Na, Wang Jiangtao, Cheng Xi, Zhang Kexin, Guo Jia, Wu Yixian, Hong Jin

机构信息

Hubei Key Laboratory of Digital Finance Innovation, Hubei University of Economics, Wuhan, China.

School of Information Engineering, Hubei University of Economics, Wuhan, China.

出版信息

Front Cell Dev Biol. 2024 Oct 11;12:1484880. doi: 10.3389/fcell.2024.1484880. eCollection 2024.

DOI:10.3389/fcell.2024.1484880

PMID:39463765

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11512455/

Abstract

INTRODUCTION

Retinal diseases significantly impact patients' quality of life and increase social medical costs. Optical coherence tomography (OCT) offers high-resolution imaging for precise detection and monitoring of these conditions. While deep learning techniques have been employed to extract features from OCT images for classification, convolutional neural networks (CNNs) often fail to capture global context due to their focus on local receptive fields. Transformer-based methods, on the other hand, suffer from quadratic complexity when handling long-range dependencies.

METHODS

To overcome these limitations, we introduce the Multi-Resolution Visual Mamba (MRVM) model, which addresses long-range dependencies with linear computational complexity for OCT image classification. The MRVM model initially employs convolution to extract local features and subsequently utilizes the retinal Mamba to capture global dependencies. By integrating multi-scale global features, the MRVM enhances classification accuracy and overall performance. Additionally, the multi-directional selection mechanism (MSM) within the retinal Mamba improves feature extraction by concentrating on various directions, thereby better capturing complex, orientation-specific retinal patterns.

RESULTS

Experimental results demonstrate that the MRVM model excels in differentiating retinal images with various lesions, achieving superior detection accuracy compared to traditional methods, with overall accuracies of 98.98% and 96.21% on two public datasets, respectively.

DISCUSSION

This approach offers a novel perspective for accurately identifying retinal diseases and could contribute to the development of more robust artificial intelligence algorithms and recognition systems for medical image-assisted diagnosis.

摘要

引言

视网膜疾病严重影响患者的生活质量，并增加社会医疗成本。光学相干断层扫描（OCT）提供高分辨率成像，用于精确检测和监测这些病症。虽然深度学习技术已被用于从OCT图像中提取特征进行分类，但卷积神经网络（CNN）由于专注于局部感受野，往往无法捕捉全局上下文。另一方面，基于Transformer的方法在处理长程依赖时具有二次复杂度。

方法

为克服这些限制，我们引入了多分辨率视觉曼巴（MRVM）模型，该模型以线性计算复杂度处理长程依赖，用于OCT图像分类。MRVM模型首先采用卷积提取局部特征，随后利用视网膜曼巴捕捉全局依赖。通过整合多尺度全局特征，MRVM提高了分类准确率和整体性能。此外，视网膜曼巴中的多方向选择机制（MSM）通过关注不同方向来改进特征提取，从而更好地捕捉复杂的、特定方向的视网膜模式。

结果

实验结果表明，MRVM模型在区分具有各种病变的视网膜图像方面表现出色，与传统方法相比具有更高的检测准确率，在两个公共数据集上的总体准确率分别为98.98%和96.21%。

讨论

这种方法为准确识别视网膜疾病提供了新的视角，并可能有助于开发更强大的人工智能算法和医学图像辅助诊断识别系统。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8bc/11512455/c58266c12ba4/fcell-12-1484880-g001.jpg

相似文献

Multi-resolution visual Mamba with multi-directional selective mechanism for retinal disease detection.具有多方向选择机制的多分辨率视觉曼巴用于视网膜疾病检测

Front Cell Dev Biol. 2024 Oct 11;12:1484880. doi: 10.3389/fcell.2024.1484880. eCollection 2024.

HTC-retina: A hybrid retinal diseases classification model using transformer-Convolutional Neural Network from optical coherence tomography images.HTC-retina：一种使用来自光学相干断层扫描图像的变压器-卷积神经网络的混合视网膜疾病分类模型。

Comput Biol Med. 2024 Aug;178:108726. doi: 10.1016/j.compbiomed.2024.108726. Epub 2024 Jun 9.

ETU-Net: edge enhancement-guided U-Net with transformer for skin lesion segmentation.ETU-Net：基于边缘增强引导的 U-Net 与 Transformer 的皮肤病变分割。

Phys Med Biol. 2023 Dec 22;69(1). doi: 10.1088/1361-6560/ad13d2.

HCTNet: A Hybrid ConvNet-Transformer Network for Retinal Optical Coherence Tomography Image Classification.HCTNet：一种用于视网膜光学相干断层扫描图像分类的混合卷积神经网络-Transformer 网络。

Biosensors (Basel). 2022 Jul 20;12(7):542. doi: 10.3390/bios12070542.

Improved deep learning image classification algorithm based on Swin Transformer V2.基于Swin Transformer V2的改进型深度学习图像分类算法。

PeerJ Comput Sci. 2023 Oct 30;9:e1665. doi: 10.7717/peerj-cs.1665. eCollection 2023.

Multi-Scale-Denoising Residual Convolutional Network for Retinal Disease Classification Using OCT.基于 OCT 的视网膜病变分类的多尺度去噪残差卷积网络

Sensors (Basel). 2023 Dec 27;24(1):150. doi: 10.3390/s24010150.

A new visual State Space Model for low-dose CT denoising.一种用于低剂量CT去噪的新型视觉状态空间模型。

Med Phys. 2024 Dec;51(12):8851-8864. doi: 10.1002/mp.17387. Epub 2024 Sep 4.

FNeXter: A Multi-Scale Feature Fusion Network Based on ConvNeXt and Transformer for Retinal OCT Fluid Segmentation.FNeXter：一种基于ConvNeXt和Transformer的多尺度特征融合网络用于视网膜光学相干断层扫描液体分割

Sensors (Basel). 2024 Apr 10;24(8):2425. doi: 10.3390/s24082425.

Deep local-to-global feature learning for medical image super-resolution.用于医学图像超分辨率的深度局部到全局特征学习。

Comput Med Imaging Graph. 2024 Jul;115:102374. doi: 10.1016/j.compmedimag.2024.102374. Epub 2024 Mar 26.

G2ViT: Graph Neural Network-Guided Vision Transformer Enhanced Network for retinal vessel and coronary angiograph segmentation.G2ViT：基于图神经网络引导的视觉Transformer 增强网络，用于视网膜血管和冠状动脉造影分割。

Neural Netw. 2024 Aug;176:106356. doi: 10.1016/j.neunet.2024.106356. Epub 2024 May 3.

引用本文的文献

MSLI-Net: retinal disease detection network based on multi-segment localization and multi-scale interaction.MSLI-Net：基于多段定位和多尺度交互的视网膜疾病检测网络。

Front Cell Dev Biol. 2025 Jun 6;13:1608325. doi: 10.3389/fcell.2025.1608325. eCollection 2025.

Revolutionizing Chinese medicine granule placebo with a machine learning four-color model.利用机器学习四色模型革新中药颗粒安慰剂

Chin Med. 2025 Apr 1;20(1):43. doi: 10.1186/s13020-024-01055-0.

A mutual inclusion mechanism for precise boundary segmentation in medical images.一种用于医学图像精确边界分割的相互包含机制。

Front Bioeng Biotechnol. 2024 Dec 24;12:1504249. doi: 10.3389/fbioe.2024.1504249. eCollection 2024.

本文引用的文献

A New Brain Network Construction Paradigm for Brain Disorder via Diffusion-Based Graph Contrastive Learning.基于扩散的图对比学习的脑疾病新脑网络构建范式。

IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):10389-10403. doi: 10.1109/TPAMI.2024.3442811. Epub 2024 Nov 6.

Comput Biol Med. 2024 Aug;178:108726. doi: 10.1016/j.compbiomed.2024.108726. Epub 2024 Jun 9.

A new segmentation algorithm for peripapillary atrophy and optic disk from ultra-widefield Photographs.一种新的超广角照片中视盘旁萎缩和视盘的分割算法。

Comput Biol Med. 2024 Apr;172:108281. doi: 10.1016/j.compbiomed.2024.108281. Epub 2024 Mar 13.

Intelligent diagnosis of retinal vein occlusion based on color fundus photographs.基于彩色眼底照片的视网膜静脉阻塞智能诊断

Int J Ophthalmol. 2024 Jan 18;17(1):1-6. doi: 10.18240/ijo.2024.01.01. eCollection 2024.

DBPF-net: dual-branch structural feature extraction reinforcement network for ocular surface disease image classification.DBPF-net：用于眼表疾病图像分类的双分支结构特征提取强化网络

Front Med (Lausanne). 2024 Jan 4;10:1309097. doi: 10.3389/fmed.2023.1309097. eCollection 2023.

Alzheimer's Disease Prediction via Brain Structural-Functional Deep Fusing Network.基于脑结构-功能深度融合网络的阿尔茨海默病预测。

IEEE Trans Neural Syst Rehabil Eng. 2023;31:4601-4612. doi: 10.1109/TNSRE.2023.3333952. Epub 2023 Nov 23.

Brain Structure-Function Fusing Representation Learning Using Adversarial Decomposed-VAE for Analyzing MCI.使用对抗分解 VAE 融合脑结构-功能表示学习分析 MCI。

IEEE Trans Neural Syst Rehabil Eng. 2023;31:4017-4028. doi: 10.1109/TNSRE.2023.3323432. Epub 2023 Oct 18.

MBT: Model-Based Transformer for retinal optical coherence tomography image and video multi-classification.MBT：用于视网膜光学相干断层扫描图像和视频多分类的基于模型的Transformer

Int J Med Inform. 2023 Oct;178:105178. doi: 10.1016/j.ijmedinf.2023.105178. Epub 2023 Aug 21.

Artificial intelligence in retinal disease: clinical application, challenges, and future directions.人工智能在视网膜疾病中的应用：临床应用、挑战及未来方向。

Graefes Arch Clin Exp Ophthalmol. 2023 Nov;261(11):3283-3297. doi: 10.1007/s00417-023-06052-x. Epub 2023 May 9.

Automated detection of myopic maculopathy using five-category models based on vision outlooker for visual recognition.基于视觉展望者的五类模型自动检测近视性黄斑病变以进行视觉识别。

Front Comput Neurosci. 2023 Apr 20;17:1169464. doi: 10.3389/fncom.2023.1169464. eCollection 2023.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

具有多方向选择机制的多分辨率视觉曼巴用于视网膜疾病检测

Multi-resolution visual Mamba with multi-directional selective mechanism for retinal disease detection.

作者信息

机构信息

出版信息

INTRODUCTION

METHODS

RESULTS

DISCUSSION

引言

方法

结果

讨论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献