
Hybrid CNN-Mamba model for multi-scale fundus image enhancement.

Author Information

Wang Xiaopeng, Gong Di, Chen Yi, Zong Zheng, Li Meng, Fan Kun, Jia Lina, Cao Qiyuan, Liu Qiang, Yang Qiang

Affiliations

Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China.

China-Japan Friendship Hospital, Beijing 100029, China.

Publication Information

Biomed Opt Express. 2025 Feb 20;16(3):1104-1117. doi: 10.1364/BOE.542471. eCollection 2025 Mar 1.

Abstract

This study proposes a multi-scale fundus image enhancement approach that combines a CNN with Mamba, demonstrating clear superiority across multiple benchmarks. The model consistently achieves top performance on public datasets, with the lowest FID and KID scores and the highest PSNR and SSIM values, excelling in particular at larger image resolutions. Notably, its performance improves as image size increases, with several metrics reaching their optimal values at 1024 × 1024 resolution. Its scale generalizability further highlights the model's exceptional structural-preservation capability. Additionally, its high VSD and IoU scores in segmentation tasks further validate its practical effectiveness, making it a valuable tool for enhancing fundus images and improving diagnostic accuracy.
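The per-image metrics named above (PSNR, SSIM, IoU) are standard and straightforward to reproduce. The snippet below is a minimal, hypothetical sketch of how such scores are commonly computed with NumPy and scikit-image; it is not the authors' evaluation code, and the file names, image range, and mask threshold are placeholder assumptions.

```python
# Hypothetical sketch of the fidelity and segmentation metrics named in the
# abstract (PSNR, SSIM, IoU); not the authors' evaluation code.
import numpy as np
from skimage import io
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def iou(pred_mask: np.ndarray, gt_mask: np.ndarray) -> float:
    """Intersection-over-Union between two binary segmentation masks."""
    pred = pred_mask.astype(bool)
    gt = gt_mask.astype(bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        return 1.0  # both masks empty: treat as perfect agreement
    return float(np.logical_and(pred, gt).sum() / union)

# Placeholder inputs: an enhanced fundus image and its clean reference,
# assumed to be RGB arrays with values in [0, 255].
enhanced = io.imread("enhanced_fundus.png")
reference = io.imread("reference_fundus.png")

psnr = peak_signal_noise_ratio(reference, enhanced, data_range=255)
ssim = structural_similarity(reference, enhanced, data_range=255, channel_axis=-1)
print(f"PSNR: {psnr:.2f} dB, SSIM: {ssim:.4f}")

# Placeholder binary vessel masks for the downstream segmentation check.
pred_mask = io.imread("pred_vessels.png") > 127
gt_mask = io.imread("gt_vessels.png") > 127
print(f"IoU: {iou(pred_mask, gt_mask):.4f}")
```

FID and KID, by contrast, compare feature distributions over whole image sets (typically via a pretrained Inception network) rather than individual image pairs, so they are computed dataset-wide rather than per image.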


Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f107/11919352/e064ee68d873/boe-16-3-1104-g001.jpg
