• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于图像恢复的基于曼巴的两阶段扩散模型。

Two-stage Mamba-based diffusion model for image restoration.

作者信息

Liu Lei, Ma Luan, Wang Shuai, Wang Jun, Melo Silas N

机构信息

School of Computer Science and Technology, Huaibei Normal University, Huaibei, 235000, China.

Huaibei Key Laboratory of Digital Multimedia Intelligent Information Processing, Huaibei Normal University, Huaibei, 235000, China.

出版信息

Sci Rep. 2025 Jul 1;15(1):22265. doi: 10.1038/s41598-025-07032-3.

DOI:10.1038/s41598-025-07032-3
PMID:40595135
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12216552/
Abstract

Image restoration is fundamental in computer vision to restore high-quality images from degraded ones. Recently, models such as the transformer and diffusion have shown notable success in addressing this challenge. However, transformer-based methods face high computational costs due to quadratic complexity, while diffusion-based methods often struggle with suboptimal results due to inaccurate noise estimation. This study proposes Diff-Mamba, a two-stage adaptive Mamba-based diffusion model for image restoration. Diff-Mamba integrates the linear complexity state space model (SSM, also known as Mamba) into image restoration, expanding its applicability to visual data generation. Diff-Mamba mainly consists of two parts: the diffusion state space model (DSSM) and the diffusion feedforward neural network (DFNN). DSSM combines Mamba's high efficiency with the representative power of diffusion models, enhancing both inference and training. DFNN regulates the information flow, enabling each depthwise convolutional layer to focus on the details of image, thus learning more effective local structures for image restoration. The study's findings, verified through extensive experiments, indicate that Diff-Mamba outperforms both diffusion-based and transformer-based methods in image deraining, denoising, and deblurring, demonstrating competitive restoration performance with various commonly used datasets. Code is available at https://github.com/maluan-ml/Diff-Mamba.

摘要

图像恢复是计算机视觉中的基础任务,旨在从退化图像中恢复高质量图像。最近,诸如Transformer和扩散模型等在应对这一挑战方面取得了显著成功。然而,基于Transformer的方法由于二次复杂度而面临高计算成本,而基于扩散的方法则常常因噪声估计不准确而难以获得最优结果。本研究提出了Diff-Mamba,一种基于Mamba的两阶段自适应扩散模型用于图像恢复。Diff-Mamba将线性复杂度状态空间模型(SSM,也称为Mamba)集成到图像恢复中,扩展了其在视觉数据生成中的适用性。Diff-Mamba主要由两部分组成:扩散状态空间模型(DSSM)和扩散前馈神经网络(DFNN)。DSSM将Mamba的高效性与扩散模型的代表性能力相结合,增强了推理和训练效果。DFNN调节信息流,使每个深度卷积层能够专注于图像细节,从而学习更有效的局部结构用于图像恢复。通过大量实验验证的研究结果表明,Diff-Mamba在图像去雨、去噪和去模糊方面优于基于扩散和基于Transformer的方法,在各种常用数据集上展现出具有竞争力的恢复性能。代码可在https://github.com/maluan-ml/Diff-Mamba获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/984709886e2e/41598_2025_7032_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/17547b4fc1ce/41598_2025_7032_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/c9f394220678/41598_2025_7032_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/686269714b88/41598_2025_7032_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/a5f74ac5955f/41598_2025_7032_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/9346576ceb09/41598_2025_7032_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/954961d70e4d/41598_2025_7032_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/f22df988a3b8/41598_2025_7032_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/ab792f1b6a8e/41598_2025_7032_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/984709886e2e/41598_2025_7032_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/17547b4fc1ce/41598_2025_7032_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/c9f394220678/41598_2025_7032_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/686269714b88/41598_2025_7032_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/a5f74ac5955f/41598_2025_7032_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/9346576ceb09/41598_2025_7032_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/954961d70e4d/41598_2025_7032_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/f22df988a3b8/41598_2025_7032_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/ab792f1b6a8e/41598_2025_7032_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/12216552/984709886e2e/41598_2025_7032_Fig9_HTML.jpg

相似文献

1
Two-stage Mamba-based diffusion model for image restoration.用于图像恢复的基于曼巴的两阶段扩散模型。
Sci Rep. 2025 Jul 1;15(1):22265. doi: 10.1038/s41598-025-07032-3.
2
VMDU-net: a dual encoder multi-scale fusion network for polyp segmentation with Vision Mamba and Cross-Shape Transformer integration.VMDU-net:一种用于息肉分割的双编码器多尺度融合网络,集成了视觉曼巴和十字形变换器
Front Artif Intell. 2025 Jun 18;8:1557508. doi: 10.3389/frai.2025.1557508. eCollection 2025.
3
Selective State Space Models Outperform Transformers at Predicting RNA-Seq Read Coverage.在预测RNA测序读段覆盖度方面,选择性状态空间模型优于Transformer模型。
bioRxiv. 2025 Feb 17:2025.02.13.638190. doi: 10.1101/2025.02.13.638190.
4
MHS-VIT: Mamba hybrid self-attention vision transformers for traffic image detection.MHS-VIT:用于交通图像检测的曼巴混合自注意力视觉Transformer
PLoS One. 2025 Jun 30;20(6):e0325962. doi: 10.1371/journal.pone.0325962. eCollection 2025.
5
Diff-UNet: A diffusion embedded network for robust 3D medical image segmentation.Diff-UNet:一种用于稳健3D医学图像分割的扩散嵌入网络。
Med Image Anal. 2025 Oct;105:103654. doi: 10.1016/j.media.2025.103654. Epub 2025 Jun 16.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of topotecan for ovarian cancer.拓扑替康治疗卵巢癌的临床有效性和成本效益的快速系统评价。
Health Technol Assess. 2001;5(28):1-110. doi: 10.3310/hta5280.
7
Exploring the Potential of Electroencephalography Signal-Based Image Generation Using Diffusion Models: Integrative Framework Combining Mixed Methods and Multimodal Analysis.利用扩散模型探索基于脑电图信号的图像生成潜力:结合混合方法和多模态分析的综合框架
JMIR Med Inform. 2025 Jun 25;13:e72027. doi: 10.2196/72027.
8
STVMamba: precipitation nowcasting with spatiotemporal prediction model.STVMamba:基于时空预测模型的降水临近预报
Sci Rep. 2025 Jul 2;15(1):22568. doi: 10.1038/s41598-025-05902-4.
9
Real-world defocus deblurring via score-based diffusion models.通过基于分数的扩散模型实现真实世界散焦去模糊
Sci Rep. 2025 Jul 2;15(1):22942. doi: 10.1038/s41598-025-07326-6.
10
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

本文引用的文献

1
Considering Image Information and Self-Similarity: A Compositional Denoising Network.考虑图像信息与自相似性:一种组合去噪网络
Sensors (Basel). 2023 Jun 26;23(13):5915. doi: 10.3390/s23135915.
2
Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising.超越高斯去噪器:用于图像去噪的深度 CNN 的残差学习。
IEEE Trans Image Process. 2017 Jul;26(7):3142-3155. doi: 10.1109/TIP.2017.2662206. Epub 2017 Feb 1.