• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于简单混合 CNN-Transformer 网络的图像调和。

Image harmonization with Simple Hybrid CNN-Transformer Network.

机构信息

School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, Shannxi, China; School of Artificial Intelligence, OPtics and ElectroNics (iOPEN), Northwestern Polytechnical University, Xi'an, 710072, Shannxi, China.

School of Artificial Intelligence, OPtics and ElectroNics (iOPEN), Northwestern Polytechnical University, Xi'an, 710072, Shannxi, China; Key Laboratory of Intelligent Interaction and Application (Northwestern Polytechnical University), Ministry of Industry and Information Technology, Northwestern Polytechnical University, Xi'an, 710072, Shannxi, China.

出版信息

Neural Netw. 2024 Dec;180:106673. doi: 10.1016/j.neunet.2024.106673. Epub 2024 Aug 30.

DOI:10.1016/j.neunet.2024.106673
PMID:39260009
Abstract

Image harmonization seeks to transfer the illumination distribution of the background to that of the foreground within a composite image. Existing methods lack the ability of establishing global-local pixel illumination dependencies between foreground and background of composite images, which is indispensable for sharp and color-consistent harmonized image generation. To overcome this challenge, we design a novel Simple Hybrid CNN-Transformer Network (SHT-Net), which is formulated into an efficient symmetrical hierarchical architecture. It is composed of two newly designed light-weight Transformer blocks. Firstly, the scale-aware gated block is designed to capture multi-scale features through different heads and expand the receptive fields, which facilitates to generate images with fine-grained details. Secondly, we introduce a simple parallel attention block, which integrates the window-based self-attention and gated channel attention in parallel, resulting in simultaneously global-local pixel illumination relationship modeling capability. Besides, we propose an efficient simple feed forward network to filter out less informative features and allow the features to contribute to generating photo-realistic harmonized results passing through. Extensive experiments on image harmonization benchmarks indicate that our method achieve promising quantitative and qualitative results. The code and pre-trained models are available at https://github.com/guanguanboy/SHT-Net.

摘要

图像调和旨在将复合图像中背景的光照分布转移到前景的光照分布。现有的方法缺乏在复合图像的前景和背景之间建立全局-局部像素光照依赖关系的能力,这对于生成清晰和颜色一致的调和图像是必不可少的。为了克服这一挑战,我们设计了一种新颖的简单混合 CNN-Transformer 网络(SHT-Net),它被构建成一个高效的对称分层架构。它由两个新设计的轻量级 Transformer 块组成。首先,设计了尺度感知门控块,通过不同的头捕获多尺度特征,并扩展感受野,从而有利于生成具有细粒度细节的图像。其次,我们引入了一个简单的并行注意力块,它将基于窗口的自注意力和门控通道注意力并行集成,从而同时具有全局-局部像素光照关系建模能力。此外,我们提出了一种有效的简单前馈网络,用于过滤掉信息量较少的特征,并允许特征通过传递生成逼真的调和结果。在图像调和基准上的广泛实验表明,我们的方法在定量和定性方面都取得了有希望的结果。代码和预训练模型可在 https://github.com/guanguanboy/SHT-Net 上获得。

相似文献

1
Image harmonization with Simple Hybrid CNN-Transformer Network.基于简单混合 CNN-Transformer 网络的图像调和。
Neural Netw. 2024 Dec;180:106673. doi: 10.1016/j.neunet.2024.106673. Epub 2024 Aug 30.
2
MultiTrans: Multi-branch transformer network for medical image segmentation.多分支转换器网络在医学图像分割中的应用。
Comput Methods Programs Biomed. 2024 Sep;254:108280. doi: 10.1016/j.cmpb.2024.108280. Epub 2024 Jun 8.
3
VSmTrans: A hybrid paradigm integrating self-attention and convolution for 3D medical image segmentation.VSmTrans:一种融合自注意力机制和卷积的 3D 医学图像分割混合范式。
Med Image Anal. 2024 Dec;98:103295. doi: 10.1016/j.media.2024.103295. Epub 2024 Aug 24.
4
ETU-Net: edge enhancement-guided U-Net with transformer for skin lesion segmentation.ETU-Net:基于边缘增强引导的 U-Net 与 Transformer 的皮肤病变分割。
Phys Med Biol. 2023 Dec 22;69(1). doi: 10.1088/1361-6560/ad13d2.
5
MSCT-UNET: multi-scale contrastive transformer within U-shaped network for medical image segmentation.MSCT-UNET:U 形网络中的多尺度对比变换用于医学图像分割。
Phys Med Biol. 2023 Dec 28;69(1). doi: 10.1088/1361-6560/ad135d.
6
HCformer: Hybrid CNN-Transformer for LDCT Image Denoising.HCformer:用于 LDCT 图像去噪的混合 CNN-Transformer。
J Digit Imaging. 2023 Oct;36(5):2290-2305. doi: 10.1007/s10278-023-00842-9. Epub 2023 Jun 29.
7
EMOST: A dual-branch hybrid network for medical image fusion via efficient model module and sparse transformer.EMOST:一种基于高效模型模块和稀疏 Transformer 的医学图像融合双分支混合网络。
Comput Biol Med. 2024 Sep;179:108771. doi: 10.1016/j.compbiomed.2024.108771. Epub 2024 Jul 5.
8
CSAP-UNet: Convolution and self-attention paralleling network for medical image segmentation with edge enhancement.CSAP-UNet:用于医学图像分割的具有边缘增强的卷积和自注意力并行网络。
Comput Biol Med. 2024 Apr;172:108265. doi: 10.1016/j.compbiomed.2024.108265. Epub 2024 Mar 7.
9
HEA-Net: Attention and MLP Hybrid Encoder Architecture for Medical Image Segmentation.HEA-Net:用于医学图像分割的注意力和 MLP 混合编码器架构。
Sensors (Basel). 2022 Sep 16;22(18):7024. doi: 10.3390/s22187024.
10
ScribFormer: Transformer Makes CNN Work Better for Scribble-Based Medical Image Segmentation.ScribFormer:Transformer 使 CNN 更适用于基于草图的医学图像分割。
IEEE Trans Med Imaging. 2024 Jun;43(6):2254-2265. doi: 10.1109/TMI.2024.3363190. Epub 2024 Jun 3.