Semantic Layout Manipulation With High-Resolution Sparse Attention.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3768-3782. doi: 10.1109/TPAMI.2022.3181587. Epub 2023 Feb 3.

DOI: 10.1109/TPAMI.2022.3181587
PMID: 35696464

Abstract

We tackle the problem of semantic image layout manipulation, which aims to manipulate an input image by editing its semantic label map. A core problem of this task is how to transfer visual details from the input images to the new semantic layout while making the resulting image visually realistic. Recent work on learning cross-domain correspondence has shown promising results for global layout transfer with dense attention-based warping. However, this method tends to lose texture details due to the resolution limitation and the lack of smoothness constraint on correspondence. To adapt this paradigm for the layout manipulation task, we propose a high-resolution sparse attention module that effectively transfers visual details to new layouts at a resolution up to 512x512. To further improve visual quality, we introduce a novel generator architecture consisting of a semantic encoder and a two-stage decoder for coarse-to-fine synthesis. Experiments on the ADE20k and Places365 datasets demonstrate that our proposed approach achieves substantial improvements over the existing inpainting and layout manipulation methods.
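
The abstract does not describe how the sparse attention module chooses which source locations to attend to. As a rough sketch only, the following assumes a top-k rule: each location in the edited layout attends to its k most similar source locations and blends their values. The function name `sparse_attention_warp`, the cosine similarity, the `temperature` parameter, and the top-k rule are all illustrative assumptions, not the authors' implementation.

```python
import heapq
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / (norm + 1e-8)


def sparse_attention_warp(queries, keys, values, k=2, temperature=0.1):
    """Transfer `values` to the query positions using top-k attention.

    queries: feature vectors for positions in the edited layout (hypothetical)
    keys:    feature vectors for positions in the source image (hypothetical)
    values:  per-position source content to transfer, e.g. pixel colors
    """
    warped = []
    for q in queries:
        # Score this query against every source position.
        scores = [cosine(q, key) for key in keys]
        # Assumed sparsity rule: keep only the k best-matching positions.
        top = heapq.nlargest(k, range(len(keys)), key=scores.__getitem__)
        # Softmax over the retained scores (nlargest returns the best first).
        peak = scores[top[0]]
        weights = [math.exp((scores[i] - peak) / temperature) for i in top]
        total = sum(weights)
        # Weighted average of the selected source values.
        warped.append([
            sum(w * values[i][d] for w, i in zip(weights, top)) / total
            for d in range(len(values[0]))
        ])
    return warped
```

With `k=1` this reduces to copying the single best match, while a larger `k` blends several matches. This toy version still scores every query against every key, so it illustrates only the sparse aggregation step, not the memory savings a high-resolution module would need.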

Similar articles

1. Semantic Layout Manipulation With High-Resolution Sparse Attention.
   IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3768-3782. doi: 10.1109/TPAMI.2022.3181587. Epub 2023 Feb 3.
2. Layout-to-Image Translation With Double Pooling Generative Adversarial Networks.
   IEEE Trans Image Process. 2021;30:7903-7913. doi: 10.1109/TIP.2021.3109531. Epub 2021 Sep 20.
3. Edge-Semantic Learning Strategy for Layout Estimation in Indoor Environment.
   IEEE Trans Cybern. 2020 Jun;50(6):2730-2739. doi: 10.1109/TCYB.2019.2895837. Epub 2019 Feb 21.
4. Structure-Guided Image Completion With Image-Level and Object-Level Semantic Discriminators.
   IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):7669-7681. doi: 10.1109/TPAMI.2024.3393898. Epub 2024 Nov 6.
5. Pseudo Decoder Guided Light-Weight Architecture for Image Inpainting.
   IEEE Trans Image Process. 2022;31:6577-6590. doi: 10.1109/TIP.2022.3213444. Epub 2022 Oct 21.
6. Image Inpainting With Local and Global Refinement.
   IEEE Trans Image Process. 2022;31:2405-2420. doi: 10.1109/TIP.2022.3152624. Epub 2022 Mar 15.
7. Image Inpainting via Correlated Multi-Resolution Feature Projection.
   IEEE Trans Vis Comput Graph. 2024 Sep;30(9):5953-5964. doi: 10.1109/TVCG.2023.3315061. Epub 2024 Jul 31.
8. On the Diversity of Conditional Image Synthesis with Semantic Layouts.
   IEEE Trans Image Process. 2019 Jan 10. doi: 10.1109/TIP.2019.2891935.
9. A Novel Upsampling and Context Convolution for Image Semantic Segmentation.
   Sensors (Basel). 2021 Mar 20;21(6):2170. doi: 10.3390/s21062170.
10. Attention Guided Global Enhancement and Local Refinement Network for Semantic Segmentation.
    IEEE Trans Image Process. 2022;31:3211-3223. doi: 10.1109/TIP.2022.3166673. Epub 2022 Apr 22.

Cited by

1. Domain-Scalable Unpaired Image Translation via Latent Space Anchoring.
   IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):11707-11719. doi: 10.1109/TPAMI.2023.3287774. Epub 2023 Sep 5.