DR-GAN: Distribution Regularization for Text-to-Image Generation.

Authors

Tan Hongchen, Liu Xiuping, Yin Baocai, Li Xin

Publication information

IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10309-10323. doi: 10.1109/TNNLS.2022.3165573. Epub 2023 Nov 30.

DOI:10.1109/TNNLS.2022.3165573
PMID:35442894
Abstract

This article presents a new text-to-image (T2I) generation model, named distribution regularization generative adversarial network (DR-GAN), which generates images from text descriptions through improved distribution learning. In DR-GAN, we introduce two novel modules: a semantic disentangling module (SDM) and a distribution normalization module (DNM). SDM combines the spatial self-attention mechanism (SSAM) and a new semantic disentangling loss (SDL) to help the generator distill key semantic information for image generation. DNM uses a variational auto-encoder (VAE) to normalize and denoise the image latent distribution, which can help the discriminator better distinguish synthesized images from real images. DNM also adopts a distribution adversarial loss (DAL) to guide the generator to align with normalized real image distributions in the latent space. Extensive experiments on two public datasets demonstrated that DR-GAN achieved competitive performance in the T2I task. Code: https://github.com/Tan-H-C/DR-GAN-Distribution-Regularization-for-Text-to-Image-Generation.
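
The abstract says that DNM uses a VAE to normalize the image latent distribution but does not give the formulation. As a rough illustration only, the sketch below shows the standard VAE ingredients that such a normalization typically relies on: reparameterized sampling and the KL divergence that pulls a diagonal Gaussian latent toward N(0, I). The function names `reparameterize` and `kl_to_standard_normal` are hypothetical, and nothing here is taken from the authors' code, which may differ substantially.

```python
import math
import random

def reparameterize(mu, log_var, rng):
    """Draw z = mu + sigma * eps with eps ~ N(0, 1), element-wise."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]

def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(sigma^2)) || N(0, I) ), summed over latent dimensions."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv
                     for m, lv in zip(mu, log_var))

# Hypothetical 8-dimensional latent code (the dimension is an assumption).
rng = random.Random(0)
mu = [0.0] * 8
log_var = [0.0] * 8

z = reparameterize(mu, log_var, rng)

# A latent that already matches N(0, I) incurs no penalty.
kl_matched = kl_to_standard_normal(mu, log_var)

# Shifting every mean by 1 adds 0.5 per dimension.
kl_shifted = kl_to_standard_normal([1.0] * 8, log_var)
```

Minimizing a term like `kl_to_standard_normal` is what drives encoded latents toward a normalized distribution in a generic VAE. How DR-GAN combines this with its distribution adversarial loss is described in the paper and the linked repository, not here.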

Similar articles

1
DR-GAN: Distribution Regularization for Text-to-Image Generation.
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10309-10323. doi: 10.1109/TNNLS.2022.3165573. Epub 2023 Nov 30.
2
KT-GAN: Knowledge-Transfer Generative Adversarial Network for Text-to-Image Synthesis.
IEEE Trans Image Process. 2021;30:1275-1290. doi: 10.1109/TIP.2020.3026728. Epub 2020 Dec 23.
3
Functional brain network identification and fMRI augmentation using a VAE-GAN framework.
Comput Biol Med. 2023 Oct;165:107395. doi: 10.1016/j.compbiomed.2023.107395. Epub 2023 Sep 1.
4
SAM-GAN: Self-Attention supporting Multi-stage Generative Adversarial Networks for text-to-image synthesis.
Neural Netw. 2021 Jun;138:57-67. doi: 10.1016/j.neunet.2021.01.023. Epub 2021 Feb 10.
5
Semi-supervised segmentation of lesion from breast ultrasound images with attentional generative adversarial network.
Comput Methods Programs Biomed. 2020 Jun;189:105275. doi: 10.1016/j.cmpb.2019.105275. Epub 2019 Dec 12.
6
DualG-GAN, a Dual-channel Generator based Generative Adversarial Network for text-to-face synthesis.
Neural Netw. 2022 Nov;155:155-167. doi: 10.1016/j.neunet.2022.08.016. Epub 2022 Aug 19.
7
Improving Skin Cancer Classification Using Heavy-Tailed Student T-Distribution in Generative Adversarial Networks (TED-GAN).
Diagnostics (Basel). 2021 Nov 19;11(11):2147. doi: 10.3390/diagnostics11112147.
8
Generative adversarial networks with decoder-encoder output noises.
Neural Netw. 2020 Jul;127:19-28. doi: 10.1016/j.neunet.2020.04.005. Epub 2020 Apr 9.
9
Image Generation from Text Using StackGAN with Improved Conditional Consistency Regularization.
Sensors (Basel). 2022 Dec 26;23(1):249. doi: 10.3390/s23010249.
10
Word self-update contrastive adversarial networks for text-to-image synthesis.
Neural Netw. 2023 Oct;167:433-444. doi: 10.1016/j.neunet.2023.08.038. Epub 2023 Aug 25.

Cited by

1
Dual decoding generative adversarial networks for infrared image enhancement.
Sci Rep. 2025 Jul 1;15(1):21423. doi: 10.1038/s41598-025-06538-0.
2
DRSegNet: A cutting-edge approach to Diabetic Retinopathy segmentation and classification using parameter-aware Nature-Inspired optimization.
PLoS One. 2024 Dec 5;19(12):e0312016. doi: 10.1371/journal.pone.0312016. eCollection 2024.
3
Art design integrating visual relation and affective semantics based on Convolutional Block Attention Mechanism-generative adversarial network model.
PeerJ Comput Sci. 2024 Aug 30;10:e2274. doi: 10.7717/peerj-cs.2274. eCollection 2024.
4
uRP: An integrated research platform for one-stop analysis of medical images.
Front Radiol. 2023 Apr 18;3:1153784. doi: 10.3389/fradi.2023.1153784. eCollection 2023.