• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过潜在风格空间操纵实现可控的无监督雪景合成。

Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation.

作者信息

Yang Hanting, Carballo Alexander, Zhang Yuxiao, Takeda Kazuya

机构信息

Graduate School of Informatics, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan.

Faculty of Engineering, Graduate School of Engineering, Gifu University, 1-1 Yanagido, Gifu City 501-1193, Japan.

出版信息

Sensors (Basel). 2023 Oct 12;23(20):8398. doi: 10.3390/s23208398.

DOI:10.3390/s23208398
PMID:37896492
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10611349/
Abstract

In the field of intelligent vehicle technology, there is a high dependence on images captured under challenging conditions to develop robust perception algorithms. However, acquiring these images can be both time-consuming and dangerous. To address this issue, unpaired image-to-image translation models offer a solution by synthesizing samples of the desired domain, thus eliminating the reliance on ground truth supervision. However, the current methods predominantly focus on single projections rather than multiple solutions, not to mention controlling the direction of generation, which creates a scope for enhancement. In this study, we propose a generative adversarial network (GAN)-based model, which incorporates both a style encoder and a content encoder, specifically designed to extract relevant information from an image. Further, we employ a decoder to reconstruct an image using these encoded features, while ensuring that the generated output remains within a permissible range by applying a self-regression module to constrain the style latent space. By modifying the hyperparameters, we can generate controllable outputs with specific style codes. We evaluate the performance of our model by generating snow scenes on the Cityscapes and the EuroCity Persons datasets. The results reveal the effectiveness of our proposed methodology, thereby reinforcing the benefits of our approach in the ongoing evolution of intelligent vehicle technology.

摘要

在智能车辆技术领域,为了开发强大的感知算法,对在具有挑战性的条件下拍摄的图像有高度的依赖性。然而,获取这些图像既耗时又危险。为了解决这个问题,无配对图像到图像翻译模型通过合成所需领域的样本提供了一种解决方案,从而消除了对真实监督的依赖。然而,当前的方法主要集中在单一投影上,而不是多种解决方案,更不用说控制生成方向了,这就为改进留下了空间。在本研究中,我们提出了一种基于生成对抗网络(GAN)的模型,该模型结合了风格编码器和内容编码器,专门设计用于从图像中提取相关信息。此外,我们使用解码器利用这些编码特征重建图像,同时通过应用自回归模块来约束风格潜在空间,确保生成的输出保持在允许的范围内。通过修改超参数,我们可以生成具有特定风格代码的可控输出。我们通过在Cityscapes和EuroCity Persons数据集上生成雪景来评估我们模型的性能。结果揭示了我们提出的方法的有效性,从而强化了我们的方法在智能车辆技术不断发展中的优势。

相似文献

1
Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation.通过潜在风格空间操纵实现可控的无监督雪景合成。
Sensors (Basel). 2023 Oct 12;23(20):8398. doi: 10.3390/s23208398.
2
Exploring Explicit Domain Supervision for Latent Space Disentanglement in Unpaired Image-to-Image Translation.探索用于无配对图像到图像翻译中潜在空间解缠的显式域监督
IEEE Trans Pattern Anal Mach Intell. 2021 Apr;43(4):1254-1266. doi: 10.1109/TPAMI.2019.2950198. Epub 2021 Mar 5.
3
Gated-GAN: Adversarial Gated Networks for Multi-Collection Style Transfer.门控 GAN:用于多集合风格迁移的对抗门控网络。
IEEE Trans Image Process. 2019 Feb;28(2):546-560. doi: 10.1109/TIP.2018.2869695. Epub 2018 Sep 12.
4
2D facial landmark localization method for multi-view face synthesis image using a two-pathway generative adversarial network approach.基于双通路生成对抗网络方法的多视角人脸合成图像的二维面部地标定位方法
PeerJ Comput Sci. 2022 Feb 16;8:e897. doi: 10.7717/peerj-cs.897. eCollection 2022.
5
Homomorphic Interpolation Network for Unpaired Image-to-Image Translation.同态插值网络在非配对图像到图像翻译中的应用。
IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2534-2547. doi: 10.1109/TPAMI.2020.3036543. Epub 2022 Apr 1.
6
Unpaired Artistic Portrait Style Transfer via Asymmetric Double-Stream GAN.通过非对称双流生成对抗网络实现非配对艺术肖像风格迁移
IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):5427-5439. doi: 10.1109/TNNLS.2023.3263846. Epub 2023 Sep 1.
7
NG-GAN: A Robust Noise-Generation Generative Adversarial Network for Generating Old-Image Noise.NG-GAN:一种用于生成旧图像噪声的强大噪声生成生成对抗网络。
Sensors (Basel). 2022 Dec 26;23(1):251. doi: 10.3390/s23010251.
8
Generative adversarial networks with decoder-encoder output noises.生成对抗网络与解码器编码器输出噪声。
Neural Netw. 2020 Jul;127:19-28. doi: 10.1016/j.neunet.2020.04.005. Epub 2020 Apr 9.
9
In-Domain GAN Inversion for Faithful Reconstruction and Editability.用于忠实重建和可编辑性的领域内生成对抗网络反演
IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):2607-2621. doi: 10.1109/TPAMI.2023.3310872. Epub 2024 Apr 3.
10
Improving Skin Cancer Classification Using Heavy-Tailed Student T-Distribution in Generative Adversarial Networks (TED-GAN).在生成对抗网络(TED-GAN)中使用重尾学生T分布改进皮肤癌分类
Diagnostics (Basel). 2021 Nov 19;11(11):2147. doi: 10.3390/diagnostics11112147.

本文引用的文献

1
Framework for Generation and Removal of Multiple Types of Adverse Weather from Driving Scene Images.生成和去除驾驶场景图像中多种类型不良天气的框架。
Sensors (Basel). 2023 Jan 31;23(3):1548. doi: 10.3390/s23031548.
2
Deep Dense Multi-Scale Network for Snow Removal Using Semantic and Depth Priors.基于语义和深度先验的深度密集多尺度除雪网络
IEEE Trans Image Process. 2021;30:7419-7431. doi: 10.1109/TIP.2021.3104166. Epub 2021 Aug 30.
3
EuroCity Persons: A Novel Benchmark for Person Detection in Traffic Scenes.欧洲城市行人:交通场景中行人检测的一种新型基准
IEEE Trans Pattern Anal Mach Intell. 2019 Feb 5. doi: 10.1109/TPAMI.2019.2897684.
4
DesnowNet: Context-Aware Deep Network for Snow Removal.DesnowNet:用于除雪的上下文感知深度网络。
IEEE Trans Image Process. 2018 Feb 14. doi: 10.1109/TIP.2018.2806202.
5
Single Image Haze Removal Using Dark Channel Prior.基于暗通道先验的单幅图像去雾。
IEEE Trans Pattern Anal Mach Intell. 2011 Dec;33(12):2341-53. doi: 10.1109/TPAMI.2010.168. Epub 2010 Sep 9.