• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 GAN 的图像着色用于自监督视觉特征学习。

GAN-Based Image Colorization for Self-Supervised Visual Feature Learning.

机构信息

Faculty of Computer Science and Engineering, University Ss. Cyril and Methodius, 1000 Skopje, North Macedonia.

Instituto de Telecomunicações, Universidade da Beira Interior, 6200-001 Covilhã, Portugal.

出版信息

Sensors (Basel). 2022 Feb 18;22(4):1599. doi: 10.3390/s22041599.

DOI:10.3390/s22041599
PMID:35214498
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8880520/
Abstract

Large-scale labeled datasets are generally necessary for successfully training a deep neural network in the computer vision domain. In order to avoid the costly and tedious work of manually annotating image datasets, self-supervised learning methods have been proposed to learn general visual features automatically. In this paper, we first focus on image colorization with generative adversarial networks (GANs) because of their ability to generate the most realistic colorization results. Then, via transfer learning, we use this as a proxy task for visual understanding. Particularly, we propose to use conditional GANs (cGANs) for image colorization and transfer the gained knowledge to two other downstream tasks, namely, multilabel image classification and semantic segmentation. This is the first time that GANs have been used for self-supervised feature learning through image colorization. Through extensive experiments with the COCO and Pascal datasets, we show an increase of 5% for the classification task and 2.5% for the segmentation task. This demonstrates that image colorization with conditional GANs can boost other downstream tasks' performance without the need for manual annotation.

摘要

大规模的标记数据集通常是成功训练计算机视觉领域的深度神经网络所必需的。为了避免手动注释图像数据集的昂贵和繁琐的工作,已经提出了自监督学习方法来自动学习通用的视觉特征。在本文中,我们首先专注于使用生成对抗网络(GANs)进行图像着色,因为它们能够生成最逼真的着色效果。然后,通过迁移学习,我们将其用作视觉理解的代理任务。特别地,我们建议使用条件生成对抗网络(cGANs)进行图像着色,并将获得的知识转移到另外两个下游任务,即多标签图像分类和语义分割。这是首次通过图像着色使用 GANs 进行自监督特征学习。通过使用 COCO 和 Pascal 数据集进行广泛的实验,我们在分类任务中提高了 5%,在分割任务中提高了 2.5%。这表明,使用条件 GANs 进行图像着色可以在不需要手动注释的情况下提高其他下游任务的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/23b51149143a/sensors-22-01599-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/497c60d6b400/sensors-22-01599-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/4911ba342318/sensors-22-01599-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/3a85337ca64d/sensors-22-01599-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/15108641cf1d/sensors-22-01599-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/699d345cb252/sensors-22-01599-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/e28fc00633b1/sensors-22-01599-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/0d279da88529/sensors-22-01599-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/23b51149143a/sensors-22-01599-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/497c60d6b400/sensors-22-01599-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/4911ba342318/sensors-22-01599-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/3a85337ca64d/sensors-22-01599-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/15108641cf1d/sensors-22-01599-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/699d345cb252/sensors-22-01599-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/e28fc00633b1/sensors-22-01599-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/0d279da88529/sensors-22-01599-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/202c/8880520/23b51149143a/sensors-22-01599-g008.jpg

相似文献

1
GAN-Based Image Colorization for Self-Supervised Visual Feature Learning.基于 GAN 的图像着色用于自监督视觉特征学习。
Sensors (Basel). 2022 Feb 18;22(4):1599. doi: 10.3390/s22041599.
2
Semi-Supervised Semantic Image Segmentation by Deep Diffusion Models and Generative Adversarial Networks.基于深度扩散模型和生成对抗网络的半监督语义图像分割。
Int J Neural Syst. 2024 Nov;34(11):2450057. doi: 10.1142/S0129065724500576. Epub 2024 Aug 15.
3
The role of unpaired image-to-image translation for stain color normalization in colorectal cancer histology classification.非配对图像到图像翻译在结直肠癌组织学分类中用于染色颜色归一化的作用。
Comput Methods Programs Biomed. 2023 Jun;234:107511. doi: 10.1016/j.cmpb.2023.107511. Epub 2023 Mar 26.
4
Generative Adversarial Network Technologies and Applications in Computer Vision.生成对抗网络技术及其在计算机视觉中的应用。
Comput Intell Neurosci. 2020 Aug 1;2020:1459107. doi: 10.1155/2020/1459107. eCollection 2020.
5
High resolution histopathology image generation and segmentation through adversarial training.通过对抗训练生成和分割高分辨率组织病理学图像。
Med Image Anal. 2022 Jan;75:102251. doi: 10.1016/j.media.2021.102251. Epub 2021 Nov 3.
6
A deep learning segmentation strategy that minimizes the amount of manually annotated images.一种深度学习分割策略,可最大限度地减少手动标注图像的数量。
F1000Res. 2021 Mar 30;10:256. doi: 10.12688/f1000research.52026.2. eCollection 2021.
7
SpeckleGAN: a generative adversarial network with an adaptive speckle layer to augment limited training data for ultrasound image processing.SpeckleGAN:一种具有自适应散斑层的生成对抗网络,用于扩充有限的超声图像处理训练数据。
Int J Comput Assist Radiol Surg. 2020 Sep;15(9):1427-1436. doi: 10.1007/s11548-020-02203-1. Epub 2020 Jun 18.
8
Systematic Review of Generative Adversarial Networks (GANs) for Medical Image Classification and Segmentation.生成对抗网络(GANs)在医学图像分类和分割中的系统评价。
J Digit Imaging. 2022 Apr;35(2):137-152. doi: 10.1007/s10278-021-00556-w. Epub 2022 Jan 12.
9
Exploiting the potential of unlabeled endoscopic video data with self-supervised learning.利用自监督学习挖掘未标记内镜视频数据的潜力。
Int J Comput Assist Radiol Surg. 2018 Jun;13(6):925-933. doi: 10.1007/s11548-018-1772-0. Epub 2018 Apr 27.
10
Generative Adversarial Networks in Medical Image Processing.生成对抗网络在医学图像处理中的应用。
Curr Pharm Des. 2021;27(15):1856-1868. doi: 10.2174/1381612826666201125110710.

引用本文的文献

1
Generative AI for Cell Type-Specific Fluorescence Image Generation of hPSC-derived Cardiac Organoid.用于生成人多能干细胞来源的心脏类器官特定细胞类型荧光图像的生成式人工智能。
bioRxiv. 2024 Aug 8:2024.01.15.575724. doi: 10.1101/2024.01.15.575724.
2
Colorizing the Past: Deep Learning for the Automatic Colorization of Historical Aerial Images.为历史航拍图像自动上色:用于历史航拍图像自动上色的深度学习
J Imaging. 2022 Oct 1;8(10):269. doi: 10.3390/jimaging8100269.
3
Hint-Based Image Colorization Based on Hierarchical Vision Transformer.基于分层视觉Transformer的基于提示的图像着色。

本文引用的文献

1
Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey.基于深度神经网络的自监督视觉特征学习:综述
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):4037-4058. doi: 10.1109/TPAMI.2020.2992393. Epub 2021 Oct 1.
2
iW-Net: an automatic and minimalistic interactive lung nodule segmentation deep network.iW-Net:一种自动且极简的交互式肺结节分割深度网络。
Sci Rep. 2019 Aug 12;9(1):11591. doi: 10.1038/s41598-019-48004-8.
3
Places: A 10 Million Image Database for Scene Recognition.地点:用于场景识别的 1000 万图像数据库。
Sensors (Basel). 2022 Sep 29;22(19):7419. doi: 10.3390/s22197419.
IEEE Trans Pattern Anal Mach Intell. 2018 Jun;40(6):1452-1464. doi: 10.1109/TPAMI.2017.2723009. Epub 2017 Jul 4.
4
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
5
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
6
Image quality assessment: from error visibility to structural similarity.图像质量评估:从误差可见性到结构相似性。
IEEE Trans Image Process. 2004 Apr;13(4):600-12. doi: 10.1109/tip.2003.819861.