• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

双通道卷积神经网络的图像风格识别

Convolution Neural Networks With Two Pathways for Image Style Recognition.

出版信息

IEEE Trans Image Process. 2017 Sep;26(9):4102-4113. doi: 10.1109/TIP.2017.2710631. Epub 2017 Jun 9.

DOI:10.1109/TIP.2017.2710631
PMID:28613168
Abstract

Automatic recognition of an image's style is important for many applications, including artwork analysis, photo organization, and image retrieval. Traditional convolution neural network (CNN) approach uses only object features for image style recognition. This approach may not be optimal, because the same object in two images may have different styles. We propose a CNN architecture with two pathways extracting object features and texture features, respectively. The object pathway represents the standard CNN architecture and the texture pathway intermixes the object pathway by outputting the gram matrices of intermediate features in the object pathway. The two pathways are jointly trained. In experiments, two deep CNNs, AlexNet and VGG-19, pretrained on the ImageNet classification data set are fine-tuned for this task. For any model, the two-pathway architecture performs much better than individual pathways, which indicates that the two pathways contain complementary information of an image's style. In particular, the model based on VGG-19 achieves the state-of-the-art results on three benchmark data sets, WikiPaintings, Flickr Style, and AVA Style.

摘要

图像风格的自动识别对于许多应用非常重要,包括艺术品分析、照片组织和图像检索。传统的卷积神经网络 (CNN) 方法仅使用对象特征进行图像风格识别。这种方法可能不是最优的,因为两张图像中的相同对象可能具有不同的风格。我们提出了一种具有两条路径的 CNN 架构,分别提取对象特征和纹理特征。对象路径表示标准的 CNN 架构,纹理路径通过输出对象路径中的中间特征的 Gram 矩阵来混合对象路径。两条路径共同训练。在实验中,我们对两个深度卷积神经网络 AlexNet 和 VGG-19 进行了微调,这些网络都是在 ImageNet 分类数据集上进行预训练的。对于任何模型,双通道架构的性能都明显优于单个路径,这表明两条路径包含了图像风格的互补信息。特别是,基于 VGG-19 的模型在三个基准数据集(WikiPaintings、Flickr Style 和 AVA Style)上实现了最先进的结果。

相似文献

1
Convolution Neural Networks With Two Pathways for Image Style Recognition.双通道卷积神经网络的图像风格识别
IEEE Trans Image Process. 2017 Sep;26(9):4102-4113. doi: 10.1109/TIP.2017.2710631. Epub 2017 Jun 9.
2
Cross-Modal Retrieval With CNN Visual Features: A New Baseline.基于卷积神经网络视觉特征的跨模态检索:一个新的基线。
IEEE Trans Cybern. 2017 Feb;47(2):449-460. doi: 10.1109/TCYB.2016.2519449. Epub 2016 Mar 8.
3
CLIP knows image aesthetics.CLIP了解图像美学。
Front Artif Intell. 2022 Nov 25;5:976235. doi: 10.3389/frai.2022.976235. eCollection 2022.
4
Transfer of Learning in the Convolutional Neural Networks on Classifying Geometric Shapes Based on Local or Global Invariants.基于局部或全局不变量的卷积神经网络在几何形状分类中的学习迁移
Front Comput Neurosci. 2021 Feb 19;15:637144. doi: 10.3389/fncom.2021.637144. eCollection 2021.
5
Representations of regular and irregular shapes by deep Convolutional Neural Networks, monkey inferotemporal neurons and human judgments.深度卷积神经网络、猴子下颞叶神经元和人类判断对规则和不规则形状的表示。
PLoS Comput Biol. 2018 Oct 26;14(10):e1006557. doi: 10.1371/journal.pcbi.1006557. eCollection 2018 Oct.
6
Fisher encoding of convolutional neural network features for endoscopic image classification.用于内镜图像分类的卷积神经网络特征的Fisher编码
J Med Imaging (Bellingham). 2018 Jul;5(3):034504. doi: 10.1117/1.JMI.5.3.034504. Epub 2018 Sep 24.
7
An Ensemble of Fine-Tuned Convolutional Neural Networks for Medical Image Classification.用于医学图像分类的微调卷积神经网络集成
IEEE J Biomed Health Inform. 2017 Jan;21(1):31-40. doi: 10.1109/JBHI.2016.2635663. Epub 2016 Dec 5.
8
Analogy-Detail Networks for Object Recognition.类比-细节网络用于目标识别。
IEEE Trans Neural Netw Learn Syst. 2021 Oct;32(10):4404-4418. doi: 10.1109/TNNLS.2020.3017692. Epub 2021 Oct 5.
9
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.
10
Fine-Tuning CNN Image Retrieval with No Human Annotation.无人工标注微调卷积神经网络图像检索。
IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1655-1668. doi: 10.1109/TPAMI.2018.2846566. Epub 2018 Jun 12.

引用本文的文献

1
Teaching CORnet human fMRI representations for enhanced model-brain alignment.教授CORnet人类功能磁共振成像表征以增强模型与大脑的对齐。
Cogn Neurodyn. 2025 Dec;19(1):61. doi: 10.1007/s11571-025-10252-y. Epub 2025 Apr 15.
2
Achieving more human brain-like vision via human EEG representational alignment.通过人类脑电图表征对齐实现更类人脑的视觉。
ArXiv. 2024 Apr 24:arXiv:2401.17231v2.
3
Digital Image Art Style Transfer Algorithm Based on CycleGAN.基于 CycleGAN 的数字图像艺术风格迁移算法。
Comput Intell Neurosci. 2022 Jan 13;2022:6075398. doi: 10.1155/2022/6075398. eCollection 2022.
4
Application of Deep Learning Algorithms to Visual Communication Courses.深度学习算法在视觉通信课程中的应用。
Front Psychol. 2021 Sep 29;12:713723. doi: 10.3389/fpsyg.2021.713723. eCollection 2021.