• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用卷积变换器的美术识别

Fine-art recognition using convolutional transformers.

作者信息

Liu Yu, Bai Haozhe, Wang Jingchao

机构信息

School of Arts, Chongqing University, Chongqing, China.

出版信息

PeerJ Comput Sci. 2024 Oct 18;10:e2409. doi: 10.7717/peerj-cs.2409. eCollection 2024.

DOI:10.7717/peerj-cs.2409
PMID:39650442
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11622946/
Abstract

Digital image processing is a constantly evolving field encompassing a wide range of techniques and applications. Researchers worldwide are continually developing various algorithms across multiple fields to achieve accurate image classification. Advanced computer vision algorithms are crucial for architectural and artistic analysis. The digitalization of art has significantly enhanced the accessibility and conservation of fine-art paintings, yet the risk of art theft remains a significant challenge. Improving art security necessitates the precise identification of fine-art paintings. Although current recognition systems have shown potential, there is significant scope for enhancing their efficiency. We developed an improved recognition system for categorizing fine-art paintings using convolutional transformers, specified by an attention mechanism to enhance focused learning on the data. As part of the most advanced architectures in the deep learning family, transformers are empowered by a multi-head attention mechanism, thus improving learning efficiency. To assess the performance of our model, we compared it with those developed using four pre-trained networks: ResNet50, VGG16, AlexNet, and ViT. Each pre-trained network was integrated into a corresponding state-of-the-art model as the first processing blocks. These four state-of-the-art models were constructed under the transfer learning strategy, one of the most commonly used approaches in this field. The experimental results showed that our proposed system outperformed the other models. Our study also highlighted the effectiveness of using convolutional transformers for learning image features.

摘要

数字图像处理是一个不断发展的领域,涵盖了广泛的技术和应用。世界各地的研究人员不断在多个领域开发各种算法,以实现准确的图像分类。先进的计算机视觉算法对建筑和艺术分析至关重要。艺术的数字化显著提高了美术绘画的可及性和保护水平,但艺术品被盗的风险仍然是一个重大挑战。提高艺术安全性需要精确识别美术绘画。尽管当前的识别系统已显示出潜力,但仍有很大的提升效率的空间。我们开发了一种改进的识别系统,用于使用卷积变换器对美术绘画进行分类,该变换器由注意力机制指定,以增强对数据的聚焦学习。作为深度学习家族中最先进的架构之一,变换器由多头注意力机制赋能,从而提高学习效率。为了评估我们模型的性能,我们将其与使用四个预训练网络(ResNet50、VGG16、AlexNet和ViT)开发的模型进行了比较。每个预训练网络都作为第一个处理块集成到相应的最先进模型中。这四个最先进的模型是在迁移学习策略下构建的,这是该领域最常用的方法之一。实验结果表明,我们提出的系统优于其他模型。我们的研究还强调了使用卷积变换器学习图像特征的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4e6/11622946/d8b4734b41dc/peerj-cs-10-2409-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4e6/11622946/1a0d6722a3a5/peerj-cs-10-2409-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4e6/11622946/b302ea35465b/peerj-cs-10-2409-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4e6/11622946/ce7b7a0c0e6f/peerj-cs-10-2409-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4e6/11622946/d8b4734b41dc/peerj-cs-10-2409-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4e6/11622946/1a0d6722a3a5/peerj-cs-10-2409-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4e6/11622946/b302ea35465b/peerj-cs-10-2409-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4e6/11622946/ce7b7a0c0e6f/peerj-cs-10-2409-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4e6/11622946/d8b4734b41dc/peerj-cs-10-2409-g004.jpg

相似文献

1
Fine-art recognition using convolutional transformers.使用卷积变换器的美术识别
PeerJ Comput Sci. 2024 Oct 18;10:e2409. doi: 10.7717/peerj-cs.2409. eCollection 2024.
2
Do it the transformer way: A comprehensive review of brain and vision transformers for autism spectrum disorder diagnosis and classification.采用变压器方法:自闭症谱系障碍诊断和分类的脑和视觉变压器的全面综述。
Comput Biol Med. 2023 Dec;167:107667. doi: 10.1016/j.compbiomed.2023.107667. Epub 2023 Nov 3.
3
Vision transformer and deep learning based weighted ensemble model for automated spine fracture type identification with GAN generated CT images.基于视觉Transformer和深度学习的加权集成模型,用于利用生成对抗网络(GAN)生成的CT图像自动识别脊柱骨折类型。
Sci Rep. 2025 Apr 25;15(1):14408. doi: 10.1038/s41598-025-98518-7.
4
Multi-class Classification of Retinal Eye Diseases from Ophthalmoscopy Images Using Transfer Learning-Based Vision Transformers.基于迁移学习的视觉变换器对眼底镜图像中的视网膜眼病进行多类别分类
J Imaging Inform Med. 2025 Jan 27. doi: 10.1007/s10278-025-01416-7.
5
Plant recognition by AI: Deep neural nets, transformers, and kNN in deep embeddings.人工智能的植物识别:深度嵌入中的深度神经网络、变换器和k近邻算法
Front Plant Sci. 2022 Sep 27;13:787527. doi: 10.3389/fpls.2022.787527. eCollection 2022.
6
Classification of Artistic Styles of Chinese Art Paintings Based on the CNN Model.基于卷积神经网络模型的中国艺术绘画艺术风格分类。
Comput Intell Neurosci. 2022 Aug 30;2022:4520913. doi: 10.1155/2022/4520913. eCollection 2022.
7
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
8
Emotion Recognition of Chinese Paintings at the Thirteenth National Exhibition of Fines Arts in China Based on Advanced Affective Computing.基于先进情感计算的第十三届全国美术作品展览中中国画的情感识别
Front Psychol. 2021 Oct 22;12:741665. doi: 10.3389/fpsyg.2021.741665. eCollection 2021.
9
An Explainable CNN and Vision Transformer-Based Approach for Real-Time Food Recognition.一种基于可解释卷积神经网络和视觉Transformer的实时食品识别方法。
Nutrients. 2025 Jan 20;17(2):362. doi: 10.3390/nu17020362.
10
Classification of Mobile-Based Oral Cancer Images Using the Vision Transformer and the Swin Transformer.使用视觉Transformer和Swin Transformer对基于移动设备的口腔癌图像进行分类
Cancers (Basel). 2024 Feb 29;16(5):987. doi: 10.3390/cancers16050987.

本文引用的文献

1
Nighttime road scene image enhancement based on cycle-consistent generative adversarial network.基于循环一致生成对抗网络的夜间道路场景图像增强
Sci Rep. 2024 Jun 22;14(1):14375. doi: 10.1038/s41598-024-65270-3.
2
RayMVSNet++: Learning Ray-Based 1D Implicit Fields for Accurate Multi-View Stereo.RayMVSNet++:学习基于光线的一维隐式场以实现精确的多视图立体视觉。
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13666-13682. doi: 10.1109/TPAMI.2023.3296163. Epub 2023 Oct 3.
3
High-content image generation for drug discovery using generative adversarial networks.
基于生成对抗网络的药物发现高内涵图像生成。
Neural Netw. 2020 Dec;132:353-363. doi: 10.1016/j.neunet.2020.09.007. Epub 2020 Sep 20.
4
Improved ArtGAN for Conditional Synthesis of Natural Image and Artwork.用于自然图像和艺术品条件合成的改进型艺术生成对抗网络
IEEE Trans Image Process. 2018 Aug 22. doi: 10.1109/TIP.2018.2866698.
5
Rhythmic brushstrokes distinguish van Gogh from his contemporaries: findings via automated brushstroke extraction.笔触的节奏感将梵高与同时代的画家区分开来:通过自动笔触提取获得的发现。
IEEE Trans Pattern Anal Mach Intell. 2012 Jun;34(6):1159-76. doi: 10.1109/TPAMI.2011.203.
6
State-of-The-Art and Applications of 3D Imaging Sensors in Industry, Cultural Heritage, Medicine, and Criminal Investigation.三维成像传感器在工业、文化遗产、医学和刑事调查中的最新技术和应用。
Sensors (Basel). 2009;9(1):568-601. doi: 10.3390/s90100568. Epub 2009 Jan 20.