Yang Yuxuan, Lei Zhichun, Li Changlu
School of Microelectronics, Tianjin University, Tianjin 300072, China.
Institute of Sensors and Measurements, University of Applied Sciences Ruhr West, 45479 Mülheim an der Ruhr, Germany.
Sensors (Basel). 2024 Aug 12;24(16):5221. doi: 10.3390/s24165221.
No-reference image quality assessment aims to evaluate image quality in line with human subjective perception. Current methods face two challenges: an insufficient ability to attend to global and local information simultaneously, and information loss caused by image resizing. To address these issues, we propose a model that combines the Swin Transformer with natural scene statistics. The model uses the Swin Transformer to extract multi-scale features and incorporates a feature enhancement module and deformable convolution to improve feature representation and better adapt to structural variations in images; it applies dual-branch attention to focus on key regions, aligning the assessment more closely with human visual perception. Natural scene statistics compensate for the information loss caused by image resizing. Additionally, we use a normalized loss function to accelerate model convergence and enhance training stability. We evaluate the model on six standard image quality assessment datasets (both synthetic and authentic) and show that it achieves state-of-the-art results on multiple datasets. Compared with the state-of-the-art DACNN method, our model achieves Spearman rank correlation coefficients of 0.922 and 0.923 on the KADID and KonIQ datasets, improvements of 1.9% and 2.4%, respectively, demonstrating strong performance on both synthetic and authentic scenes.
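The abstract describes the architecture only at a high level, so the following PyTorch sketch is one plausible reading rather than the authors' released code. The backbone interface, the channel counts, the 36-dimensional NSS vector (a BRISQUE-style size), the batch-normalized loss, and all module names here are assumptions for illustration; only `torchvision.ops.DeformConv2d` is a real library call.

```python
import torch
import torch.nn as nn
from torchvision.ops import DeformConv2d


class DualBranchAttention(nn.Module):
    """Hypothetical dual-branch attention: a channel-gating branch and a
    spatial-map branch whose outputs jointly reweight the feature map."""

    def __init__(self, channels: int, reduction: int = 8):
        super().__init__()
        # Channel branch: squeeze-and-excitation-style gating.
        self.channel = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, 1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, 1),
            nn.Sigmoid(),
        )
        # Spatial branch: one attention weight per location.
        self.spatial = nn.Sequential(
            nn.Conv2d(channels, 1, kernel_size=7, padding=3),
            nn.Sigmoid(),
        )

    def forward(self, x):
        return x * self.channel(x) * self.spatial(x)


class FeatureEnhancement(nn.Module):
    """Enhancement block built on deformable convolution; the offsets are
    predicted by a plain conv, as torchvision's DeformConv2d expects."""

    def __init__(self, channels: int):
        super().__init__()
        # 3x3 kernel -> 2 * 3 * 3 = 18 offset channels.
        self.offset = nn.Conv2d(channels, 18, kernel_size=3, padding=1)
        self.deform = DeformConv2d(channels, channels, kernel_size=3, padding=1)
        self.attn = DualBranchAttention(channels)

    def forward(self, x):
        return self.attn(self.deform(x, self.offset(x)))


class SwinNSSIQA(nn.Module):
    """End-to-end sketch: multi-scale backbone features are enhanced, pooled,
    concatenated with a precomputed NSS feature vector, and regressed to a
    single quality score."""

    def __init__(self, backbone: nn.Module, stage_channels, nss_dim: int = 36):
        super().__init__()
        self.backbone = backbone  # assumed to return a list of feature maps
        self.enhance = nn.ModuleList(FeatureEnhancement(c) for c in stage_channels)
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.head = nn.Sequential(
            nn.Linear(sum(stage_channels) + nss_dim, 256),
            nn.ReLU(inplace=True),
            nn.Linear(256, 1),
        )

    def forward(self, image, nss_feats):
        feats = [self.pool(enh(f)).flatten(1)
                 for enh, f in zip(self.enhance, self.backbone(image))]
        return self.head(torch.cat(feats + [nss_feats], dim=1)).squeeze(1)


def normalized_mse_loss(pred, mos, eps=1e-8):
    """Hedged guess at the 'normalized loss': standardize predictions and
    subjective scores within the batch before computing MSE, which keeps
    gradient scale independent of the dataset's MOS range."""
    pred = (pred - pred.mean()) / (pred.std() + eps)
    mos = (mos - mos.mean()) / (mos.std() + eps)
    return torch.mean((pred - mos) ** 2)
```

Any multi-scale feature extractor fits the `backbone` slot (e.g., a Swin Transformer configured to return per-stage feature maps), and the NSS branch would supply `nss_feats` computed from the full-resolution image, which is how the sketch reflects the paper's stated motivation of compensating for resizing loss.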