• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

注意网络在提升光场图像质量中的应用。

Attention Networks for the Quality Enhancement of Light Field Images.

机构信息

Department of Electronics and Informatics (ETRO), Vrije Universiteit Brussel (VUB), Pleinlaan 2, 1050 Brussels, Belgium.

出版信息

Sensors (Basel). 2021 May 7;21(9):3246. doi: 10.3390/s21093246.

DOI:10.3390/s21093246
PMID:34067191
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8125823/
Abstract

In this paper, we propose a novel filtering method based on deep attention networks for the quality enhancement of light field (LF) images captured by plenoptic cameras and compressed using the High Efficiency Video Coding (HEVC) standard. The proposed architecture was built using efficient complex processing blocks and novel attention-based residual blocks. The network takes advantage of the macro-pixel (MP) structure, specific to LF images, and processes each reconstructed MP in the luminance (Y) channel. The input patch is represented as a tensor that collects, from an MP neighbourhood, four Epipolar Plane Images (EPIs) at four different angles. The experimental results on a common LF image database showed high improvements over HEVC in terms of the structural similarity index (SSIM), with an average Y-Bjøntegaard Delta (BD)-rate savings of 36.57%, and an average Y-BD-PSNR improvement of 2.301 dB. Increased performance was achieved when the HEVC built-in filtering methods were skipped. The visual results illustrate that the enhanced image contains sharper edges and more texture details. The ablation study provides two robust solutions to reduce the inference time by 44.6% and the network complexity by 74.7%. The results demonstrate the potential of attention networks for the quality enhancement of LF images encoded by HEVC.

摘要

在本文中,我们提出了一种基于深度注意网络的新颖滤波方法,用于增强由光场(LF)相机捕获并使用高效视频编码(HEVC)标准压缩的 LF 图像的质量。所提出的架构使用高效的复数处理块和新颖的基于注意力的残差块构建。该网络利用 LF 图像特有的宏像素(MP)结构,对亮度(Y)通道中的每个重建 MP 进行处理。输入补丁表示为一个张量,从 MP 邻域中收集四个不同角度的四个立体图像(EPI)。在常见的 LF 图像数据库上的实验结果表明,与 HEVC 相比,在结构相似性指数(SSIM)方面有很大的提高,平均 Y-Bjøntegaard Delta(BD)-率节省 36.57%,平均 Y-BD-PSNR 提高 2.301dB。当跳过 HEVC 内置滤波方法时,性能得到提高。视觉结果表明,增强后的图像具有更清晰的边缘和更多的纹理细节。消融研究提供了两种稳健的解决方案,可将推断时间减少 44.6%,将网络复杂度降低 74.7%。结果表明,注意网络在增强 HEVC 编码的 LF 图像质量方面具有潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/a1443befe09f/sensors-21-03246-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/a352cfbb8dd6/sensors-21-03246-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/3669f334e6be/sensors-21-03246-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/31e092b8da0b/sensors-21-03246-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/561559be3c1b/sensors-21-03246-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/6551ea1aaee5/sensors-21-03246-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/818d974b973c/sensors-21-03246-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/27232c151f5c/sensors-21-03246-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/0f298d3a58cd/sensors-21-03246-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/1d63c3257ba7/sensors-21-03246-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/df09cf853c24/sensors-21-03246-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/7fa1aabd7473/sensors-21-03246-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/a1443befe09f/sensors-21-03246-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/a352cfbb8dd6/sensors-21-03246-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/3669f334e6be/sensors-21-03246-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/31e092b8da0b/sensors-21-03246-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/561559be3c1b/sensors-21-03246-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/6551ea1aaee5/sensors-21-03246-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/818d974b973c/sensors-21-03246-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/27232c151f5c/sensors-21-03246-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/0f298d3a58cd/sensors-21-03246-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/1d63c3257ba7/sensors-21-03246-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/df09cf853c24/sensors-21-03246-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/7fa1aabd7473/sensors-21-03246-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a0e/8125823/a1443befe09f/sensors-21-03246-g012.jpg

相似文献

1
Attention Networks for the Quality Enhancement of Light Field Images.注意网络在提升光场图像质量中的应用。
Sensors (Basel). 2021 May 7;21(9):3246. doi: 10.3390/s21093246.
2
Deep Learning Post-Filtering Using Multi-Head Attention and Multiresolution Feature Fusion for Image and Intra-Video Quality Enhancement.基于多头注意力和多分辨率特征融合的深度学习后滤波技术在图像和视频内质量增强中的应用。
Sensors (Basel). 2022 Feb 10;22(4):1353. doi: 10.3390/s22041353.
3
Efficient In-loop Filtering Based on Enhanced Deep Convolutional Neural Networks for HEVC.基于增强深度卷积神经网络的高效HEVC帧内滤波
IEEE Trans Image Process. 2020 Mar 27. doi: 10.1109/TIP.2020.2982534.
4
Light Field Reconstruction Using Residual Networks on Raw Images.基于原始图像的残差网络的光场重建。
Sensors (Basel). 2022 Mar 2;22(5):1956. doi: 10.3390/s22051956.
5
RCA-LF: Dense Light Field Reconstruction Using Residual Channel Attention Networks.RCA-LF:使用残差通道注意力网络的密集光场重建
Sensors (Basel). 2022 Jul 14;22(14):5254. doi: 10.3390/s22145254.
6
A Deep Learning Approach for Multi-Frame In-Loop Filter of HEVC.一种用于高效视频编码(HEVC)多帧帧内循环滤波器的深度学习方法。
IEEE Trans Image Process. 2019 Nov;28(11):5663-5678. doi: 10.1109/TIP.2019.2921877. Epub 2019 Jun 14.
7
A Hardware-Friendlyand High-Efficiency H.265/HEVC Encoder for Visual Sensor Networks.面向视觉传感器网络的硬件友好型、高效率 H.265/HEVC 编码器。
Sensors (Basel). 2023 Feb 27;23(5):2625. doi: 10.3390/s23052625.
8
Scalable Coding of Plenoptic Images by Using a Sparse Set and Disparities.利用稀疏集和视差进行全光图像的可扩展编码。
IEEE Trans Image Process. 2016 Jan;25(1):80-91. doi: 10.1109/TIP.2015.2498406. Epub 2015 Nov 5.
9
CS-MRI Reconstruction Using an Improved GAN with Dilated Residual Networks and Channel Attention Mechanism.基于改进生成对抗网络、带空洞残差网络和通道注意力机制的磁共振成像重建
Sensors (Basel). 2023 Sep 6;23(18):7685. doi: 10.3390/s23187685.
10
End-to-End Light Field Spatial Super-Resolution Network Using Multiple Epipolar Geometry.基于多对极几何的端到端光场空间超分辨率网络
IEEE Trans Image Process. 2021;30:5956-5968. doi: 10.1109/TIP.2021.3079805. Epub 2021 Jun 30.

引用本文的文献

1
Advances in Deep-Learning-Based Sensing, Imaging, and Video Processing.深度学习感知、成像和视频处理的进展。
Sensors (Basel). 2022 Aug 18;22(16):6192. doi: 10.3390/s22166192.

本文引用的文献

1
Small Object Detection in Traffic Scenes Based on Attention Feature Fusion.基于注意力特征融合的交通场景小目标检测。
Sensors (Basel). 2021 Apr 26;21(9):3031. doi: 10.3390/s21093031.
2
Edge-Enhanced with Feedback Attention Network for Image Super-Resolution.基于反馈注意力网络的边缘增强图像超分辨率方法。
Sensors (Basel). 2021 Mar 15;21(6):2064. doi: 10.3390/s21062064.
3
Learning Region-Based Attention Network for Traffic Sign Recognition.基于学习区域注意力网络的交通标志识别。
Sensors (Basel). 2021 Jan 20;21(3):686. doi: 10.3390/s21030686.
4
Depth Estimation for Light-Field Images Using Stereo Matching and Convolutional Neural Networks.基于立体匹配和卷积神经网络的光场图像深度估计
Sensors (Basel). 2020 Oct 30;20(21):6188. doi: 10.3390/s20216188.
5
Content-Aware Convolutional Neural Network for In-Loop Filtering in High Efficiency Video Coding.用于高效视频编码中环路滤波的内容感知卷积神经网络
IEEE Trans Image Process. 2019 Jul;28(7):3343-3356. doi: 10.1109/TIP.2019.2896489. Epub 2019 Jan 31.
6
Base-Anchored Model for Highly Scalable and Accessible Compression of Multiview Imagery.用于多视图图像的高可扩展性和可访问性压缩的基于基数的模型
IEEE Trans Image Process. 2019 Jul;28(7):3205-3218. doi: 10.1109/TIP.2019.2894968. Epub 2019 Jan 24.
7
Residual Highway Convolutional Neural Networks for in-loop Filtering in HEVC.用于 HEVC 中环路滤波的剩余公路卷积神经网络。
IEEE Trans Image Process. 2018 Aug;27(8):3827-3841. doi: 10.1109/TIP.2018.2815841.
8
Light Field Multi-View Video Coding With Two-Directional Parallel Inter-View Prediction.具有双向并行视间预测的光场多视角视频编码。
IEEE Trans Image Process. 2016 Nov;25(11):5104-5117. doi: 10.1109/TIP.2016.2603602. Epub 2016 Aug 26.
9
Depth Estimation with Occlusion Modeling Using Light-Field Cameras.基于光场相机的遮挡建模深度估计。
IEEE Trans Pattern Anal Mach Intell. 2016 Nov;38(11):2170-2181. doi: 10.1109/TPAMI.2016.2515615. Epub 2016 Jan 7.
10
Light field compression using disparity-compensated lifting and shape adaptation.使用视差补偿提升和形状适配的光场压缩
IEEE Trans Image Process. 2006 Apr;15(4):793-806. doi: 10.1109/tip.2005.863954.