• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

学习用于高效光场压缩的内核调制神经表示

Learning Kernel-Modulated Neural Representation for Efficient Light Field Compression.

作者信息

Shi Jinglei, Xu Yihong, Guillemot Christine

出版信息

IEEE Trans Image Process. 2024;33:4060-4074. doi: 10.1109/TIP.2024.3418670. Epub 2024 Jul 4.

DOI:10.1109/TIP.2024.3418670
PMID:38949941
Abstract

Light fields capture 3D scene information by recording light rays emitted from a scene at various orientations. They offer a more immersive perception, compared with classic 2D images, but at the cost of huge data volumes. In this paper, we design a compact neural network representation for the light field compression task. In the same vein as the deep image prior, the neural network takes randomly initialized noise as input and is trained in a supervised manner in order to best reconstruct the target light field Sub-Aperture Images (SAIs). The network is composed of two types of complementary kernels: descriptive kernels (descriptors) that store scene description information learned during training, and modulatory kernels (modulators) that control the rendering of different SAIs from the queried perspectives. To further enhance compactness of the network meanwhile retain high quality of the decoded light field, we propose modulator allocation and apply kernel tensor decomposition techniques, followed by non-uniform quantization and lossless entropy coding. Extensive experiments demonstrate that our method outperforms other state-of-the-art (SOTA) methods by a significant margin in the light field compression task. Moreover, after adapting descriptors, the modulators learned from one light field can be transferred to new light fields for rendering dense views, showing the potential of the solution for view synthesis.

摘要

光场通过记录从场景以各种方向发射的光线来捕获三维场景信息。与传统二维图像相比,它们提供了更身临其境的感知,但代价是数据量巨大。在本文中,我们为光场压缩任务设计了一种紧凑的神经网络表示。与深度图像先验类似,神经网络以随机初始化的噪声作为输入,并以监督方式进行训练,以便最佳地重建目标光场子孔径图像(SAI)。该网络由两种互补内核组成:描述性内核(描述符),用于存储训练期间学习到的场景描述信息;调制内核(调制器),用于从查询视角控制不同SAI的渲染。为了在保持解码光场高质量的同时进一步提高网络的紧凑性,我们提出了调制器分配并应用内核张量分解技术,随后进行非均匀量化和无损熵编码。大量实验表明,在光场压缩任务中,我们的方法比其他现有最先进(SOTA)方法有显著优势。此外,在调整描述符之后,从一个光场学习到的调制器可以转移到新的光场以渲染密集视图,这显示了该解决方案在视图合成方面的潜力。

相似文献

1
Learning Kernel-Modulated Neural Representation for Efficient Light Field Compression.学习用于高效光场压缩的内核调制神经表示
IEEE Trans Image Process. 2024;33:4060-4074. doi: 10.1109/TIP.2024.3418670. Epub 2024 Jul 4.
2
Light-field compression using a pair of steps and depth estimation.使用一对步骤进行光场压缩和深度估计。
Opt Express. 2019 Feb 4;27(3):3557-3573. doi: 10.1364/OE.27.003557.
3
An Untrained Neural Network Prior for Light Field Compression.用于光场压缩的未经训练的神经网络先验
IEEE Trans Image Process. 2022;31:6922-6936. doi: 10.1109/TIP.2022.3217374. Epub 2022 Nov 8.
4
Fast virtual view synthesis for an 8K 3D light-field display based on cutoff-NeRF and 3D voxel rendering.基于截止神经辐射场和 3D 体素渲染的 8K 3D 光场显示的快速虚拟视图合成。
Opt Express. 2022 Nov 21;30(24):44201-44217. doi: 10.1364/OE.473852.
5
Dense-view synthesis for three-dimensional light-field display based on unsupervised learning.基于无监督学习的三维光场显示的密集视图合成
Opt Express. 2019 Aug 19;27(17):24624-24641. doi: 10.1364/OE.27.024624.
6
A Flexible Coding Scheme Based on Block Krylov Subspace Approximation for Light Field Displays with Stacked Multiplicative Layers.一种基于块克里洛夫子空间近似的灵活编码方案,用于具有堆叠乘法层的光场显示
Sensors (Basel). 2021 Jul 4;21(13):4574. doi: 10.3390/s21134574.
7
Enhanced Standard Compatible Image Compression Framework Based on Auxiliary Codec Networks.基于辅助编解码器网络的增强型标准兼容图像压缩框架
IEEE Trans Image Process. 2022;31:664-677. doi: 10.1109/TIP.2021.3134473. Epub 2021 Dec 28.
8
UPST-NeRF: Universal Photorealistic Style Transfer of Neural Radiance Fields for 3D Scene.UPST-NeRF:用于3D场景的神经辐射场通用逼真风格迁移
IEEE Trans Vis Comput Graph. 2025 Apr;31(4):2045-2057. doi: 10.1109/TVCG.2024.3378692. Epub 2025 Feb 27.
9
SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections.场景梦境生成器:从二维图像集合生成无边界三维场景
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):15562-15576. doi: 10.1109/TPAMI.2023.3321857. Epub 2023 Nov 3.
10
Self-supervised structural similarity-based convolutional neural network for cardiac diffusion tensor image denoising.基于自监督结构相似性的卷积神经网络用于心脏扩散张量图像去噪
Med Phys. 2023 Oct;50(10):6137-6150. doi: 10.1002/mp.16301. Epub 2023 Apr 17.