基于 RGB 图像的像素级光谱重建的恢复。

A Rehabilitation of Pixel-Based Spectral Reconstruction from RGB Images.

机构信息

School of Computing Sciences, University of East Anglia, Norwich NR4 7TJ, UK.

出版信息

Sensors (Basel). 2023 Apr 21;23(8):4155. doi: 10.3390/s23084155.

DOI:10.3390/s23084155

PMID:37112497

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10142338/

Abstract

Recently, many deep neural networks (DNN) have been proposed to solve the spectral reconstruction (SR) problem: recovering spectra from RGB measurements. Most DNNs seek to learn the relationship between an RGB viewed in a given spatial context and its corresponding spectra. Significantly, it is argued that the same RGB can map to different spectra depending on the context with respect to which it is seen and, more generally, that accounting for spatial context leads to improved SR. However, as it stands, DNN performance is only slightly better than the much simpler pixel-based methods where spatial context is not used. In this paper, we present a new pixel-based algorithm called A++ (an extension of the A+ sparse coding algorithm). In A+, RGBs are clustered, and within each cluster, a designated linear SR map is trained to recover spectra. In A++, we cluster the spectra instead in an attempt to ensure neighboring spectra (i.e., spectra in the same cluster) are recovered by the same SR map. A polynomial regression framework is developed to estimate the spectral neighborhoods given only the RGB values in testing, which in turn determines which mapping should be used to map each testing RGB to its reconstructed spectrum. Compared to the leading DNNs, not only does A++ deliver the best results, it is parameterized by orders of magnitude fewer parameters and has a significantly faster implementation. Moreover, in contradistinction to some DNN methods, A++ uses pixel-based processing, which is robust to image manipulations that alter the spatial context (e.g., blurring and rotations). Our demonstration on the scene relighting application also shows that, while SR methods, in general, provide more accurate relighting results compared to the traditional diagonal matrix correction, A++ provides superior color accuracy and robustness compared to the top DNN methods.

摘要

最近，许多深度神经网络（DNN）被提出以解决光谱重建（SR）问题：从 RGB 测量中恢复光谱。大多数 DNN 试图学习给定空间上下文内的 RGB 与其相应光谱之间的关系。重要的是，有人认为，同一 RGB 可以根据其被观察的上下文映射到不同的光谱，更一般地说，考虑到空间上下文会导致更好的 SR。然而，就目前而言，DNN 的性能仅略好于不使用空间上下文的简单得多的基于像素的方法。在本文中，我们提出了一种新的基于像素的算法，称为 A++（A+稀疏编码算法的扩展）。在 A+中，RGB 被聚类，并且在每个聚类中，训练指定的线性 SR 图以恢复光谱。在 A++中，我们聚类光谱，试图确保相邻的光谱（即同一聚类中的光谱）由相同的 SR 图恢复。开发了一个多项式回归框架来仅根据测试中的 RGB 值估计光谱邻域，这反过来又确定了应该使用哪个映射将每个测试 RGB 映射到其重建光谱。与领先的 DNN 相比，A++不仅提供了最佳的结果，而且其参数化的参数数量要少几个数量级，并且实现速度要快得多。此外，与一些 DNN 方法相反，A++使用基于像素的处理，该处理对改变空间上下文（例如模糊和旋转）的图像操作具有鲁棒性。我们在场景重光照应用程序上的演示还表明，虽然一般来说，与传统的对角矩阵校正相比，SR 方法提供了更准确的重光照结果，但与顶级 DNN 方法相比，A++提供了更高的颜色准确性和鲁棒性。