Wang Wenjing, Luo Rundong, Yang Wenhan, Liu Jiaying
IEEE Trans Pattern Anal Mach Intell. 2024 Sep;46(9):5951-5966. doi: 10.1109/TPAMI.2024.3382108. Epub 2024 Aug 6.
Insufficient lighting poses challenges to both human and machine visual analytics. While existing low-light enhancement methods prioritize human visual perception, they often neglect machine vision and high-level semantics. In this paper, we make pioneering efforts to build an illumination enhancement model for high-level vision. Drawing inspiration from camera response functions, our model could enhance images from the machine vision perspective despite being lightweight in architecture and simple in formulation. We also introduce two approaches that leverage knowledge from base enhancement curves and self-supervised pretext tasks to train for different downstream normal-to-low-light adaptation scenarios. Our proposed framework overcomes the limitations of existing algorithms without requiring access to labeled data in low-light conditions. It facilitates more effective illumination restoration and feature alignment, significantly improving the performance of downstream tasks in a plug-and-play manner. This research advances the field of low-light machine analytics and broadly applies to various high-level vision tasks, including classification, face detection, optical flow estimation, and video action recognition.
光照不足对人类和机器视觉分析都构成了挑战。虽然现有的低光增强方法优先考虑人类视觉感知,但它们往往忽视机器视觉和高级语义。在本文中,我们率先努力构建用于高级视觉的光照增强模型。受相机响应函数的启发,我们的模型尽管架构轻量级且公式简单,但能够从机器视觉角度增强图像。我们还引入了两种方法,利用来自基本增强曲线的知识和自监督预训练任务,针对不同的下游正常到低光适应场景进行训练。我们提出的框架克服了现有算法的局限性,无需访问低光条件下的标注数据。它有助于更有效地进行光照恢复和特征对齐,以即插即用的方式显著提高下游任务的性能。这项研究推动了低光机器分析领域的发展,并广泛应用于各种高级视觉任务,包括分类、人脸检测、光流估计和视频动作识别。