Chen Linwei, Fu Ying, Gu Lin, Yan Chenggang, Harada Tatsuya, Huang Gao
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):10763-10780. doi: 10.1109/TPAMI.2024.3449959. Epub 2024 Nov 6.
Dense image prediction tasks demand features with strong category information and precise spatial boundary details at high resolution. To achieve this, modern hierarchical models often utilize feature fusion, directly adding upsampled coarse features from deep layers and high-resolution features from lower levels. In this paper, we observe rapid variations in fused feature values within objects, resulting in intra-category inconsistency due to disturbed high-frequency features. Additionally, blurred boundaries in fused features lack accurate high frequency, leading to boundary displacement. Building upon these observations, we propose Frequency-Aware Feature Fusion (FreqFusion), integrating an Adaptive Low-Pass Filter (ALPF) generator, an offset generator, and an Adaptive High-Pass Filter (AHPF) generator. The ALPF generator predicts spatially-variant low-pass filters to attenuate high-frequency components within objects, reducing intra-class inconsistency during upsampling. The offset generator refines large inconsistent features and thin boundaries by replacing inconsistent features with more consistent ones through resampling, while the AHPF generator enhances high-frequency detailed boundary information lost during downsampling. Comprehensive visualization and quantitative analysis demonstrate that FreqFusion effectively improves feature consistency and sharpens object boundaries. Extensive experiments across various dense prediction tasks confirm its effectiveness.
密集图像预测任务需要具有强类别信息和高分辨率下精确空间边界细节的特征。为了实现这一点,现代分层模型通常利用特征融合,直接将深层的上采样粗特征和较低层的高分辨率特征相加。在本文中,我们观察到对象内融合特征值的快速变化,由于高频特征受到干扰,导致类别内不一致。此外,融合特征中的模糊边界缺乏准确的高频,导致边界位移。基于这些观察结果,我们提出了频率感知特征融合(FreqFusion),它集成了一个自适应低通滤波器(ALPF)生成器、一个偏移生成器和一个自适应高通滤波器(AHPF)生成器。ALPF生成器预测空间变化的低通滤波器,以衰减对象内的高频分量,减少上采样期间的类内不一致。偏移生成器通过重采样用更一致的特征替换不一致的特征来细化大的不一致特征和细边界,而AHPF生成器增强下采样期间丢失的高频详细边界信息。全面的可视化和定量分析表明,FreqFusion有效地提高了特征一致性并锐化了对象边界。在各种密集预测任务上的大量实验证实了其有效性。