School of Computer Science and Technology, Huazhong University of Science and Technology, 1037 Luoyu Road, Wuhan, 430070, Hubei, China.
School of Computer Science and Technology, Huazhong University of Science and Technology, 1037 Luoyu Road, Wuhan, 430070, Hubei, China.
Neural Netw. 2024 Nov;179:106536. doi: 10.1016/j.neunet.2024.106536. Epub 2024 Jul 14.
Cross-domain few-shot Learning (CDFSL) is proposed to first pre-train deep models on a source domain dataset where sufficient data is available, and then generalize models to target domains to learn from only limited data. However, the gap between the source and target domains greatly hampers the generalization and target-domain few-shot finetuning. To address this problem, we analyze the domain gap from the aspect of frequency-domain analysis. We find the domain gap could be reflected by the compositions of source-domain spectra, and the lack of compositions in the source datasets limits the generalization. Therefore, we aim to expand the coverage of spectra composition in the source datasets to help the source domain cover a larger range of possible target-domain information, to mitigate the domain gap. To achieve this goal, we propose the Spectral Decomposition and Transformation (SDT) method, which first randomly decomposes the spectrogram of the source datasets into orthogonal bases, and then randomly samples different coordinates in the space formed by these bases. We integrate the above process into a data augmentation module, and further design a two-stream network to handle augmented images and original images respectively. Experimental results show that our method achieves state-of-the-art performance in the CDFSL benchmark dataset.
跨领域少样本学习 (CDFSL) 旨在首先在源域数据集上进行深度模型的预训练,该数据集有足够的数据,然后将模型推广到目标域,以便仅从有限的数据中学习。然而,源域和目标域之间的差距极大地阻碍了模型的泛化和目标域的少样本微调。为了解决这个问题,我们从频域分析的角度分析了域间隙。我们发现域间隙可以由源域光谱的组成反映出来,而源数据集缺乏组成限制了泛化。因此,我们旨在扩展源数据集的光谱组成的覆盖范围,以帮助源域覆盖更大范围的可能的目标域信息,从而减轻域间隙。为了实现这一目标,我们提出了光谱分解和转换 (SDT) 方法,该方法首先将源数据集的声谱图随机分解为正交基,然后随机在这些基构成的空间中采样不同的坐标。我们将上述过程集成到一个数据增强模块中,并进一步设计了一个双流网络,分别处理增强图像和原始图像。实验结果表明,我们的方法在 CDFSL 基准数据集上取得了最先进的性能。