Wang Junjie, Zhang Mengmeng, Li Wei, Tao Ran
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17189-17201. doi: 10.1109/TNNLS.2023.3300903. Epub 2024 Dec 2.
Mixup-based data augmentation has been proven to be beneficial to the regularization of models during training, especially in the remote-sensing field where the training data is scarce. However, in the process of data augmentation, the Mixup-based methods ignore the target proportion in different inputs and keep the linear insertion ratio consistent, which leads to the response of label space even if no effective objects are introduced in the mixed image due to the randomness of the augmentation process. Moreover, although some previous works have attempted to utilize different multimodal interaction strategies, they could not be well extended to various remote-sensing data combinations. To this end, a multistage information complementary fusion network based on flexible-mixup (Flex-MCFNet) is proposed for hyperspectral-X image classification. First, to bridge the gap between the mixed image and the label, a flexible-mixup (FlexMix) data augmentation strategy is designed, where the weight of the label increases with the ratio of the input image to prevent the negative impact on the label space because of the introduction of invalid information. More importantly, to summarize diverse remote-sensing data inputs including various modal supplements and uncertainties, a multistage information complementary fusion network (MCFNet) is developed. After extracting the features of hyperspectral and complementary modalities [X-modal, including multispectral, synthetic aperture radar (SAR), and light detection and ranging (LiDAR)] separately, the information between complementary modalities is fully interacted and enhanced through multiple stages of information complement and fusion, which is used for the final image classification. Extensive experimental results have demonstrated that Flex-MCFNet can not only effectively expand the training data, but also adequately regularize different data combinations to achieve state-of-the-art performance.
基于混合样本的数据增强已被证明在训练过程中对模型的正则化有益,特别是在训练数据稀缺的遥感领域。然而,在数据增强过程中,基于混合样本的方法忽略了不同输入中的目标比例,并保持线性插入比例一致,这导致即使由于增强过程的随机性在混合图像中没有引入有效对象,标签空间也会产生响应。此外,尽管先前的一些工作尝试利用不同的多模态交互策略,但它们无法很好地扩展到各种遥感数据组合。为此,提出了一种基于灵活混合样本的多阶段信息互补融合网络(Flex-MCFNet)用于高光谱-X图像分类。首先,为了弥合混合图像与标签之间的差距,设计了一种灵活混合样本(FlexMix)数据增强策略,其中标签的权重随着输入图像的比例增加,以防止由于引入无效信息而对标签空间产生负面影响。更重要的是,为了汇总包括各种模态补充和不确定性在内的不同遥感数据输入,开发了一种多阶段信息互补融合网络(MCFNet)。在分别提取高光谱和互补模态[X模态,包括多光谱、合成孔径雷达(SAR)和光探测与测距(LiDAR)]的特征后,互补模态之间的信息通过多阶段的信息互补和融合进行充分交互和增强,用于最终的图像分类。大量实验结果表明,Flex-MCFNet不仅可以有效地扩展训练数据,还可以充分正则化不同的数据组合以实现最优性能。