Gong Zhiqiang, Zhong Ping, Hu Weidong
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):322-333. doi: 10.1109/TNNLS.2020.2978577. Epub 2021 Jan 4.
Nowadays, deep learning methods, especially the convolutional neural networks (CNNs), have shown impressive performance on extracting abstract and high-level features from the hyperspectral image. However, the general training process of CNNs mainly considers the pixelwise information or the samples' correlation to formulate the penalization while ignores the statistical properties especially the spectral variability of each class in the hyperspectral image. These sample-based penalizations would lead to the uncertainty of the training process due to the imbalanced and limited number of training samples. To overcome this problem, this article characterizes each class from the hyperspectral image as a statistical distribution and further develops a novel statistical loss with the distributions, not directly with samples for deep learning. Based on the Fisher discrimination criterion, the loss penalizes the sample variance of each class distribution to decrease the intraclass variance of the training samples. Moreover, an additional diversity-promoting condition is added to enlarge the interclass variance between different class distributions, and this could better discriminate samples from different classes in the hyperspectral image. Finally, the statistical estimation form of the statistical loss is developed with the training samples through multivariant statistical analysis. Experiments over the real-world hyperspectral images show the effectiveness of the developed statistical loss for deep learning.
如今,深度学习方法,尤其是卷积神经网络(CNN),在从高光谱图像中提取抽象和高级特征方面表现出了令人印象深刻的性能。然而,CNN的一般训练过程主要考虑像素级信息或样本的相关性来制定惩罚,而忽略了统计特性,特别是高光谱图像中每个类别的光谱变异性。由于训练样本数量不均衡且有限,这些基于样本的惩罚会导致训练过程的不确定性。为了克服这个问题,本文将高光谱图像中的每个类别表征为一种统计分布,并进一步开发了一种基于这些分布而非直接基于样本的新型统计损失用于深度学习。基于Fisher判别准则,该损失对每个类分布的样本方差进行惩罚,以降低训练样本的类内方差。此外,添加了一个额外的促进多样性条件来扩大不同类分布之间的类间方差,这可以更好地区分高光谱图像中不同类别的样本。最后,通过多元统计分析利用训练样本开发了统计损失的统计估计形式。在真实世界高光谱图像上的实验表明了所开发的用于深度学习的统计损失的有效性。