School of Electronics and Electrical Engineering, Kyungpook National University, Daegu, 41566, South Korea.
School of Electronics and Electrical Engineering, Kyungpook National University, Daegu, 41566, South Korea; Graduate School of Artificial Intelligence, Kyungpook National University, Daegu, 41566, South Korea.
Neural Netw. 2021 Nov;143:489-499. doi: 10.1016/j.neunet.2021.07.003. Epub 2021 Jul 8.
Recognition of ancient Korean-Chinese cursive character (Hanja) is a challenging problem mainly because of large number of classes, damaged cursive characters, various hand-writing styles, and similar confusable characters. They also suffer from lack of training data and class imbalance issues. To address these problems, we propose a unified Regularized Low-shot Attention Transfer with Imbalance τ-Normalizing (RELATIN) framework. This handles the problem with instance-poor classes using a novel low-shot regularizer that encourages the norm of the weight vectors for classes with few samples to be aligned to those of many-shot classes. To overcome the class imbalance problem, we incorporate a decoupled classifier to rectify the decision boundaries via classifier weight-scaling into the proposed low-shot regularizer framework. To address the limited training data issue, the proposed framework performs Jensen-Shannon divergence based data augmentation and incorporate an attention module that aligns the most attentive features of the pretrained network to a target network. We verify the proposed RELATIN framework using highly-imbalanced ancient cursive handwritten character datasets. The results suggest that (i) the extreme class imbalance has a detrimental effect on classification performance; (ii) the proposed low-shot regularizer aligns the norm of the classifier in favor of classes with few samples; (iii) weight-scaling of decoupled classifier for addressing class imbalance appeared to be dominant in all the other baseline conditions; (iv) further addition of the attention module attempts to select more representative features maps from base pretrained model; (v) the proposed (RELATIN) framework results in superior representations to address extreme class imbalance issue.
识别古朝鲜汉字(汉字)是一个具有挑战性的问题,主要是因为类的数量多、草书字符损坏、各种手写风格和相似的易混淆字符。它们还存在缺乏训练数据和类不平衡问题。为了解决这些问题,我们提出了一个统一的正则化低-shot 注意力转移与不平衡τ归一化(RELATIN)框架。该框架使用一种新的低 shot 正则化器来处理类中样本数量较少的问题,该正则化器鼓励具有少数样本的类的权重向量的范数与多 shot 类的权重向量的范数对齐。为了克服类不平衡问题,我们在提出的低 shot 正则化器框架中引入了一个解耦分类器,通过分类器权重缩放来修正决策边界。为了解决有限的训练数据问题,所提出的框架执行基于 Jensen-Shannon 散度的数据增强,并结合注意力模块,将预训练网络的最关注特征对齐到目标网络。我们使用高度不平衡的古代草书手写字符数据集验证了所提出的 RELATIN 框架。结果表明:(i)极端的类不平衡对分类性能有不利影响;(ii)所提出的低 shot 正则化器有利于类中样本数量较少的类的分类器的范数对齐;(iii)用于解决类不平衡的解耦分类器的权重缩放在所有其他基线条件下似乎都是占主导地位的;(iv)注意力模块的进一步添加试图从基础预训练模型中选择更具代表性的特征图;(v)所提出的(RELATIN)框架能够获得更好的表示,以解决极端的类不平衡问题。