IEEE Trans Cybern. 2016 Mar;46(3):756-65. doi: 10.1109/TCYB.2015.2414920. Epub 2015 Mar 31.
Chinese character font recognition (CCFR) has received increasing attention as the intelligent applications based on optical character recognition becomes popular. However, traditional CCFR systems do not handle noisy data effectively. By analyzing in detail the basic strokes of Chinese characters, we propose that font recognition on a single Chinese character is a sequence classification problem, which can be effectively solved by recurrent neural networks. For robust CCFR, we integrate a principal component convolution layer with the 2-D long short-term memory (2DLSTM) and develop principal component 2DLSTM (PC-2DLSTM) algorithm. PC-2DLSTM considers two aspects: 1) the principal component layer convolution operation helps remove the noise and get a rational and complete font information and 2) simultaneously, 2DLSTM deals with the long-range contextual processing along scan directions that can contribute to capture the contrast between character trajectory and background. Experiments using the frequently used CCFR dataset suggest the effectiveness of PC-2DLSTM compared with other state-of-the-art font recognition methods.
汉字字体识别(CCFR)在基于光学字符识别的智能应用越来越受到关注。然而,传统的 CCFR 系统不能有效地处理噪声数据。通过详细分析汉字的基本笔画,我们提出,单个汉字的字体识别是一个序列分类问题,可以通过递归神经网络有效地解决。为了实现稳健的 CCFR,我们将主成分卷积层与 2D 长短期记忆网络(2DLSTM)集成,并开发了主成分 2DLSTM(PC-2DLSTM)算法。PC-2DLSTM 考虑了两个方面:1)主成分层卷积操作有助于去除噪声,得到合理和完整的字体信息;2)同时,2DLSTM 处理沿扫描方向的长程上下文处理,有助于捕捉字符轨迹与背景之间的对比度。使用常用的 CCFR 数据集进行的实验表明,与其他最先进的字体识别方法相比,PC-2DLSTM 是有效的。