Fu H C, Xu Y Y, Chang H Y
Department of Computer Science and Information Engineering, National Chiao Tung University, HsinChu, Taiwan, ROC.
Int J Neural Syst. 1999 Dec;9(6):545-61. doi: 10.1142/s0129065799000575.
Recognition of similar (confusion) characters is a difficult problem in optical character recognition (OCR). In this paper, we introduce a neural network solution that is capable of modeling minor differences among similar characters, and is robust to various personal handwriting styles. The Self-growing Probabilistic Decision-based Neural Network (SPDNN) is a probabilistic type neural network, which adopts a hierarchical network structure with nonlinear basis functions and a competitive credit-assignment scheme. Based on the SPDNN model, we have constructed a three-stage recognition system. First, a coarse classifier determines a character to be input to one of the pre-defined subclasses partitioned from a large character set, such as Chinese mixed with alphanumerics. Then a character recognizer determines the input image which best matches the reference character in the subclass. Lastly, the third module is a similar character recognizer, which can further enhance the recognition accuracy among similar or confusing characters. The prototype system has demonstrated a successful application of SPDNN to similar handwritten Chinese recognition for the public database CCL/HCCR1 (5401 characters x200 samples). Regarding performance, experiments on the CCL/HCCR1 database produced 90.12% recognition accuracy with no rejection, and 94.11% accuracy with 6.7% rejection, respectively. This recognition accuracy represents about 4% improvement on the previously announced performance. As to processing speed, processing before recognition (including image preprocessing, segmentation, and feature extraction) requires about one second for an A4 size character image, and recognition consumes approximately 0.27 second per character on a Pentium-100 based personal computer, without use of any hardware accelerator or co-processor.
相似(易混淆)字符的识别是光学字符识别(OCR)中的一个难题。在本文中,我们介绍了一种神经网络解决方案,它能够对相似字符之间的细微差异进行建模,并且对各种个人手写风格具有鲁棒性。自增长概率决策神经网络(SPDNN)是一种概率型神经网络,它采用具有非线性基函数的分层网络结构和竞争式信用分配方案。基于SPDNN模型,我们构建了一个三阶段识别系统。首先,一个粗分类器确定要输入到从大字符集(如混合了字母数字的中文)划分出的预定义子类之一中的字符。然后,一个字符识别器确定与子类中的参考字符最匹配的输入图像。最后,第三个模块是一个相似字符识别器,它可以进一步提高相似或易混淆字符之间的识别准确率。该原型系统已证明SPDNN在公共数据库CCL/HCCR1(5401个字符×200个样本)的相似手写中文识别中的成功应用。在性能方面,在CCL/HCCR1数据库上的实验分别产生了无拒识时90.12%的识别准确率和有6.7%拒识时94.11%的准确率。这种识别准确率比之前公布的性能提高了约4%。在处理速度方面,对于A4尺寸的字符图像,识别前的处理(包括图像预处理、分割和特征提取)在基于奔腾100的个人计算机上大约需要一秒钟,并且识别每个字符大约消耗0.27秒,无需使用任何硬件加速器或协处理器。