Ramesh G, Shreyas J, Balaji J Manoj, Sharma Ganesh N, Gururaj H L, Srinidhi N N, Askar S S, Abouhawwash Mohamed
Department of AIML-Artificial Intelligence & Machine Learning, Alva's Institute of Engineering and Technology, Mangalore, Karnataka, India.
Department of Information Technology, Manipal Institute of Technology Bengaluru, Manipal Academy of Higher Education, Manipal, Karnataka, India.
Front Neurosci. 2024 Apr 12;18:1362567. doi: 10.3389/fnins.2024.1362567. eCollection 2024.
Handwritten character recognition is one of the classical problems in the field of image classification. Supervised learning techniques using deep learning models are highly effective in their application to handwritten character recognition. However, they require a large dataset of labeled samples to achieve good accuracies. Recent supervised learning techniques for Kannada handwritten character recognition have state of the art accuracy and perform well over a large range of input variations. In this work, a framework is proposed for the Kannada language that incorporates techniques from semi-supervised learning. The framework uses features extracted from a convolutional neural network backbone and uses regularization to improve the trained features and label propagation to classify previously unseen characters. The episodic learning framework is used to validate the framework. Twenty-four classes are used for pre-training, 12 classes are used for testing and 11 classes are used for validation. Fine-tuning is tested using one example per unseen class and five examples per unseen class. Through experimentation the components of the network are implemented in Python using the Pytorch library. It is shown that the accuracy obtained 99.13% make this framework competitive with the currently available supervised learning counterparts, despite the large reduction in the number of labeled samples available for the novel classes.
手写字符识别是图像分类领域的经典问题之一。使用深度学习模型的监督学习技术在应用于手写字符识别方面非常有效。然而,它们需要大量带标签样本的数据集才能获得良好的准确率。最近用于卡纳达语手写字符识别的监督学习技术具有当前最优的准确率,并且在大范围的输入变化下表现良好。在这项工作中,提出了一个针对卡纳达语的框架,该框架融合了半监督学习技术。该框架使用从卷积神经网络主干提取的特征,并使用正则化来改进训练后的特征,以及使用标签传播来对以前未见过的字符进行分类。情节学习框架用于验证该框架。24个类别用于预训练,12个类别用于测试,11个类别用于验证。使用每个未见类别一个示例和每个未见类别五个示例来测试微调。通过实验,使用Pytorch库在Python中实现了网络的组件。结果表明,尽管新类别可用的带标签样本数量大幅减少,但所获得的99.13%的准确率使该框架与目前可用的监督学习对应方法具有竞争力。