Soudy Mohamed, Afify Yasmine, Badr Nagwa
Bioinformatics Program, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt.
Information Systems Department, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt.
PeerJ Comput Sci. 2021 Sep 20;7:e666. doi: 10.7717/peerj-cs.666. eCollection 2021.
Image understanding and scene classification are keystone tasks in computer vision. The development of technologies and profusion of existing datasets open a wide room for improvement in the image classification and recognition research area. Notwithstanding the optimal performance of exiting machine learning models in image understanding and scene classification, there are still obstacles to overcome. All models are data-dependent that can only classify samples close to the training set. Moreover, these models require large data for training and learning. The first problem is solved by few-shot learning, which achieves optimal performance in object detection and classification but with a lack of eligible attention in the scene classification task. Motivated by these findings, in this paper, we introduce two models for few-shot learning in scene classification. In order to trace the behavior of those models, we also introduce two datasets (MiniSun; MiniPlaces) for image scene classification. Experimental results show that the proposed models outperform the benchmark approaches in respect of classification accuracy.
图像理解和场景分类是计算机视觉中的关键任务。技术的发展和现有数据集的丰富为图像分类和识别研究领域的改进提供了广阔空间。尽管现有机器学习模型在图像理解和场景分类方面表现出色,但仍有障碍需要克服。所有模型都依赖数据,只能对接近训练集的样本进行分类。此外,这些模型需要大量数据进行训练和学习。少样本学习解决了第一个问题,它在目标检测和分类中取得了最佳性能,但在场景分类任务中缺乏足够的关注。受这些发现的启发,在本文中,我们介绍了两种用于场景分类的少样本学习模型。为了追踪这些模型的行为,我们还引入了两个用于图像场景分类的数据集(MiniSun;MiniPlaces)。实验结果表明,所提出的模型在分类准确率方面优于基准方法。