Li Qinglan, Lei Jichong, Ren Changan, Peng Zhiqiang, Hong Jun
Department of Physical Education and Research, Hunan Institute of Technology, Hengyang, 421002, Hunan, China.
School of Safety and Management Engineering, Hunan Institute of Technology, Hengyang, 421002, Hunan, China.
Sci Rep. 2024 Aug 17;14(1):19087. doi: 10.1038/s41598-024-69965-5.
As computer image processing and digital technologies advance, creating an efficient method for classifying sports images is crucial for the rapid retrieval and management of large image datasets. Traditional manual methods for classifying sports images are impractical for large-scale data and often inaccurate when distinguishing similar images. This paper introduces an SE module that adaptively adjusts the weights of input feature mapping channels, and a Res module that excels in deep feature extraction, preventing gradient vanishing, multi-scale processing, and enhancing generalization in image recognition. Through extensive experimentation on network structure adjustments, the SE-RES-CNN neural network model is applied to sports image classification. The model is trained on a sports image classification dataset from Kaggle, alongside VGG-16 and ResNet50 models. Training results show that the proposed SE-RES-CNN model improves classification accuracy by approximately 5% compared to VGG-16 and ResNet50 models. Testing revealed that the SE-RES-CNN model classifies 100 out of 500 sports images in 6 s, achieving an accuracy rate of up to 98% and a single prediction time of 0.012 s. This validates the model's accuracy and effectiveness, significantly enhancing sports image retrieval and classification efficiency. This validates the model's accuracy and effectiveness, significantly enhancing sports image retrieval and classification efficiency.
随着计算机图像处理和数字技术的发展,创建一种高效的体育图像分类方法对于大型图像数据集的快速检索和管理至关重要。传统的体育图像手动分类方法对于大规模数据不实用,并且在区分相似图像时往往不准确。本文介绍了一种自适应调整输入特征映射通道权重的SE模块,以及一种在深度特征提取、防止梯度消失、多尺度处理和增强图像识别泛化能力方面表现出色的Res模块。通过对网络结构调整进行广泛实验,将SE-RES-CNN神经网络模型应用于体育图像分类。该模型在来自Kaggle的体育图像分类数据集上进行训练,同时与VGG-16和ResNet50模型进行对比。训练结果表明,与VGG-16和ResNet50模型相比,所提出的SE-RES-CNN模型的分类准确率提高了约5%。测试显示,SE-RES-CNN模型在6秒内对500张体育图像中的100张进行分类,准确率高达98%,单次预测时间为0.012秒。这验证了该模型的准确性和有效性,显著提高了体育图像检索和分类效率。这验证了该模型的准确性和有效性,显著提高了体育图像检索和分类效率。