Wasim Muhammad, Ahmed Imran, Abbas Naveed, Saba Tanzila, Alamri Faten S, Elyassih Alex, Rehman Amjad
Department of Computer Science, Islamia College Peshawar, Peshawar, Pakistan.
School of Computing and Information Science, Anglia Ruskin University, Cambridge, UK.
Sci Rep. 2025 Jul 12;15(1):25250. doi: 10.1038/s41598-025-07620-3.
In computer vision, video analytic researchers have been developing techniques for human activity recognition in several application domains. Academic institutions are in possession of rich video repository generated by the surveillance system in respective campuses. One major requirement is to develop lightweight adaptable models capable of recognizing academic activities, enabling effective decision making in various application domains. This research article proposes a lightweight 3D-CNN architecture for recognizing a novel set of academic activities using a realistic campus video dataset. The proposed sequence learning model is obtained by utilizing spatial and temporal video information enabling accurate classification of the target activity sequences. The proposed model is compared with the LSTM model, a state-of-the-art algorithm for time-series and sequence-learning problems, by performing sufficient experimentations. Experimental results demonstrate that the proposed 3D-CNN model outperforms other variants, achieving the highest accuracy of 95%, minimum computational cost of 13.3 GFLOPs, and low memory overhead of 18,464 KB. These performance indicators make the proposed model an efficient and effective classifier for the proposed academic activity recognition task.
在计算机视觉领域,视频分析研究人员一直在多个应用领域开发人类活动识别技术。学术机构拥有各自校园监控系统生成的丰富视频库。一个主要需求是开发能够识别学术活动的轻量级自适应模型,以便在各种应用领域进行有效的决策。本文提出了一种轻量级3D-CNN架构,用于使用真实的校园视频数据集识别一组新颖的学术活动。所提出的序列学习模型是通过利用视频的空间和时间信息获得的,能够对目标活动序列进行准确分类。通过进行充分的实验,将所提出的模型与LSTM模型(一种用于时间序列和序列学习问题的先进算法)进行了比较。实验结果表明,所提出的3D-CNN模型优于其他变体,达到了95%的最高准确率、13.3 GFLOPs的最低计算成本和18464 KB的低内存开销。这些性能指标使所提出的模型成为所提出的学术活动识别任务的高效有效分类器。