Suppr超能文献

人类行为识别:最佳深度学习特征选择和基于序列的扩展融合范例。

Human Action Recognition: A Paradigm of Best Deep Learning Features Selection and Serial Based Extended Fusion.

机构信息

Department of Computer Science, HITEC University Taxila, Txila 47080, Pakistan.

College of Computer Science and Engineering, University of Ha'il, Ha'il 55211, Saudi Arabia.

出版信息

Sensors (Basel). 2021 Nov 28;21(23):7941. doi: 10.3390/s21237941.

Abstract

Human action recognition (HAR) has gained significant attention recently as it can be adopted for a smart surveillance system in Multimedia. However, HAR is a challenging task because of the variety of human actions in daily life. Various solutions based on computer vision (CV) have been proposed in the literature which did not prove to be successful due to large video sequences which need to be processed in surveillance systems. The problem exacerbates in the presence of multi-view cameras. Recently, the development of deep learning (DL)-based systems has shown significant success for HAR even for multi-view camera systems. In this research work, a DL-based design is proposed for HAR. The proposed design consists of multiple steps including feature mapping, feature fusion and feature selection. For the initial feature mapping step, two pre-trained models are considered, such as DenseNet201 and InceptionV3. Later, the extracted deep features are fused using the Serial based Extended (SbE) approach. Later on, the best features are selected using Kurtosis-controlled Weighted KNN. The selected features are classified using several supervised learning algorithms. To show the efficacy of the proposed design, we used several datasets, such as KTH, IXMAS, WVU, and Hollywood. Experimental results showed that the proposed design achieved accuracies of 99.3%, 97.4%, 99.8%, and 99.9%, respectively, on these datasets. Furthermore, the feature selection step performed better in terms of computational time compared with the state-of-the-art.

摘要

人体动作识别(HAR)最近受到了广泛关注,因为它可以被应用于多媒体中的智能监控系统。然而,HAR 是一项具有挑战性的任务,因为日常生活中的人体动作多种多样。文献中提出了各种基于计算机视觉(CV)的解决方案,但由于监控系统中需要处理大量的视频序列,这些解决方案并没有取得成功。当存在多视角摄像机时,问题会更加严重。最近,基于深度学习(DL)的系统的发展即使对于多视角摄像机系统,也为 HAR 带来了显著的成功。在这项研究工作中,提出了一种基于 DL 的 HAR 设计。该设计包括多个步骤,包括特征映射、特征融合和特征选择。在初始特征映射步骤中,考虑了两个预先训练的模型,如 DenseNet201 和 InceptionV3。之后,使用基于串行的扩展(SbE)方法融合提取的深度特征。之后,使用峰度控制加权 KNN 选择最佳特征。使用几种监督学习算法对选择的特征进行分类。为了展示所提出设计的有效性,我们使用了几个数据集,如 KTH、IXMAS、WVU 和 Hollywood。实验结果表明,所提出的设计在这些数据集上的准确率分别达到了 99.3%、97.4%、99.8%和 99.9%。此外,与现有技术相比,特征选择步骤在计算时间方面表现更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da6d/8659437/67d9042610cb/sensors-21-07941-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验