DBGC：面向目标识别的基于维度的泛化卷积块。

DBGC: Dimension-Based Generic Convolution Block for Object Recognition.

机构信息

Department of Computer Engineering, Devang Patel Institute of Advance Technology and Research (DEPSTAR), Faculty of Technology and Engineering (FTE), CHARUSAT Campus, Charotar University of Science and Technology (CHARUSAT), Changa 388421, India.

Parul University, Vadodara 382030, Gujarat, India.

出版信息

Sensors (Basel). 2022 Feb 24;22(5):1780. doi: 10.3390/s22051780.

DOI:10.3390/s22051780

PMID:35270929

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8914730/

Abstract

The object recognition concept is being widely used a result of increasing CCTV surveillance and the need for automatic object or activity detection from images or video. Increases in the use of various sensor networks have also raised the need of lightweight process frameworks. Much research has been carried out in this area, but the research scope is colossal as it deals with open-ended problems such as being able to achieve high accuracy in little time using lightweight process frameworks. Convolution Neural Networks and their variants are widely used in various computer vision activities, but most of the architectures of CNN are application-specific. There is always a need for generic architectures with better performance. This paper introduces the Dimension-Based Generic Convolution Block (DBGC), which can be used with any CNN to make the architecture generic and provide a dimension-wise selection of various height, width, and depth kernels. This single unit which uses the separable convolution concept provides multiple combinations using various dimension-based kernels. This single unit can be used for height-based, width-based, or depth-based dimensions; the same unit can even be used for height and width, width and depth, and depth and height dimensions. It can also be used for combinations involving all three dimensions of height, width, and depth. The main novelty of DBGC lies in the dimension selector block included in the proposed architecture. Proposed unoptimized kernel dimensions reduce FLOPs by around one third and also reduce the accuracy by around one half; semi-optimized kernel dimensions yield almost the same or higher accuracy with half the FLOPs of the original architecture, while optimized kernel dimensions provide 5 to 6% higher accuracy with around a 10 M reduction in FLOPs.

摘要

由于闭路电视监控的增加以及需要从图像或视频中自动检测目标或活动，目标识别的概念得到了广泛应用。各种传感器网络的使用增加也提高了对轻量级处理框架的需求。在这一领域已经进行了大量的研究，但研究范围是巨大的，因为它涉及到一些开放性问题，例如如何在使用轻量级处理框架的情况下，在短时间内实现高精度。卷积神经网络及其变体在各种计算机视觉活动中得到了广泛的应用，但大多数 CNN 架构都是特定于应用的。总是需要具有更好性能的通用架构。本文介绍了基于维度的通用卷积块（DBGC），它可以与任何 CNN 一起使用，使架构通用，并提供各种高度、宽度和深度内核的维度选择。这个使用可分离卷积概念的单个单元使用各种基于维度的内核提供了多种组合。这个单个单元可以用于基于高度、基于宽度或基于深度的维度；同一个单元甚至可以用于高度和宽度、宽度和深度以及深度和高度维度。它还可以用于涉及高度、宽度和深度这三个维度的组合。DBGC 的主要新颖之处在于所提出的架构中包含的维度选择器块。所提出的未优化内核维度将 FLOPs 减少了约三分之一，并且将准确性降低了约一半；半优化内核维度的 FLOPs 几乎与原始架构相同或更高，而优化内核维度的 FLOPs 减少了约 10M，但准确性提高了 5%至 6%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e99/8914730/5690de7e3da8/sensors-22-01780-g001.jpg

相似文献

DBGC: Dimension-Based Generic Convolution Block for Object Recognition.DBGC：面向目标识别的基于维度的泛化卷积块。

Sensors (Basel). 2022 Feb 24;22(5):1780. doi: 10.3390/s22051780.

DiCENet: Dimension-Wise Convolutions for Efficient Networks.DiCENet：高效网络的维度卷积。

IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2416-2425. doi: 10.1109/TPAMI.2020.3041871. Epub 2022 Apr 1.

Learning hidden patterns from patient multivariate time series data using convolutional neural networks: A case study of healthcare cost prediction.使用卷积神经网络从患者多变量时间序列数据中学习隐藏模式：以医疗保健成本预测为例。

J Biomed Inform. 2020 Nov;111:103565. doi: 10.1016/j.jbi.2020.103565. Epub 2020 Sep 25.

Sensor-Based Human Activity Recognition with Spatio-Temporal Deep Learning.基于传感器的人机活动识别的时空深度学习。

Sensors (Basel). 2021 Mar 18;21(6):2141. doi: 10.3390/s21062141.

CodnNet: A lightweight CNN architecture for detection of COVID-19 infection.CodnNet：一种用于检测新冠病毒感染的轻量级卷积神经网络架构。

Appl Soft Comput. 2022 Nov;130:109656. doi: 10.1016/j.asoc.2022.109656. Epub 2022 Sep 24.

A Resource-Efficient CNN-Based Method for Moving Vehicle Detection.一种基于资源高效 CNN 的移动车辆检测方法。

Sensors (Basel). 2022 Feb 4;22(3):1193. doi: 10.3390/s22031193.

Lightweight Separable Convolution Network for Breast Cancer Histopathological Identification.用于乳腺癌组织病理学识别的轻量级可分离卷积网络

Diagnostics (Basel). 2023 Jan 13;13(2):299. doi: 10.3390/diagnostics13020299.

SOKS: Automatic Searching of the Optimal Kernel Shapes for Stripe-Wise Network Pruning.SOKS：用于逐条纹网络剪枝的最优内核形状自动搜索

IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):9912-9924. doi: 10.1109/TNNLS.2022.3162067. Epub 2023 Nov 30.

Object detection based on an adaptive attention mechanism.基于自适应注意力机制的目标检测。

Sci Rep. 2020 Jul 9;10(1):11307. doi: 10.1038/s41598-020-67529-x.

CNN Deep Learning with Wavelet Image Fusion of CCD RGB-IR and Depth-Grayscale Sensor Data for Hand Gesture Intention Recognition.CNN 基于 CCD RGB-IR 与深度灰度传感器数据的子波图像融合的深度学习在手势意图识别中的应用。

Sensors (Basel). 2022 Jan 21;22(3):803. doi: 10.3390/s22030803.

引用本文的文献

An Improved Skin Lesion Classification Using a Hybrid Approach with Active Contour Snake Model and Lightweight Attention-Guided Capsule Networks.一种使用主动轮廓蛇模型和轻量级注意力引导胶囊网络的混合方法改进的皮肤病变分类。

Diagnostics (Basel). 2024 Mar 17;14(6):636. doi: 10.3390/diagnostics14060636.

Evaluating Retinal Disease Diagnosis with an Interpretable Lightweight CNN Model Resistant to Adversarial Attacks.使用抗对抗攻击的可解释轻量级卷积神经网络模型评估视网膜疾病诊断

J Imaging. 2023 Oct 11;9(10):219. doi: 10.3390/jimaging9100219.

A New Target Detection Method of Ferrography Wear Particle Images Based on ECAM-YOLOv5-BiFPN Network.一种基于ECAM-YOLOv5-BiFPN网络的铁谱磨损颗粒图像目标检测新方法

Sensors (Basel). 2023 Jul 18;23(14):6477. doi: 10.3390/s23146477.

CNN-LSTM Model for Recognizing Video-Recorded Actions Performed in a Traditional Chinese Exercise.用于识别传统中国功法中视频记录动作的 CNN-LSTM 模型。

IEEE J Transl Eng Health Med. 2023 Jun 2;11:351-359. doi: 10.1109/JTEHM.2023.3282245. eCollection 2023.

Human Behavior Recognition via Hierarchical Patches Descriptor and Approximate Locality-Constrained Linear Coding.基于分层补丁描述符和近似局部约束线性编码的人类行为识别

Sensors (Basel). 2023 May 29;23(11):5179. doi: 10.3390/s23115179.

A Small Object Detection Algorithm Based on Modulated Deformable Convolution and Large Kernel Convolution.基于调制变形卷积和大核卷积的小物体检测算法。

Comput Intell Neurosci. 2023 Jan 24;2023:2506274. doi: 10.1155/2023/2506274. eCollection 2023.

Analysis of cranial ultrasound images for newborn.新生儿颅脑超声图像分析

Front Neurol. 2023 Jan 4;13:1090275. doi: 10.3389/fneur.2022.1090275. eCollection 2022.

Attention-Based Sentiment Region Importance and Relationship Analysis for Image Sentiment Recognition.基于注意力的情感区域重要性和关系分析在图像情感识别中的应用。

Comput Intell Neurosci. 2022 Nov 17;2022:9772714. doi: 10.1155/2022/9772714. eCollection 2022.

Multiscale Traffic Sign Detection Method in Complex Environment Based on YOLOv4.基于 YOLOv4 的复杂环境下多尺度交通标志检测方法。

Comput Intell Neurosci. 2022 Oct 22;2022:5297605. doi: 10.1155/2022/5297605. eCollection 2022.

Progressive Rain Removal Based on the Combination Network of CNN and Transformer.基于 CNN 和 Transformer 组合网络的递进式雨痕去除。

Comput Intell Neurosci. 2022 Sep 24;2022:5067175. doi: 10.1155/2022/5067175. eCollection 2022.

本文引用的文献

CP-BDHCA: Blockchain-Based Confidentiality-Privacy Preserving Big Data Scheme for Healthcare Clouds and Applications.基于区块链的医疗云与应用中数据机密性和隐私保护的大数据方案（CP-BDHCA）

IEEE J Biomed Health Inform. 2022 May;26(5):1937-1948. doi: 10.1109/JBHI.2021.3097237. Epub 2022 May 5.

Histogram of Oriented Gradient-Based Fusion of Features for Human Action Recognition in Action Video Sequences.基于方向梯度直方图的动作视频序列中人体动作识别特征融合直方图

Sensors (Basel). 2020 Dec 18;20(24):7299. doi: 10.3390/s20247299.

DiCENet: Dimension-Wise Convolutions for Efficient Networks.DiCENet：高效网络的维度卷积。

IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2416-2425. doi: 10.1109/TPAMI.2020.3041871. Epub 2022 Apr 1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

DBGC：面向目标识别的基于维度的泛化卷积块。

DBGC: Dimension-Based Generic Convolution Block for Object Recognition.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献