用于视觉处理的帧约束固定像素值卷积神经网络与无帧脉冲动态像素卷积神经网络的比较

Comparison between Frame-Constrained Fix-Pixel-Value and Frame-Free Spiking-Dynamic-Pixel ConvNets for Visual Processing.

作者信息

Farabet Clément, Paz Rafael, Pérez-Carrasco Jose, Zamarreño-Ramos Carlos, Linares-Barranco Alejandro, Lecun Yann, Culurciello Eugenio, Serrano-Gotarredona Teresa, Linares-Barranco Bernabe

机构信息

Computer Science Department, Courant Institute of Mathematical Sciences, New York University New York, NY, USA.

出版信息

Front Neurosci. 2012 Apr 10;6:32. doi: 10.3389/fnins.2012.00032. eCollection 2012.

DOI:10.3389/fnins.2012.00032

PMID:22518097

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3324817/

Abstract

Most scene segmentation and categorization architectures for the extraction of features in images and patches make exhaustive use of 2D convolution operations for template matching, template search, and denoising. Convolutional Neural Networks (ConvNets) are one example of such architectures that can implement general-purpose bio-inspired vision systems. In standard digital computers 2D convolutions are usually expensive in terms of resource consumption and impose severe limitations for efficient real-time applications. Nevertheless, neuro-cortex inspired solutions, like dedicated Frame-Based or Frame-Free Spiking ConvNet Convolution Processors, are advancing real-time visual processing. These two approaches share the neural inspiration, but each of them solves the problem in different ways. Frame-Based ConvNets process frame by frame video information in a very robust and fast way that requires to use and share the available hardware resources (such as: multipliers, adders). Hardware resources are fixed- and time-multiplexed by fetching data in and out. Thus memory bandwidth and size is important for good performance. On the other hand, spike-based convolution processors are a frame-free alternative that is able to perform convolution of a spike-based source of visual information with very low latency, which makes ideal for very high-speed applications. However, hardware resources need to be available all the time and cannot be time-multiplexed. Thus, hardware should be modular, reconfigurable, and expansible. Hardware implementations in both VLSI custom integrated circuits (digital and analog) and FPGA have been already used to demonstrate the performance of these systems. In this paper we present a comparison study of these two neuro-inspired solutions. A brief description of both systems is presented and also discussions about their differences, pros and cons.

摘要

大多数用于图像和图像块特征提取的场景分割与分类架构，在模板匹配、模板搜索和去噪过程中都充分利用了二维卷积操作。卷积神经网络（ConvNets）就是这类能够实现通用生物启发式视觉系统的架构之一。在标准数字计算机中，二维卷积在资源消耗方面通常代价高昂，并且对高效实时应用造成了严重限制。尽管如此，受神经皮层启发的解决方案，如专用的基于帧或无帧脉冲卷积网络卷积处理器，正在推动实时视觉处理的发展。这两种方法都有神经学启发，但它们以不同方式解决问题。基于帧的卷积网络以非常稳健且快速的方式逐帧处理视频信息，这需要使用并共享可用的硬件资源（如乘法器、加法器）。通过数据的输入和输出，硬件资源进行固定和时间复用。因此，内存带宽和大小对于良好性能很重要。另一方面，基于脉冲的卷积处理器是一种无帧替代方案，能够以非常低的延迟对基于脉冲的视觉信息源进行卷积，这使其非常适合超高速应用。然而，硬件资源需要始终可用，且不能进行时间复用。因此，硬件应该是模块化、可重新配置且可扩展的。超大规模集成电路定制集成电路（数字和模拟）以及现场可编程门阵列中的硬件实现都已被用于展示这些系统的性能。在本文中，我们对这两种受神经启发的解决方案进行了比较研究。介绍了这两种系统的简要描述，并讨论了它们的差异、优缺点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377b/3324817/9f4753f09191/fnins-06-00032-g001.jpg

相似文献

Comparison between Frame-Constrained Fix-Pixel-Value and Frame-Free Spiking-Dynamic-Pixel ConvNets for Visual Processing.

Front Neurosci. 2012 Apr 10;6:32. doi: 10.3389/fnins.2012.00032. eCollection 2012.

Mapping from frame-driven to frame-free event-driven vision systems by low-rate rate coding and coincidence processing--application to feedforward ConvNets.

IEEE Trans Pattern Anal Mach Intell. 2013 Nov;35(11):2706-19. doi: 10.1109/TPAMI.2013.71.

A Cost-Efficient High-Speed VLSI Architecture for Spiking Convolutional Neural Network Inference Using Time-Step Binary Spike Maps.

Sensors (Basel). 2021 Sep 8;21(18):6006. doi: 10.3390/s21186006.

A Configurable Event-Driven Convolutional Node with Rate Saturation Mechanism for Modular ConvNet Systems Implementation.

Front Neurosci. 2018 Feb 20;12:63. doi: 10.3389/fnins.2018.00063. eCollection 2018.

Fast vision through frameless event-based sensing and convolutional processing: application to texture recognition.

IEEE Trans Neural Netw. 2010 Apr;21(4):609-20. doi: 10.1109/TNN.2009.2039943. Epub 2010 Feb 22.

Robustness of spiking Deep Belief Networks to noise and reduced bit precision of neuro-inspired hardware platforms.

Front Neurosci. 2015 Jul 9;9:222. doi: 10.3389/fnins.2015.00222. eCollection 2015.

Bio-mimetic high-speed target localization with fused frame and event vision for edge application.

Front Neurosci. 2022 Nov 25;16:1010302. doi: 10.3389/fnins.2022.1010302. eCollection 2022.

On the Reduction of Computational Complexity of Deep Convolutional Neural Networks.

Entropy (Basel). 2018 Apr 23;20(4):305. doi: 10.3390/e20040305.

High-Throughput Line Buffer Microarchitecture for Arbitrary Sized Streaming Image Processing.

J Imaging. 2019 Mar 6;5(3):34. doi: 10.3390/jimaging5030034.

An Improved VLSI Design of the ALU Based FIR Filter for Biomedical Image Filtering Application.

Curr Med Imaging. 2021;17(2):276-287. doi: 10.2174/1573405616999200817101950.

引用本文的文献

Event-Based Trajectory Prediction Using Spiking Neural Networks.

Front Comput Neurosci. 2021 May 24;15:658764. doi: 10.3389/fncom.2021.658764. eCollection 2021.

A High-Speed Low-Cost VLSI System Capable of On-Chip Online Learning for Dynamic Vision Sensor Data Classification.

Sensors (Basel). 2020 Aug 21;20(17):4715. doi: 10.3390/s20174715.

Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA.

Sensors (Basel). 2020 Jun 16;20(12):3404. doi: 10.3390/s20123404.

Efficient Processing of Spatio-Temporal Data Streams With Spiking Neural Networks.

Front Neurosci. 2020 May 5;14:439. doi: 10.3389/fnins.2020.00439. eCollection 2020.

Neuromorphic Spiking Neural Networks and Their Memristor-CMOS Hardware Implementations.

Materials (Basel). 2019 Aug 27;12(17):2745. doi: 10.3390/ma12172745.

Going Deeper in Spiking Neural Networks: VGG and Residual Architectures.

Front Neurosci. 2019 Mar 7;13:95. doi: 10.3389/fnins.2019.00095. eCollection 2019.

Less Data Same Information for Event-Based Sensors: A Bioinspired Filtering and Data Reduction Algorithm.

Sensors (Basel). 2018 Nov 24;18(12):4122. doi: 10.3390/s18124122.

Deep Learning With Spiking Neurons: Opportunities and Challenges.

Front Neurosci. 2018 Oct 25;12:774. doi: 10.3389/fnins.2018.00774. eCollection 2018.

A Configurable Event-Driven Convolutional Node with Rate Saturation Mechanism for Modular ConvNet Systems Implementation.

Front Neurosci. 2018 Feb 20;12:63. doi: 10.3389/fnins.2018.00063. eCollection 2018.

Feature Representations for Neuromorphic Audio Spike Streams.

Front Neurosci. 2018 Feb 9;12:23. doi: 10.3389/fnins.2018.00023. eCollection 2018.

本文引用的文献

Nonlinear Image Representation Using Divisive Normalization.

Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2008;2008:1-8. doi: 10.1109/CVPR.2008.4587821.

CAVIAR: a 45k neuron, 5M synapse, 12G connects/s AER hardware sensory-processing- learning-actuating system for high-speed visual object recognition and tracking.

IEEE Trans Neural Netw. 2009 Sep;20(9):1417-38. doi: 10.1109/TNN.2009.2023653. Epub 2009 Jul 24.

Equal numbers of neuronal and nonneuronal cells make the human brain an isometrically scaled-up primate brain.

J Comp Neurol. 2009 Apr 10;513(5):532-41. doi: 10.1002/cne.21974.

Application of the ANNA neural network chip to high-speed character recognition.

IEEE Trans Neural Netw. 1992;3(3):498-505. doi: 10.1109/72.129422.

Evaluation of convolutional neural networks for visual recognition.

IEEE Trans Neural Netw. 1998;9(4):685-96. doi: 10.1109/72.701181.

Why is real-world visual object recognition hard?

PLoS Comput Biol. 2008 Jan;4(1):e27. doi: 10.1371/journal.pcbi.0040027.

A multichip neuromorphic system for spike-based visual information processing.

Neural Comput. 2007 Sep;19(9):2281-300. doi: 10.1162/neco.2007.19.9.2281.

Optic nerve signals in a neuromorphic chip I: Outer and inner retina models.

IEEE Trans Biomed Eng. 2004 Apr;51(4):657-66. doi: 10.1109/tbme.2003.821039.

Speed of processing in the human visual system.

Nature. 1996 Jun 6;381(6582):520-2. doi: 10.1038/381520a0.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于视觉处理的帧约束固定像素值卷积神经网络与无帧脉冲动态像素卷积神经网络的比较

Comparison between Frame-Constrained Fix-Pixel-Value and Frame-Free Spiking-Dynamic-Pixel ConvNets for Visual Processing.

作者信息

Farabet Clément, Paz Rafael, Pérez-Carrasco Jose, Zamarreño-Ramos Carlos, Linares-Barranco Alejandro, Lecun Yann, Culurciello Eugenio, Serrano-Gotarredona Teresa, Linares-Barranco Bernabe

机构信息

Computer Science Department, Courant Institute of Mathematical Sciences, New York University New York, NY, USA.

出版信息

Front Neurosci. 2012 Apr 10;6:32. doi: 10.3389/fnins.2012.00032. eCollection 2012.

DOI:10.3389/fnins.2012.00032

PMID:22518097

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3324817/

Abstract

摘要

用于视觉处理的帧约束固定像素值卷积神经网络与无帧脉冲动态像素卷积神经网络的比较

Comparison between Frame-Constrained Fix-Pixel-Value and Frame-Free Spiking-Dynamic-Pixel ConvNets for Visual Processing.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于视觉处理的帧约束固定像素值卷积神经网络与无帧脉冲动态像素卷积神经网络的比较

Comparison between Frame-Constrained Fix-Pixel-Value and Frame-Free Spiking-Dynamic-Pixel ConvNets for Visual Processing.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献