• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于视频目标识别的神经形态系统。

A neuromorphic system for video object recognition.

作者信息

Khosla Deepak, Chen Yang, Kim Kyungnam

机构信息

HRL Laboratories, LLC Malibu, CA, USA.

出版信息

Front Comput Neurosci. 2014 Nov 28;8:147. doi: 10.3389/fncom.2014.00147. eCollection 2014.

DOI:10.3389/fncom.2014.00147
PMID:25506325
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4246681/
Abstract

Automated video object recognition is a topic of emerging importance in both defense and civilian applications. This work describes an accurate and low-power neuromorphic architecture and system for real-time automated video object recognition. Our system, Neuormorphic Visual Understanding of Scenes (NEOVUS), is inspired by computational neuroscience models of feed-forward object detection and classification pipelines for processing visual data. The NEOVUS architecture is inspired by the ventral (what) and dorsal (where) streams of the mammalian visual pathway and integrates retinal processing, object detection based on form and motion modeling, and object classification based on convolutional neural networks. The object recognition performance and energy use of the NEOVUS was evaluated by the Defense Advanced Research Projects Agency (DARPA) under the Neovision2 program using three urban area video datasets collected from a mix of stationary and moving platforms. These datasets are challenging and include a large number of objects of different types in cluttered scenes, with varying illumination and occlusion conditions. In a systematic evaluation of five different teams by DARPA on these datasets, the NEOVUS demonstrated the best performance with high object recognition accuracy and the lowest energy consumption. Its energy use was three orders of magnitude lower than two independent state of the art baseline computer vision systems. The dynamic power requirement for the complete system mapped to commercial off-the-shelf (COTS) hardware that includes a 5.6 Megapixel color camera processed by object detection and classification algorithms at 30 frames per second was measured at 21.7 Watts (W), for an effective energy consumption of 5.45 nanoJoules (nJ) per bit of incoming video. These unprecedented results show that the NEOVUS has the potential to revolutionize automated video object recognition toward enabling practical low-power and mobile video processing applications.

摘要

自动视频目标识别在国防和民用应用中都是一个日益重要的话题。这项工作描述了一种用于实时自动视频目标识别的精确且低功耗的神经形态架构和系统。我们的系统,即神经形态场景视觉理解(NEOVUS),受到用于处理视觉数据的前馈目标检测和分类管道的计算神经科学模型的启发。NEOVUS架构受到哺乳动物视觉通路的腹侧(“什么”)和背侧(“哪里”)信息流的启发,并整合了视网膜处理、基于形状和运动建模的目标检测以及基于卷积神经网络的目标分类。国防高级研究计划局(DARPA)在Neovision2计划下,使用从固定和移动平台混合收集的三个市区视频数据集,对NEOVUS的目标识别性能和能源使用情况进行了评估。这些数据集具有挑战性,包括杂乱场景中大量不同类型的目标,光照和遮挡条件各不相同。在DARPA对这些数据集的五个不同团队进行的系统评估中,NEOVUS表现出了最佳性能,具有高目标识别准确率和最低能耗。其能源使用比两个独立的先进基线计算机视觉系统低三个数量级。映射到商用现货(COTS)硬件的完整系统的动态功率需求,该硬件包括一个560万像素的彩色相机,由目标检测和分类算法以每秒30帧的速度处理,测量结果为21.7瓦(W),对于每比特输入视频的有效能耗为5.45纳焦(nJ)。这些前所未有的结果表明,NEOVUS有潜力彻底改变自动视频目标识别,以实现实用的低功耗和移动视频处理应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/9e7f02389fd6/fncom-08-00147-g0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/497bd72162e9/fncom-08-00147-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/815433e575c1/fncom-08-00147-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/b5344f17d3db/fncom-08-00147-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/04abd6b22d7f/fncom-08-00147-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/731be45dd741/fncom-08-00147-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/152d8b6ed50d/fncom-08-00147-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/475252f889ff/fncom-08-00147-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/0636b9aa6344/fncom-08-00147-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/79e4d83cc005/fncom-08-00147-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/8b7c1fb46a2f/fncom-08-00147-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/773b36551c93/fncom-08-00147-g0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/f225debb28bb/fncom-08-00147-g0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/9e7f02389fd6/fncom-08-00147-g0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/497bd72162e9/fncom-08-00147-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/815433e575c1/fncom-08-00147-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/b5344f17d3db/fncom-08-00147-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/04abd6b22d7f/fncom-08-00147-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/731be45dd741/fncom-08-00147-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/152d8b6ed50d/fncom-08-00147-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/475252f889ff/fncom-08-00147-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/0636b9aa6344/fncom-08-00147-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/79e4d83cc005/fncom-08-00147-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/8b7c1fb46a2f/fncom-08-00147-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/773b36551c93/fncom-08-00147-g0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/f225debb28bb/fncom-08-00147-g0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e56/4246681/9e7f02389fd6/fncom-08-00147-g0013.jpg

相似文献

1
A neuromorphic system for video object recognition.一种用于视频目标识别的神经形态系统。
Front Comput Neurosci. 2014 Nov 28;8:147. doi: 10.3389/fncom.2014.00147. eCollection 2014.
2
Low-Power Dynamic Object Detection and Classification With Freely Moving Event Cameras.基于自由移动事件相机的低功耗动态目标检测与分类
Front Neurosci. 2020 Feb 20;14:135. doi: 10.3389/fnins.2020.00135. eCollection 2020.
3
Highly accurate moving object detection in variable bit rate video-based traffic monitoring systems.基于可变比特率视频的交通监控系统中的高精度移动物体检测。
IEEE Trans Neural Netw Learn Syst. 2013 Dec;24(12):1920-31. doi: 10.1109/TNNLS.2013.2270314.
4
Bio-Inspired Video Enhancement for Small Moving Target Detection.生物启发的视频增强用于小运动目标检测。
IEEE Trans Image Process. 2021;30:1232-1244. doi: 10.1109/TIP.2020.3043113. Epub 2020 Dec 21.
5
Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture.使用神经形态架构从关键姿势进行实时生物启发式动作识别。
Front Neurorobot. 2017 Mar 22;11:13. doi: 10.3389/fnbot.2017.00013. eCollection 2017.
6
A Hybrid Neuromorphic Object Tracking and Classification Framework for Real-Time Systems.一种用于实时系统的混合神经形态目标跟踪与分类框架。
IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):10726-10735. doi: 10.1109/TNNLS.2023.3243679. Epub 2024 Aug 5.
7
Track Everything: Limiting Prior Knowledge in Online Multi-Object Recognition.追踪一切:限制在线多目标识别中的先验知识。
IEEE Trans Image Process. 2017 Oct;26(10):4669-4683. doi: 10.1109/TIP.2017.2696744. Epub 2017 Apr 24.
8
SEEMORE: combining color, shape, and texture histogramming in a neurally inspired approach to visual object recognition.SEEMORE:在一种受神经启发的视觉目标识别方法中结合颜色、形状和纹理直方图
Neural Comput. 1997 May 15;9(4):777-804. doi: 10.1162/neco.1997.9.4.777.
9
Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems.用于汽车视觉系统的深度学习色调映射与去马赛克
Sensors (Basel). 2023 Oct 17;23(20):8507. doi: 10.3390/s23208507.
10
Monocular visual scene understanding: understanding multi-object traffic scenes.单目视觉场景理解:理解多目标交通场景。
IEEE Trans Pattern Anal Mach Intell. 2013 Apr;35(4):882-97. doi: 10.1109/TPAMI.2012.174.

引用本文的文献

1
Application of Event Cameras and Neuromorphic Computing to VSLAM: A Survey.事件相机与神经形态计算在视觉同步定位与地图构建中的应用:综述
Biomimetics (Basel). 2024 Jul 20;9(7):444. doi: 10.3390/biomimetics9070444.
2
Editorial: Hierarchical Object Representations in the Visual Cortex and Computer Vision.社论:视觉皮层与计算机视觉中的分层对象表示
Front Comput Neurosci. 2015 Nov 20;9:142. doi: 10.3389/fncom.2015.00142. eCollection 2015.

本文引用的文献

1
Cortical dynamics of contextually cued attentive visual learning and search: spatial and object evidence accumulation.语境线索引导的注意视觉学习和搜索的皮层动力学:空间和目标证据积累。
Psychol Rev. 2010 Oct;117(4):1080-112. doi: 10.1037/a0020664.
2
Vehicle detection using partial least squares.基于偏最小二乘法的车辆检测。
IEEE Trans Pattern Anal Mach Intell. 2011 Jun;33(6):1250-65. doi: 10.1109/TPAMI.2010.182.
3
Object detection with discriminatively trained part-based models.基于判别式训练的部件模型的目标检测。
IEEE Trans Pattern Anal Mach Intell. 2010 Sep;32(9):1627-45. doi: 10.1109/TPAMI.2009.167.
4
Framework for performance evaluation of face, text, and vehicle detection and tracking in video: data, metrics, and protocol.视频中人脸、文本及车辆检测与跟踪性能评估框架:数据、指标与协议
IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):319-36. doi: 10.1109/TPAMI.2008.57.
5
Working memory for visual objects: complementary roles of inferior temporal, medial temporal, and prefrontal cortex.视觉对象的工作记忆:颞下回、内侧颞叶和前额叶皮质的互补作用。
Neuroscience. 2006 Apr 28;139(1):277-89. doi: 10.1016/j.neuroscience.2005.06.092. Epub 2005 Dec 15.
6
Mechanisms of visual object recognition studied in monkeys.在猴子身上研究视觉物体识别的机制。
Spat Vis. 2000;13(2-3):147-63. doi: 10.1163/156856800741171.
7
Evidence for boundary-specific grouping.
Vision Res. 1998 Jan;38(1):143-52. doi: 10.1016/s0042-6989(97)00138-7.