• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于视频目标检测的新一代深度学习:综述

New Generation Deep Learning for Video Object Detection: A Survey.

作者信息

Jiao Licheng, Zhang Ruohan, Liu Fang, Yang Shuyuan, Hou Biao, Li Lingling, Tang Xu

出版信息

IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3195-3215. doi: 10.1109/TNNLS.2021.3053249. Epub 2022 Aug 3.

DOI:10.1109/TNNLS.2021.3053249
PMID:33534715
Abstract

Video object detection, a basic task in the computer vision field, is rapidly evolving and widely used. In recent years, deep learning methods have rapidly become widespread in the field of video object detection, achieving excellent results compared with those of traditional methods. However, the presence of duplicate information and abundant spatiotemporal information in video data poses a serious challenge to video object detection. Therefore, in recent years, many scholars have investigated deep learning detection algorithms in the context of video data and have achieved remarkable results. Considering the wide range of applications, a comprehensive review of the research related to video object detection is both a necessary and challenging task. This survey attempts to link and systematize the latest cutting-edge research on video object detection with the goal of classifying and analyzing video detection algorithms based on specific representative models. The differences and connections between video object detection and similar tasks are systematically demonstrated, and the evaluation metrics and video detection performance of nearly 40 models on two data sets are presented. Finally, the various applications and challenges facing video object detection are discussed.

摘要

视频目标检测作为计算机视觉领域的一项基础任务,正在迅速发展并得到广泛应用。近年来,深度学习方法在视频目标检测领域迅速普及,与传统方法相比取得了优异的成果。然而,视频数据中存在的重复信息和丰富的时空信息给视频目标检测带来了严峻挑战。因此,近年来许多学者在视频数据背景下研究深度学习检测算法,并取得了显著成果。考虑到应用范围广泛,对视频目标检测相关研究进行全面综述既是一项必要的任务,也是一项具有挑战性的任务。本综述试图将视频目标检测的最新前沿研究联系起来并进行系统化,目的是基于特定的代表性模型对视频检测算法进行分类和分析。系统地展示了视频目标检测与类似任务之间的差异和联系,并给出了近40种模型在两个数据集上的评估指标和视频检测性能。最后,讨论了视频目标检测面临的各种应用和挑战。

相似文献

1
New Generation Deep Learning for Video Object Detection: A Survey.用于视频目标检测的新一代深度学习:综述
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3195-3215. doi: 10.1109/TNNLS.2021.3053249. Epub 2022 Aug 3.
2
Research on Intelligent Video Detection of Small Targets Based on Deep Learning Intelligent Algorithm.基于深度学习智能算法的智能小目标视频检测研究。
Comput Intell Neurosci. 2022 Jul 14;2022:3843155. doi: 10.1155/2022/3843155. eCollection 2022.
3
AI-based object detection latest trends in remote sensing, multimedia and agriculture applications.基于人工智能的目标检测在遥感、多媒体和农业应用中的最新趋势。
Front Plant Sci. 2022 Nov 18;13:1041514. doi: 10.3389/fpls.2022.1041514. eCollection 2022.
4
Multi-Stream Attention-Aware Graph Convolution Network for Video Salient Object Detection.用于视频显著目标检测的多流注意力感知图卷积网络
IEEE Trans Image Process. 2021;30:4183-4197. doi: 10.1109/TIP.2021.3070200. Epub 2021 Apr 12.
5
Recent Advances in 3D Object Detection in the Era of Deep Neural Networks: A Survey.深度神经网络时代3D目标检测的最新进展:一项综述。
IEEE Trans Image Process. 2019 Nov 28. doi: 10.1109/TIP.2019.2955239.
6
Object Detection With Deep Learning: A Review.基于深度学习的目标检测研究综述。
IEEE Trans Neural Netw Learn Syst. 2019 Nov;30(11):3212-3232. doi: 10.1109/TNNLS.2018.2876865. Epub 2019 Jan 28.
7
Weakly Supervised Object Localization and Detection: A Survey.弱监督目标定位与检测:综述
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5866-5885. doi: 10.1109/TPAMI.2021.3074313. Epub 2022 Aug 4.
8
Small Object Detection and Tracking: A Comprehensive Review.小目标检测与跟踪:全面综述
Sensors (Basel). 2023 Aug 3;23(15):6887. doi: 10.3390/s23156887.
9
Utility of the Simulated Outcomes Following Carotid Artery Laceration Video Data Set for Machine Learning Applications.模拟颈动脉切开术后结果视频数据集在机器学习应用中的效用。
JAMA Netw Open. 2022 Mar 1;5(3):e223177. doi: 10.1001/jamanetworkopen.2022.3177.
10
Deep learning-based small object detection: A survey.基于深度学习的小目标检测:一项综述。
Math Biosci Eng. 2023 Feb 2;20(4):6551-6590. doi: 10.3934/mbe.2023282.

引用本文的文献

1
Manifold learning for olfactory habituation to strongly fluctuating backgrounds.用于嗅觉习惯化以适应强烈波动背景的流形学习
bioRxiv. 2025 May 30:2025.05.26.656161. doi: 10.1101/2025.05.26.656161.
2
YOLO-Act: Unified Spatiotemporal Detection of Human Actions Across Multi-Frame Sequences.YOLO-Act:跨多帧序列的人类动作统一时空检测
Sensors (Basel). 2025 May 10;25(10):3013. doi: 10.3390/s25103013.
3
Motion feature extraction using magnocellular-inspired spiking neural networks for drone detection.使用受大细胞启发的脉冲神经网络进行无人机检测的运动特征提取
Front Comput Neurosci. 2025 Jan 22;19:1452203. doi: 10.3389/fncom.2025.1452203. eCollection 2025.
4
Causal Inference Meets Deep Learning: A Comprehensive Survey.因果推断与深度学习:全面综述
Research (Wash D C). 2024 Sep 10;7:0467. doi: 10.34133/research.0467. eCollection 2024.
5
SASFF: A Video Synthesis Algorithm for Unstructured Array Cameras Based on Symmetric Auto-Encoding and Scale Feature Fusion.SASFF:一种基于对称自动编码和尺度特征融合的非结构化阵列相机视频合成算法
Sensors (Basel). 2023 Dec 19;24(1):5. doi: 10.3390/s24010005.
6
TeaDiseaseNet: multi-scale self-attentive tea disease detection.茶病网络:多尺度自注意力茶病检测
Front Plant Sci. 2023 Oct 11;14:1257212. doi: 10.3389/fpls.2023.1257212. eCollection 2023.
7
UWV-Yolox: A Deep Learning Model for Underwater Video Object Detection.UWV-Yolox:用于水下视频目标检测的深度学习模型。
Sensors (Basel). 2023 May 18;23(10):4859. doi: 10.3390/s23104859.
8
Neuromorphic-PM: processing-in-pixel-in-memory paradigm for neuromorphic image sensors.神经形态-PM:用于神经形态图像传感器的像素内处理存储范式
Front Neuroinform. 2023 May 4;17:1144301. doi: 10.3389/fninf.2023.1144301. eCollection 2023.
9
A Review of the Technology, Training, and Assessment Methods for the First Real-Time AI-Enhanced Medical Device for Endoscopy.首款用于内窥镜检查的实时人工智能增强型医疗设备的技术、培训及评估方法综述
Bioengineering (Basel). 2023 Mar 24;10(4):404. doi: 10.3390/bioengineering10040404.
10
Dual-View Single-Shot Multibox Detector at Urban Intersections: Settings and Performance Evaluation.城市路口双视图单拍多盒探测器:设置与性能评估。
Sensors (Basel). 2023 Mar 16;23(6):3195. doi: 10.3390/s23063195.