• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过高质量对象链接实现视频中的目标检测

Object Detection in Videos by High Quality Object Linking.

作者信息

Tang Peng, Wang Chunyu, Wang Xinggang, Liu Wenyu, Zeng Wenjun, Wang Jingdong

出版信息

IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1272-1278. doi: 10.1109/TPAMI.2019.2910529. Epub 2019 Apr 11.

DOI:10.1109/TPAMI.2019.2910529
PMID:30990176
Abstract

Compared with object detection in static images, object detection in videos is more challenging due to degraded image qualities. An effective way to address this problem is to exploit temporal contexts by linking the same object across video to form tubelets and aggregating classification scores in the tubelets. In this paper, we focus on obtaining high quality object linking results for better classification. Unlike previous methods that link objects by checking boxes between neighboring frames, we propose to link in the same frame. To achieve this goal, we extend prior methods in following aspects: (1) a cuboid proposal network that extracts spatio-temporal candidate cuboids which bound the movement of objects; (2) a short tubelet detection network that detects short tubelets in short video segments; (3) a short tubelet linking algorithm that links temporally-overlapping short tubelets to form long tubelets. Experiments on the ImageNet VID dataset show that our method outperforms both the static image detector and the previous state of the art. In particular, our method improves results by 8.8 percent over the static image detector for fast moving objects.

摘要

与静态图像中的目标检测相比,视频中的目标检测由于图像质量下降而更具挑战性。解决此问题的有效方法是通过跨视频链接同一目标以形成小目标管并汇总小目标管中的分类分数来利用时间上下文。在本文中,我们专注于获得高质量的目标链接结果以进行更好的分类。与之前通过检查相邻帧之间的框来链接目标的方法不同,我们建议在同一帧中进行链接。为实现这一目标,我们在以下方面扩展了先前的方法:(1)一个长方体提议网络,用于提取界定目标运动的时空候选长方体;(2)一个短目标管检测网络,用于在短视频片段中检测短目标管;(3)一种短目标管链接算法,用于链接时间上重叠的短目标管以形成长目标管。在ImageNet VID数据集上的实验表明,我们的方法优于静态图像检测器和先前的现有技术。特别是,对于快速移动的目标,我们的方法比静态图像检测器的结果提高了8.8%。

相似文献

1
Object Detection in Videos by High Quality Object Linking.通过高质量对象链接实现视频中的目标检测
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1272-1278. doi: 10.1109/TPAMI.2019.2910529. Epub 2019 Apr 11.
2
In Defense of Clip-Based Video Relation Detection.
IEEE Trans Image Process. 2024;33:2759-2769. doi: 10.1109/TIP.2024.3379935. Epub 2024 Apr 9.
3
Attention-Guided Disentangled Feature Aggregation for Video Object Detection.面向视频目标检测的注意力引导解缠特征聚合。
Sensors (Basel). 2022 Nov 7;22(21):8583. doi: 10.3390/s22218583.
4
Progressive Frame-Proposal Mining for Weakly Supervised Video Object Detection.用于弱监督视频目标检测的渐进式帧提议挖掘
IEEE Trans Image Process. 2024;33:1560-1573. doi: 10.1109/TIP.2024.3364536. Epub 2024 Feb 27.
5
Temporally Identity-Aware SSD With Attentional LSTM.基于注意力 LSTM 的具有时间身份感知能力的 SSD。
IEEE Trans Cybern. 2020 Jun;50(6):2674-2686. doi: 10.1109/TCYB.2019.2894261. Epub 2019 Feb 11.
6
YoTube: Searching Action Proposal Via Recurrent and Static Regression Networks.YoTube:通过递归和静态回归网络进行搜索动作提案。
IEEE Trans Image Process. 2018 Jun;27(6):2609-2622. doi: 10.1109/TIP.2018.2806279.
7
Deep Spatial-Temporal Joint Feature Representation for Video Object Detection.用于视频目标检测的深度时空联合特征表示。
Sensors (Basel). 2018 Mar 4;18(3):774. doi: 10.3390/s18030774.
8
Video Salient Object Detection Using Spatiotemporal Deep Features.基于时空深度特征的视频显著目标检测
IEEE Trans Image Process. 2018 Oct;27(10):5002-5015. doi: 10.1109/TIP.2018.2849860.
9
Segment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation.Segment-Tube:基于逐帧分割的非修剪视频中的时空动作定位。
Sensors (Basel). 2018 May 22;18(5):1657. doi: 10.3390/s18051657.
10
Geometric correction of atmospheric turbulence-degraded video containing moving objects.含运动物体的大气湍流退化视频的几何校正
Opt Express. 2015 Feb 23;23(4):5091-101. doi: 10.1364/OE.23.005091.

引用本文的文献

1
Real-Time Abnormal Object Detection for Video Surveillance in Smart Cities.智慧城市视频监控中的实时异常目标检测。
Sensors (Basel). 2022 May 19;22(10):3862. doi: 10.3390/s22103862.
2
Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review.视频对象与人类动作检测中的视觉特征学习:系统综述
Micromachines (Basel). 2021 Dec 31;13(1):72. doi: 10.3390/mi13010072.