• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过逐像素自适应特征增强实现卫星视频中的高精度多目标跟踪

High-Precision Multi-Object Tracking in Satellite Videos via Pixel-Wise Adaptive Feature Enhancement.

作者信息

Wan Gang, Su Zhijuan, Wu Yitian, Guo Ningbo, Cong Dianwei, Wei Zhanji, Liu Wei, Wang Guoping

机构信息

School of Space Information, Space Engineering University, Beijing 101407, China.

State Key Laboratory of Geo-Information Engineering, Xi'an 710054, China.

出版信息

Sensors (Basel). 2024 Oct 9;24(19):6489. doi: 10.3390/s24196489.

DOI:10.3390/s24196489
PMID:39409528
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11479360/
Abstract

In this paper, we focus on the multi-target tracking (MOT) task in satellite videos. To achieve efficient and accurate tracking, we propose a transformer-distillation-based end-to-end joint detection and tracking (JDT) method. Specifically, (1) considering that targets in satellite videos usually have small scales and are shot from a bird's-eye view, we propose a pixel-wise transformer-based feature distillation module through which useful object representations are learned via pixel-wise distillation using a strong teacher detection network; (2) targets in satellite videos, such as airplanes, ships, and vehicles, usually have similar appearances, so we propose a temperature-controllable key feature learning objective function, and by highlighting the learning of similar features during distilling, the tracking accuracy for such objects can be further improved; (3) we propose a method that is based on an end-to-end network but simultaneously learns from a highly precise teacher network and tracking head during training so that the tracking accuracy of the end-to-end network can be improved via distillation without compromising efficiency. The experimental results on three recently released publicly available datasets demonstrated the superior performance of the proposed method for satellite videos. The proposed method achieved over 90% overall tracking performance on the AIR-MOT dataset.

摘要

在本文中,我们专注于卫星视频中的多目标跟踪(MOT)任务。为实现高效且准确的跟踪,我们提出一种基于Transformer蒸馏的端到端联合检测与跟踪(JDT)方法。具体而言,(1)考虑到卫星视频中的目标通常具有小尺度且是从鸟瞰视角拍摄的,我们提出一种基于逐像素Transformer的特征蒸馏模块,通过该模块利用强大的教师检测网络通过逐像素蒸馏来学习有用的目标表示;(2)卫星视频中的目标,如飞机、船只和车辆,通常具有相似的外观,因此我们提出一种温度可控的关键特征学习目标函数,并且通过在蒸馏过程中突出对相似特征的学习,可以进一步提高对此类目标的跟踪精度;(3)我们提出一种基于端到端网络的方法,但在训练期间同时从高精度的教师网络和跟踪头进行学习,以便通过蒸馏提高端到端网络的跟踪精度而不影响效率。在最近发布的三个公开可用数据集上的实验结果证明了所提方法在卫星视频方面的卓越性能。所提方法在AIR-MOT数据集上实现了超过90%的整体跟踪性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/2eaeabb6cf52/sensors-24-06489-g006a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/2363d5bb05b9/sensors-24-06489-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/278365194d95/sensors-24-06489-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/86853b92f0d3/sensors-24-06489-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/09ecc1135eef/sensors-24-06489-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/66ef94ae2bda/sensors-24-06489-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/2eaeabb6cf52/sensors-24-06489-g006a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/2363d5bb05b9/sensors-24-06489-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/278365194d95/sensors-24-06489-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/86853b92f0d3/sensors-24-06489-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/09ecc1135eef/sensors-24-06489-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/66ef94ae2bda/sensors-24-06489-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f06/11479360/2eaeabb6cf52/sensors-24-06489-g006a.jpg

相似文献

1
High-Precision Multi-Object Tracking in Satellite Videos via Pixel-Wise Adaptive Feature Enhancement.通过逐像素自适应特征增强实现卫星视频中的高精度多目标跟踪
Sensors (Basel). 2024 Oct 9;24(19):6489. doi: 10.3390/s24196489.
2
Transformer-Based Multiple-Object Tracking via Anchor-Based-Query and Template Matching.基于锚点查询和模板匹配的基于Transformer的多目标跟踪
Sensors (Basel). 2023 Dec 30;24(1):229. doi: 10.3390/s24010229.
3
HRSiam: High-Resolution Siamese Network, Towards Space-Borne Satellite Video Tracking.HRSiam:高分辨率暹罗网络,用于星载卫星视频跟踪
IEEE Trans Image Process. 2021;30:3056-3068. doi: 10.1109/TIP.2020.3045634. Epub 2021 Feb 24.
4
Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking With Transformer.探索用于基于Transformer的3D点云目标跟踪的点-鸟瞰图融合方法
IEEE Trans Pattern Anal Mach Intell. 2024 Sep;46(9):5921-5935. doi: 10.1109/TPAMI.2024.3373693. Epub 2024 Aug 6.
5
Multiple Traffic Target Tracking with Spatial-Temporal Affinity Network.基于时空关联网络的多目标跟踪
Comput Intell Neurosci. 2022 May 23;2022:9693767. doi: 10.1155/2022/9693767. eCollection 2022.
6
Pixel-Guided Association for Multi-Object Tracking.用于多目标跟踪的像素引导关联
Sensors (Basel). 2022 Nov 18;22(22):8922. doi: 10.3390/s22228922.
7
Effective Multi-Object Tracking via Global Object Models and Object Constraint Learning.通过全局对象模型和对象约束学习实现有效的多目标跟踪。
Sensors (Basel). 2022 Oct 18;22(20):7943. doi: 10.3390/s22207943.
8
Multi-Target Tracking Based on a Combined Attention Mechanism and Occlusion Sensing in a Behavior-Analysis System.基于行为分析系统中的联合注意力机制和遮挡感知的多目标跟踪。
Sensors (Basel). 2023 Mar 8;23(6):2956. doi: 10.3390/s23062956.
9
EMAT: Efficient feature fusion network for visual tracking via optimized multi-head attention.EMAT:通过优化多头注意力进行视觉跟踪的高效特征融合网络。
Neural Netw. 2024 Apr;172:106110. doi: 10.1016/j.neunet.2024.106110. Epub 2024 Jan 6.
10
An end-to-end tracking method for polyp detectors in colonoscopy videos.结肠镜视频中息肉检测器的端到端跟踪方法。
Artif Intell Med. 2022 Sep;131:102363. doi: 10.1016/j.artmed.2022.102363. Epub 2022 Jul 14.

本文引用的文献

1
Adaptive Contourlet Fusion Clustering for SAR Image Change Detection.用于合成孔径雷达(SAR)图像变化检测的自适应Contourlet融合聚类
IEEE Trans Image Process. 2022;31:2295-2308. doi: 10.1109/TIP.2022.3154922. Epub 2022 Mar 11.
2
Deep Affinity Network for Multiple Object Tracking.用于多目标跟踪的深度亲和网络
IEEE Trans Pattern Anal Mach Intell. 2021 Jan;43(1):104-119. doi: 10.1109/TPAMI.2019.2929520. Epub 2020 Dec 4.
3
Learning without Forgetting.学过不忘。
IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):2935-2947. doi: 10.1109/TPAMI.2017.2773081. Epub 2017 Nov 14.
4
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.