• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过图像合成和域对抗学习实现自监督视觉跟踪

Self-Supervised Visual Tracking via Image Synthesis and Domain Adversarial Learning.

作者信息

Geng Gu, Zhou Sida, Tang Jianing, Zhang Xinming, Liu Qiao, Yuan Di

机构信息

Guangzhou Institute of Technology, Xidian University, Guangzhou 510555, China.

School of Electrical and Information Engineering, Yunnan Minzu University, Kunming 650504, China.

出版信息

Sensors (Basel). 2025 Jul 25;25(15):4621. doi: 10.3390/s25154621.

DOI:10.3390/s25154621
PMID:40807783
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12349480/
Abstract

With the widespread use of sensors in applications such as autonomous driving and intelligent security, stable and efficient target tracking from diverse sensor data has become increasingly important. Self-supervised visual tracking has attracted increasing attention due to its potential to eliminate reliance on costly manual annotations; however, existing methods often train on incomplete object representations, resulting in inaccurate localization during inference. In addition, current methods typically struggle when applied to deep networks. To address these limitations, we propose a novel self-supervised tracking framework based on image synthesis and domain adversarial learning. We first construct a large-scale database of real-world target objects, then synthesize training video pairs by randomly inserting these targets into background frames while applying geometric and appearance transformations to simulate realistic variations. To reduce domain shift introduced by synthetic content, we incorporate a domain classification branch after feature extraction and adopt domain adversarial training to encourage feature alignment between real and synthetic domains. Experimental results on five standard tracking benchmarks demonstrate that our method significantly enhances tracking accuracy compared to existing self-supervised approaches without introducing any additional labeling cost. The proposed framework not only ensures complete target coverage during training but also shows strong scalability to deeper network architectures, offering a practical and effective solution for real-world tracking applications.

摘要

随着传感器在自动驾驶和智能安全等应用中的广泛使用,从各种传感器数据中进行稳定高效的目标跟踪变得越来越重要。自监督视觉跟踪因其有潜力消除对昂贵人工标注的依赖而受到越来越多的关注;然而,现有方法通常在不完整的对象表示上进行训练,导致推理过程中的定位不准确。此外,当前方法在应用于深度网络时通常会遇到困难。为了解决这些限制,我们提出了一种基于图像合成和域对抗学习的新型自监督跟踪框架。我们首先构建一个大规模的真实世界目标对象数据库,然后通过将这些目标随机插入背景帧中,同时应用几何和外观变换来模拟现实变化,从而合成训练视频对。为了减少合成内容引入的域转移,我们在特征提取后加入一个域分类分支,并采用域对抗训练来促进真实域和合成域之间的特征对齐。在五个标准跟踪基准上的实验结果表明,与现有的自监督方法相比,我们的方法在不引入任何额外标注成本的情况下显著提高了跟踪精度。所提出的框架不仅在训练期间确保了目标的完整覆盖,而且对更深的网络架构显示出强大的可扩展性,为实际的跟踪应用提供了一种实用有效的解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/b693068e7e79/sensors-25-04621-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/46b4b1c5337d/sensors-25-04621-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/b4372bd213ad/sensors-25-04621-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/ddba2a5aa546/sensors-25-04621-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/5a24aa8bad2e/sensors-25-04621-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/c0891648e204/sensors-25-04621-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/fe7789649133/sensors-25-04621-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/b693068e7e79/sensors-25-04621-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/46b4b1c5337d/sensors-25-04621-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/b4372bd213ad/sensors-25-04621-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/ddba2a5aa546/sensors-25-04621-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/5a24aa8bad2e/sensors-25-04621-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/c0891648e204/sensors-25-04621-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/fe7789649133/sensors-25-04621-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebe5/12349480/b693068e7e79/sensors-25-04621-g007.jpg

相似文献

1
Self-Supervised Visual Tracking via Image Synthesis and Domain Adversarial Learning.通过图像合成和域对抗学习实现自监督视觉跟踪
Sensors (Basel). 2025 Jul 25;25(15):4621. doi: 10.3390/s25154621.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
A medical image classification method based on self-regularized adversarial learning.基于自正则化对抗学习的医学图像分类方法。
Med Phys. 2024 Nov;51(11):8232-8246. doi: 10.1002/mp.17320. Epub 2024 Jul 30.
4
Integrated neural network framework for multi-object detection and recognition using UAV imagery.用于使用无人机图像进行多目标检测与识别的集成神经网络框架。
Front Neurorobot. 2025 Jul 30;19:1643011. doi: 10.3389/fnbot.2025.1643011. eCollection 2025.
5
Short-Term Memory Impairment短期记忆障碍
6
Comparison of self-administered survey questionnaire responses collected using mobile apps versus other methods.使用移动应用程序与其他方法收集的自我管理调查问卷回复的比较。
Cochrane Database Syst Rev. 2015 Jul 27;2015(7):MR000042. doi: 10.1002/14651858.MR000042.pub2.
7
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
8
Boundary-aware information maximization for self-supervised medical image segmentation.用于自监督医学图像分割的边界感知信息最大化
Med Image Anal. 2024 May;94:103150. doi: 10.1016/j.media.2024.103150. Epub 2024 Mar 28.
9
An open-source deep learning framework for respiratory motion monitoring and volumetric imaging during radiation therapy.一种用于放射治疗期间呼吸运动监测和容积成像的开源深度学习框架。
Med Phys. 2025 Jul;52(7):e18015. doi: 10.1002/mp.18015.
10
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.成年自闭症患者的就业生活经历:系统检索与综述
Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.

本文引用的文献

1
A Target Tracking Method Based on a Pyramid Channel Attention Mechanism.一种基于金字塔通道注意力机制的目标跟踪方法。
Sensors (Basel). 2025 May 20;25(10):3214. doi: 10.3390/s25103214.
2
Enhanced YOLOv5: An Efficient Road Object Detection Method.增强型YOLOv5:一种高效的道路目标检测方法。
Sensors (Basel). 2023 Oct 10;23(20):8355. doi: 10.3390/s23208355.
3
CoSOV1Net: A Cone- and Spatial-Opponent Primary Visual Cortex-Inspired Neural Network for Lightweight Salient Object Detection.CoSOV1Net:一种受视锥和空间对立型初级视觉皮层启发的用于轻量级显著目标检测的神经网络。
Sensors (Basel). 2023 Jul 17;23(14):6450. doi: 10.3390/s23146450.
4
Self-Supervised Tracking via Target-Aware Data Synthesis.通过目标感知数据合成实现自监督跟踪
IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):9186-9197. doi: 10.1109/TNNLS.2022.3231537. Epub 2024 Jul 10.
5
A Video Target Tracking and Correction Model with Blockchain and Robust Feature Location.基于区块链和稳健特征定位的视频目标跟踪与校正模型。
Sensors (Basel). 2023 Feb 22;23(5):2408. doi: 10.3390/s23052408.
6
Self-Supervised Deep Correlation Tracking.自监督深度相关跟踪
IEEE Trans Image Process. 2021;30:976-985. doi: 10.1109/TIP.2020.3037518. Epub 2020 Dec 9.
7
GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild.GOT-10k:用于野外通用目标跟踪的大型高多样性基准数据集。
IEEE Trans Pattern Anal Mach Intell. 2021 May;43(5):1562-1577. doi: 10.1109/TPAMI.2019.2957464. Epub 2021 Apr 1.
8
t-Distributed Stochastic Neighbor Embedding (t-SNE): A tool for eco-physiological transcriptomic analysis.t分布随机邻域嵌入(t-SNE):一种用于生态生理转录组分析的工具。
Mar Genomics. 2020 Jun;51:100723. doi: 10.1016/j.margen.2019.100723. Epub 2019 Nov 26.
9
Synthetic Data Generation for End-to-End Thermal Infrared Tracking.端到端热红外跟踪的合成数据生成。
IEEE Trans Image Process. 2019 Apr;28(4):1837-1850. doi: 10.1109/TIP.2018.2879249. Epub 2018 Nov 2.
10
Object Tracking Benchmark.目标跟踪基准测试。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1834-48. doi: 10.1109/TPAMI.2014.2388226.