• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于三阶段级联架构的暹罗滑动窗口网络算法用于目标跟踪。

Three-stage cascade architecture-based siamese sliding window network algorithm for object tracking.

作者信息

Yang Zheng, Liu Kaiwen, Li Quanlong, Hou Yandong, Yan Zhiyu

机构信息

School of Electrical Engineering, Yellow River Conservancy Technical Institute, Dongjing street, Kaifeng, 475004, Henan, China.

School of Artificial Intelligence, Henan University, Mingli street, Zhengzhou, 450000, Henan, China.

出版信息

Heliyon. 2025 Jan 6;11(2):e41612. doi: 10.1016/j.heliyon.2024.e41612. eCollection 2025 Jan 30.

DOI:10.1016/j.heliyon.2024.e41612
PMID:39897889
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11782978/
Abstract

To enhance the correlation of feature information and enrich the pattern of cross-correlation metrics, we propose the Siam ST algorithm, which is based on a three-stage cascade (TSC) architecture. The sliding window is introduced in the last three layers of convolution blocks, which can obtain the global information of images and fully capture the target feature. The TSC structure is developed by using the regional proposal network. It makes the features of the current frame interact with the previous frame. As a result, our method has a high effect of robustness and association features extraction. Therefore, our ablation experiments are conducted on the VOT2016 dataset, and comparison experiments are conducted on four datasets, VOT2018, LaSOT, Tracking Net, and UAV123. Our proposed algorithm demonstrates a significant improvement compared to SiamRPN++ across four datasets.

摘要

为了增强特征信息的相关性并丰富互相关指标的模式,我们提出了基于三级级联(TSC)架构的暹罗时空(Siam ST)算法。在卷积块的最后三层中引入滑动窗口,其可以获取图像的全局信息并充分捕捉目标特征。TSC结构是通过使用区域提议网络开发的。它使当前帧的特征与前一帧进行交互。因此,我们的方法在鲁棒性和关联特征提取方面具有很高的效果。因此,我们在VOT2016数据集上进行了消融实验,并在四个数据集VOT2018、LaSOT、Tracking Net和UAV123上进行了对比实验。与SiamRPN++相比,我们提出的算法在四个数据集上均有显著改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/fb1123a33ad4/gr012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/8a69af9da579/gr001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/3f1616873961/gr002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/8292522fd720/gr003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/a7a0aa5d909b/gr004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/01700b11179c/gr005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/223a94174049/gr006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/cf75f0376794/gr007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/642dffccbcef/gr008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/761dd37e576a/gr009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/275f23bac364/gr010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/e04cb10d29df/gr011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/fb1123a33ad4/gr012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/8a69af9da579/gr001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/3f1616873961/gr002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/8292522fd720/gr003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/a7a0aa5d909b/gr004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/01700b11179c/gr005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/223a94174049/gr006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/cf75f0376794/gr007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/642dffccbcef/gr008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/761dd37e576a/gr009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/275f23bac364/gr010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/e04cb10d29df/gr011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df1e/11782978/fb1123a33ad4/gr012.jpg

相似文献

1
Three-stage cascade architecture-based siamese sliding window network algorithm for object tracking.基于三阶段级联架构的暹罗滑动窗口网络算法用于目标跟踪。
Heliyon. 2025 Jan 6;11(2):e41612. doi: 10.1016/j.heliyon.2024.e41612. eCollection 2025 Jan 30.
2
DSiam-CnK: A CBAM- and KCF-Enabled Deep Siamese Region Proposal Network for Human Tracking in Dynamic and Occluded Scenes.DSiam-CnK:一种用于动态和遮挡场景中人体跟踪的启用CBAM和KCF的深度暹罗区域提议网络。
Sensors (Basel). 2024 Dec 21;24(24):8176. doi: 10.3390/s24248176.
3
Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks.利用连体网络学习视觉目标跟踪中目标的几何信息。
Sensors (Basel). 2021 Nov 23;21(23):7790. doi: 10.3390/s21237790.
4
Robust Template Adjustment Siamese Network for Object Visual Tracking.用于目标视觉跟踪的鲁棒模板调整暹罗网络
Sensors (Basel). 2021 Feb 20;21(4):1466. doi: 10.3390/s21041466.
5
HROM: Learning High-Resolution Representation and Object-Aware Masks for Visual Object Tracking.HROM:用于视觉目标跟踪的学习高分辨率表示和对象感知掩模。
Sensors (Basel). 2020 Aug 26;20(17):4807. doi: 10.3390/s20174807.
6
DP-Siam: Dynamic Policy Siamese Network for Robust Object Tracking.DP-Siam:用于鲁棒目标跟踪的动态策略暹罗网络
IEEE Trans Image Process. 2019 Sep 25. doi: 10.1109/TIP.2019.2942506.
7
Siamese anchor-free object tracking with multiscale spatial attentions.基于多尺度空间注意力的暹罗无锚点目标跟踪
Sci Rep. 2021 Nov 25;11(1):22908. doi: 10.1038/s41598-021-02095-4.
8
SiamCAN: Real-Time Visual Tracking Based on Siamese Center-Aware Network.暹罗CAN:基于暹罗中心感知网络的实时视觉跟踪
IEEE Trans Image Process. 2021;30:3597-3609. doi: 10.1109/TIP.2021.3060905. Epub 2021 Mar 17.
9
Lightweight Siamese Network with Global Correlation for Single-Object Tracking.用于单目标跟踪的具有全局相关性的轻量级暹罗网络。
Sensors (Basel). 2024 Dec 21;24(24):8171. doi: 10.3390/s24248171.
10
Siamese network with a depthwise over-parameterized convolutional layer for visual tracking.用于视觉跟踪的带有深度过度参数化卷积层的暹罗网络。
PLoS One. 2022 Aug 31;17(8):e0273690. doi: 10.1371/journal.pone.0273690. eCollection 2022.

引用本文的文献

1
A weight clustering algorithm based on sliding window model for stream data.
Sci Rep. 2025 Jul 2;15(1):22521. doi: 10.1038/s41598-025-96696-y.

本文引用的文献

1
Inverse sparse tracker with a locally weighted distance metric.基于局部加权距离度量的逆稀疏跟踪器。
IEEE Trans Image Process. 2015 Sep;24(9):2646-57. doi: 10.1109/TIP.2015.2427518. Epub 2015 Apr 28.