• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于层次化二值自动编码器的自监督视频哈希。

Self-Supervised Video Hashing With Hierarchical Binary Auto-Encoder.

出版信息

IEEE Trans Image Process. 2018 Jul;27(7):3210-3221. doi: 10.1109/TIP.2018.2814344.

DOI:10.1109/TIP.2018.2814344
PMID:29641401
Abstract

Existing video hash functions are built on three isolated stages: frame pooling, relaxed learning, and binarization, which have not adequately explored the temporal order of video frames in a joint binary optimization model, resulting in severe information loss. In this paper, we propose a novel unsupervised video hashing framework dubbed self-supervised video hashing (SSVH), which is able to capture the temporal nature of videos in an end-to-end learning to hash fashion. We specifically address two central problems: 1) how to design an encoder-decoder architecture to generate binary codes for videos and 2) how to equip the binary codes with the ability of accurate video retrieval. We design a hierarchical binary auto-encoder to model the temporal dependencies in videos with multiple granularities, and embed the videos into binary codes with less computations than the stacked architecture. Then, we encourage the binary codes to simultaneously reconstruct the visual content and neighborhood structure of the videos. Experiments on two real-world data sets show that our SSVH method can significantly outperform the state-of-the-art methods and achieve the current best performance on the task of unsupervised video retrieval.

摘要

现有的视频哈希函数建立在三个孤立的阶段上

帧池化、松弛学习和二值化,它们没有在联合二进制优化模型中充分探索视频帧的时间顺序,导致严重的信息丢失。在本文中,我们提出了一种新的无监督视频哈希框架,称为自监督视频哈希(SSVH),它能够以端到端学习哈希的方式捕捉视频的时间特性。我们特别解决了两个核心问题:1)如何设计编码器-解码器架构为视频生成二进制代码,2)如何为二进制代码配备准确视频检索的能力。我们设计了一个分层二进制自动编码器,以多种粒度对视频中的时间依赖性进行建模,并以比堆叠架构更少的计算量将视频嵌入到二进制代码中。然后,我们鼓励二进制代码同时重建视频的视觉内容和邻域结构。在两个真实数据集上的实验表明,我们的 SSVH 方法可以显著优于最先进的方法,并在无监督视频检索任务上达到当前的最佳性能。

相似文献

1
Self-Supervised Video Hashing With Hierarchical Binary Auto-Encoder.基于层次化二值自动编码器的自监督视频哈希。
IEEE Trans Image Process. 2018 Jul;27(7):3210-3221. doi: 10.1109/TIP.2018.2814344.
2
Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.基于层次卷积特征的层次递归神经网络哈希图像检索
IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.
3
Simultaneous Feature Aggregating and Hashing for Compact Binary Code Learning.用于紧凑二进制码学习的同步特征聚合与哈希
IEEE Trans Image Process. 2019 Oct;28(10):4954-4969. doi: 10.1109/TIP.2019.2913509. Epub 2019 May 8.
4
Unsupervised Semantic-Preserving Adversarial Hashing for Image Search.用于图像搜索的无监督语义保持对抗哈希
IEEE Trans Image Process. 2019 Aug;28(8):4032-4044. doi: 10.1109/TIP.2019.2903661. Epub 2019 Mar 13.
5
A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval.基于深度 3D 卷积神经网络的监督视频哈希方法用于大规模视频检索。
Sensors (Basel). 2021 Apr 29;21(9):3094. doi: 10.3390/s21093094.
6
Deep Discrete Supervised Hashing.深度离散监督哈希。
IEEE Trans Image Process. 2018 Dec;27(12):5996-6009. doi: 10.1109/TIP.2018.2864894. Epub 2018 Aug 10.
7
Unsupervised Deep Video Hashing via Balanced Code for Large-Scale Video Retrieval.通过平衡码实现的无监督深度视频哈希用于大规模视频检索
IEEE Trans Image Process. 2018 Nov 19. doi: 10.1109/TIP.2018.2882155.
8
A General Framework for Linear Distance Preserving Hashing.一种线性距离保持哈希的通用框架。
IEEE Trans Image Process. 2018 Feb;27(2):907-922. doi: 10.1109/TIP.2017.2751150. Epub 2017 Sep 11.
9
Large-Scale Unsupervised Hashing with Shared Structure Learning.大规模无监督哈希共享结构学习。
IEEE Trans Cybern. 2015 Sep;45(9):1811-22. doi: 10.1109/TCYB.2014.2360856. Epub 2014 Nov 20.
10
Graph-Collaborated Auto-Encoder Hashing for Multiview Binary Clustering.用于多视图二元聚类的图协作自动编码器哈希
IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):10121-10133. doi: 10.1109/TNNLS.2023.3239033. Epub 2024 Jul 8.

引用本文的文献

1
Computer Vision Applications in Intelligent Transportation Systems: A Survey.计算机视觉在智能交通系统中的应用:综述。
Sensors (Basel). 2023 Mar 8;23(6):2938. doi: 10.3390/s23062938.
2
DNA circuits compatible encoder and demultiplexer based on a single biomolecular platform with DNA strands as outputs.基于单个生物分子平台的 DNA 电路兼容编码器和解复用器,其输出为 DNA 链。
Nucleic Acids Res. 2022 Aug 26;50(15):8431-8440. doi: 10.1093/nar/gkac650.
3
Application of region-based video surveillance in smart cities using deep learning.
基于区域的视频监控在智慧城市中利用深度学习的应用。
Multimed Tools Appl. 2021 Dec 27:1-26. doi: 10.1007/s11042-021-11468-w.
4
A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval.基于深度 3D 卷积神经网络的监督视频哈希方法用于大规模视频检索。
Sensors (Basel). 2021 Apr 29;21(9):3094. doi: 10.3390/s21093094.
5
Self-Supervised Learning to Detect Key Frames in Videos.自监督学习在视频关键帧检测中的应用
Sensors (Basel). 2020 Dec 4;20(23):6941. doi: 10.3390/s20236941.
6
An Overview of Image Caption Generation Methods.图像字幕生成方法概述。
Comput Intell Neurosci. 2020 Jan 9;2020:3062706. doi: 10.1155/2020/3062706. eCollection 2020.