• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度 3D 卷积神经网络的监督视频哈希方法用于大规模视频检索。

A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval.

机构信息

Shanghai Engineering Research Center of Assistive Devices, School of Medical Instrument and Food Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China.

School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China.

出版信息

Sensors (Basel). 2021 Apr 29;21(9):3094. doi: 10.3390/s21093094.

DOI:10.3390/s21093094
PMID:33946745
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8124307/
Abstract

Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and high precision is very necessary. Therefore, content-based video retrieval (CBVR) has aroused the interest of many researchers. A typical CBVR system mainly contains the following two essential parts: video feature extraction and similarity comparison. Feature extraction of video is very challenging, previous video retrieval methods are mostly based on extracting features from single video frames, while resulting the loss of temporal information in the videos. Hashing methods are extensively used in multimedia information retrieval due to its retrieval efficiency, but most of them are currently only applied to image retrieval. In order to solve these problems in video retrieval, we build an end-to-end framework called deep supervised video hashing (DSVH), which employs a 3D convolutional neural network (CNN) to obtain spatial-temporal features of videos, then train a set of hash functions by supervised hashing to transfer the video features into binary space and get the compact binary codes of videos. Finally, we use triplet loss for network training. We conduct a lot of experiments on three public video datasets UCF-101, JHMDB and HMDB-51, and the results show that the proposed method has advantages over many state-of-the-art video retrieval methods. Compared with the DVH method, the mAP value of UCF-101 dataset is improved by 9.3%, and the minimum improvement on JHMDB dataset is also increased by 0.3%. At the same time, we also demonstrate the stability of the algorithm in the HMDB-51 dataset.

摘要

最近,随着手机等摄像工具的普及和各类短视频平台的兴起,大量视频随时被上传到互联网上,因此非常需要一个检索速度快、精度高的视频检索系统。因此,基于内容的视频检索(CBVR)引起了许多研究人员的兴趣。一个典型的 CBVR 系统主要包含以下两个基本部分:视频特征提取和相似性比较。视频特征提取非常具有挑战性,之前的视频检索方法大多基于从单个视频帧中提取特征,而导致视频中的时间信息丢失。哈希方法由于其检索效率而在多媒体信息检索中得到广泛应用,但目前大多数方法仅应用于图像检索。为了解决视频检索中的这些问题,我们构建了一个名为深度监督视频哈希(DSVH)的端到端框架,该框架使用 3D 卷积神经网络(CNN)获取视频的时空特征,然后通过监督哈希训练一组哈希函数,将视频特征转换到二进制空间,并获得视频的紧凑二进制码。最后,我们使用三元组损失进行网络训练。我们在三个公共视频数据集 UCF-101、JHMDB 和 HMDB-51 上进行了大量实验,结果表明,所提出的方法优于许多最新的视频检索方法。与 DVH 方法相比,UCF-101 数据集的 mAP 值提高了 9.3%,JHMDB 数据集的最小改进也提高了 0.3%。同时,我们还在 HMDB-51 数据集上展示了算法的稳定性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/fbb22e450987/sensors-21-03094-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/638adfe0b3b7/sensors-21-03094-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/f5bbd677b994/sensors-21-03094-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/6beae9827e0d/sensors-21-03094-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/77afd1c8bbf5/sensors-21-03094-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/a85410bbc0f9/sensors-21-03094-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/c53b66a063d4/sensors-21-03094-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/fbb22e450987/sensors-21-03094-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/638adfe0b3b7/sensors-21-03094-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/f5bbd677b994/sensors-21-03094-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/6beae9827e0d/sensors-21-03094-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/77afd1c8bbf5/sensors-21-03094-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/a85410bbc0f9/sensors-21-03094-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/c53b66a063d4/sensors-21-03094-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec63/8124307/fbb22e450987/sensors-21-03094-g007.jpg

相似文献

1
A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval.基于深度 3D 卷积神经网络的监督视频哈希方法用于大规模视频检索。
Sensors (Basel). 2021 Apr 29;21(9):3094. doi: 10.3390/s21093094.
2
Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.基于层次卷积特征的层次递归神经网络哈希图像检索
IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.
3
Triplet Deep Hashing with Joint Supervised Loss Based on Deep Neural Networks.基于深度神经网络的联合监督损失三重深度哈希。
Comput Intell Neurosci. 2019 Oct 9;2019:8490364. doi: 10.1155/2019/8490364. eCollection 2019.
4
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals.用于可扩展图像-文本和视频-文本检索的深度语义多模态哈希网络
IEEE Trans Neural Netw Learn Syst. 2023 Apr;34(4):1838-1851. doi: 10.1109/TNNLS.2020.2997020. Epub 2023 Apr 4.
5
Face Video Retrieval Based on the Deep CNN With RBF Loss.基于带径向基函数损失的深度卷积神经网络的面部视频检索
IEEE Trans Image Process. 2021;30:1015-1029. doi: 10.1109/TIP.2020.3040847. Epub 2020 Dec 9.
6
Deep supervised hashing for gait retrieval.深度监督哈希用于步态检索。
F1000Res. 2021 Oct 12;10:1038. doi: 10.12688/f1000research.51368.2. eCollection 2021.
7
Self-Supervised Video Hashing With Hierarchical Binary Auto-Encoder.基于层次化二值自动编码器的自监督视频哈希。
IEEE Trans Image Process. 2018 Jul;27(7):3210-3221. doi: 10.1109/TIP.2018.2814344.
8
Deep triplet hashing network for case-based medical image retrieval.基于实例的医学图像检索的深度三重哈希网络。
Med Image Anal. 2021 Apr;69:101981. doi: 10.1016/j.media.2021.101981. Epub 2021 Feb 3.
9
An Efficient Supervised Deep Hashing Method for Image Retrieval.一种用于图像检索的高效监督深度哈希方法。
Entropy (Basel). 2022 Oct 7;24(10):1425. doi: 10.3390/e24101425.
10
Object-Location-Aware Hashing for Multi-Label Image Retrieval via Automatic Mask Learning.基于自动掩模学习的多标签图像检索的目标-位置感知哈希。
IEEE Trans Image Process. 2018 Sep;27(9):4490-4502. doi: 10.1109/TIP.2018.2839522.

引用本文的文献

1
OPHash: learning of organ and pathology context-sensitive hashing for medical image retrieval.OPHash:用于医学图像检索的器官与病理上下文敏感哈希学习
J Med Imaging (Bellingham). 2025 Jan;12(1):017503. doi: 10.1117/1.JMI.12.1.017503. Epub 2025 Feb 19.
2
Quantum machine learning enhanced laser speckle analysis for precise speed prediction.量子机器学习增强激光散斑分析以实现精确速度预测。
Sci Rep. 2024 Nov 12;14(1):27665. doi: 10.1038/s41598-024-78884-4.

本文引用的文献

1
Enhancing Sketch-Based Image Retrieval by CNN Semantic Re-ranking.通过卷积神经网络语义重排增强基于草图的图像检索
IEEE Trans Cybern. 2020 Jul;50(7):3330-3342. doi: 10.1109/TCYB.2019.2894498. Epub 2019 Mar 15.
2
Unsupervised Deep Video Hashing via Balanced Code for Large-Scale Video Retrieval.通过平衡码实现的无监督深度视频哈希用于大规模视频检索
IEEE Trans Image Process. 2018 Nov 19. doi: 10.1109/TIP.2018.2882155.
3
Self-Supervised Video Hashing With Hierarchical Binary Auto-Encoder.基于层次化二值自动编码器的自监督视频哈希。
IEEE Trans Image Process. 2018 Jul;27(7):3210-3221. doi: 10.1109/TIP.2018.2814344.
4
Unsupervised Topic Hypergraph Hashing for Efficient Mobile Image Retrieval.用于高效移动图像检索的无监督主题超图哈希
IEEE Trans Cybern. 2017 Nov;47(11):3941-3954. doi: 10.1109/TCYB.2016.2591068. Epub 2016 Oct 21.
5
Iterative quantization: a Procrustean approach to learning binary codes for large-scale image retrieval.迭代量化:一种用于大规模图像检索的学习二进制代码的普罗克汝斯忒斯方法。
IEEE Trans Pattern Anal Mach Intell. 2013 Dec;35(12):2916-29. doi: 10.1109/TPAMI.2012.193.
6
3D convolutional neural networks for human action recognition.三维卷积神经网络的人体动作识别。
IEEE Trans Pattern Anal Mach Intell. 2013 Jan;35(1):221-31. doi: 10.1109/TPAMI.2012.59.