• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于带径向基函数损失的深度卷积神经网络的面部视频检索

Face Video Retrieval Based on the Deep CNN With RBF Loss.

作者信息

Choi Young Rok, Kil Rhee Man

出版信息

IEEE Trans Image Process. 2021;30:1015-1029. doi: 10.1109/TIP.2020.3040847. Epub 2020 Dec 9.

DOI:10.1109/TIP.2020.3040847
PMID:33275580
Abstract

This paper presents a novel framework to extract highly compact and discriminative features for face video retrieval tasks using the deep convolutional neural network (CNN). The face video retrieval task is to find the videos containing the face of a specific person from a database with a face image or a face video of the same person as a query. A key challenge is to extract discriminative features with small storage space from face videos with large intra-class variations caused by different angle, illumination, and facial expression. In recent years, the CNN-based binary hashing and metric learning methods showed notable progress in image/video retrieval tasks. However, the existing CNN-based binary hashing and metric learning have limitations in terms of inevitable information loss and storage inefficiency, respectively. To cope with these problems, the proposed framework consists of two parts: first, a novel loss function using a radial basis function kernel (RBF Loss) is introduced to train a neural network to generate compact and discriminative high-level features, and secondly, an optimized quantization using a logistic function (Logistic Quantization) is suggested to convert a real-valued feature to a 1-byte integer with the minimum information loss. Through the face video retrieval experiments on a challenging TV series data set (ICT-TV), it is demonstrated that the proposed framework outperforms the existing state-of-the-art feature extraction methods. Furthermore, the effectiveness of RBF loss was also demonstrated through the image classification and retrieval experiments on the CIFAR-10 and Fashion-MNIST data sets with LeNet-5.

摘要

本文提出了一种新颖的框架,用于使用深度卷积神经网络(CNN)为面部视频检索任务提取高度紧凑且具有判别力的特征。面部视频检索任务是从数据库中找到包含特定人物面部的视频,该数据库以同一人物的面部图像或面部视频作为查询。一个关键挑战是从因不同角度、光照和面部表情而具有较大类内变化的面部视频中提取具有小存储空间的判别性特征。近年来,基于CNN的二进制哈希和度量学习方法在图像/视频检索任务中取得了显著进展。然而,现有的基于CNN的二进制哈希和度量学习分别在不可避免的信息损失和存储效率方面存在局限性。为了解决这些问题,所提出的框架由两部分组成:首先,引入一种使用径向基函数核的新颖损失函数(RBF损失)来训练神经网络,以生成紧凑且具有判别力的高级特征;其次,建议使用逻辑函数的优化量化(逻辑量化)将实值特征转换为具有最小信息损失的1字节整数。通过在具有挑战性的电视剧数据集(ICT-TV)上进行的面部视频检索实验,证明了所提出的框架优于现有的最先进特征提取方法。此外,通过使用LeNet-5在CIFAR-10和Fashion-MNIST数据集上进行的图像分类和检索实验,也证明了RBF损失的有效性。

相似文献

1
Face Video Retrieval Based on the Deep CNN With RBF Loss.基于带径向基函数损失的深度卷积神经网络的面部视频检索
IEEE Trans Image Process. 2021;30:1015-1029. doi: 10.1109/TIP.2020.3040847. Epub 2020 Dec 9.
2
A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval.基于深度 3D 卷积神经网络的监督视频哈希方法用于大规模视频检索。
Sensors (Basel). 2021 Apr 29;21(9):3094. doi: 10.3390/s21093094.
3
Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.基于层次卷积特征的层次递归神经网络哈希图像检索
IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.
4
Discriminative Deep Quantization Hashing for Face Image Retrieval.用于面部图像检索的判别式深度量化哈希
IEEE Trans Neural Netw Learn Syst. 2018 Dec;29(12):6154-6162. doi: 10.1109/TNNLS.2018.2816743. Epub 2018 May 3.
5
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals.用于可扩展图像-文本和视频-文本检索的深度语义多模态哈希网络
IEEE Trans Neural Netw Learn Syst. 2023 Apr;34(4):1838-1851. doi: 10.1109/TNNLS.2020.2997020. Epub 2023 Apr 4.
6
Triplet Deep Hashing with Joint Supervised Loss Based on Deep Neural Networks.基于深度神经网络的联合监督损失三重深度哈希。
Comput Intell Neurosci. 2019 Oct 9;2019:8490364. doi: 10.1155/2019/8490364. eCollection 2019.
7
Deep supervised hashing for gait retrieval.深度监督哈希用于步态检索。
F1000Res. 2021 Oct 12;10:1038. doi: 10.12688/f1000research.51368.2. eCollection 2021.
8
Deep Heterogeneous Hashing for Face Video Retrieval.
IEEE Trans Image Process. 2019 Sep 16. doi: 10.1109/TIP.2019.2940683.
9
Trunk-Branch Ensemble Convolutional Neural Networks for Video-Based Face Recognition.基于主干-分支集成卷积神经网络的视频人脸识别。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):1002-1014. doi: 10.1109/TPAMI.2017.2700390. Epub 2017 May 2.
10
A novel biomedical image indexing and retrieval system via deep preference learning.一种基于深度偏好学习的新型生物医学图像索引和检索系统。
Comput Methods Programs Biomed. 2018 May;158:53-69. doi: 10.1016/j.cmpb.2018.02.003. Epub 2018 Feb 6.

引用本文的文献

1
Aircraft range fuel prediction study based on WPD with IAPO optimized BiLSTM-KAN model.基于带IAPO优化BiLSTM-KAN模型的WPD的飞机航程燃油预测研究
Sci Rep. 2025 Apr 10;15(1):12231. doi: 10.1038/s41598-025-97264-0.
2
Deep-Learning-Based Analysis of Electronic Skin Sensing Data.基于深度学习的电子皮肤传感数据分析
Sensors (Basel). 2025 Mar 6;25(5):1615. doi: 10.3390/s25051615.
3
An improved sparrow search algorithm and CNN-BiLSTM neural network for predicting sea level height.一种用于预测海平面高度的改进麻雀搜索算法与卷积神经网络-双向长短期记忆神经网络
Sci Rep. 2024 Feb 24;14(1):4560. doi: 10.1038/s41598-024-55266-4.