• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于主干-分支集成卷积神经网络的视频人脸识别。

Trunk-Branch Ensemble Convolutional Neural Networks for Video-Based Face Recognition.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):1002-1014. doi: 10.1109/TPAMI.2017.2700390. Epub 2017 May 2.

DOI:10.1109/TPAMI.2017.2700390
PMID:28475048
Abstract

Human faces in surveillance videos often suffer from severe image blur, dramatic pose variations, and occlusion. In this paper, we propose a comprehensive framework based on Convolutional Neural Networks (CNN) to overcome challenges in video-based face recognition (VFR). First, to learn blur-robust face representations, we artificially blur training data composed of clear still images to account for a shortfall in real-world video training data. Using training data composed of both still images and artificially blurred data, CNN is encouraged to learn blur-insensitive features automatically. Second, to enhance robustness of CNN features to pose variations and occlusion, we propose a Trunk-Branch Ensemble CNN model (TBE-CNN), which extracts complementary information from holistic face images and patches cropped around facial components. TBE-CNN is an end-to-end model that extracts features efficiently by sharing the low- and middle-level convolutional layers between the trunk and branch networks. Third, to further promote the discriminative power of the representations learnt by TBE-CNN, we propose an improved triplet loss function. Systematic experiments justify the effectiveness of the proposed techniques. Most impressively, TBE-CNN achieves state-of-the-art performance on three popular video face databases: PaSC, COX Face, and YouTube Faces. With the proposed techniques, we also obtain the first place in the BTAS 2016 Video Person Recognition Evaluation.

摘要

在监控视频中,人脸经常受到严重的图像模糊、剧烈的姿势变化和遮挡的影响。在本文中,我们提出了一个基于卷积神经网络(CNN)的综合框架,以克服基于视频的人脸识别(VFR)中的挑战。首先,为了学习抗模糊的人脸表示,我们人为地模糊了由清晰的静态图像组成的训练数据,以弥补真实世界视频训练数据的不足。使用由静态图像和人为模糊数据组成的训练数据,鼓励 CNN 自动学习对模糊不敏感的特征。其次,为了增强 CNN 特征对姿势变化和遮挡的鲁棒性,我们提出了一种主干-分支集成 CNN 模型(TBE-CNN),该模型从整体人脸图像和裁剪自面部组件周围的补丁中提取互补信息。TBE-CNN 是一个端到端的模型,通过在主干网络和分支网络之间共享低和中层次的卷积层,有效地提取特征。第三,为了进一步提高 TBE-CNN 学习的表示的判别能力,我们提出了一种改进的三元组损失函数。系统的实验验证了所提出技术的有效性。最令人印象深刻的是,TBE-CNN 在三个流行的视频人脸数据库:PaSC、COX Face 和 YouTube Faces 上实现了最先进的性能。通过所提出的技术,我们还在 BTAS 2016 视频人物识别评估中获得了第一名。

相似文献

1
Trunk-Branch Ensemble Convolutional Neural Networks for Video-Based Face Recognition.基于主干-分支集成卷积神经网络的视频人脸识别。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):1002-1014. doi: 10.1109/TPAMI.2017.2700390. Epub 2017 May 2.
2
Multi-Task Convolutional Neural Network for Pose-Invariant Face Recognition.多任务卷积神经网络的姿态不变人脸识别。
IEEE Trans Image Process. 2018 Feb;27(2):964-975. doi: 10.1109/TIP.2017.2765830.
3
Two-Stream Transformer Networks for Video-Based Face Alignment.基于双流Transformer 网络的视频人脸对齐。
IEEE Trans Pattern Anal Mach Intell. 2018 Nov;40(11):2546-2554. doi: 10.1109/TPAMI.2017.2734779. Epub 2017 Aug 1.
4
Cross Euclidean-to-Riemannian Metric Learning with Application to Face Recognition from Video.基于欧式到黎曼度量学习的视频人脸识别方法
IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):2827-2840. doi: 10.1109/TPAMI.2017.2776154. Epub 2017 Nov 22.
5
Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition. Wasserstein CNN:用于近红外-可见光人脸识别的不变特征学习。
IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1761-1773. doi: 10.1109/TPAMI.2018.2842770. Epub 2018 Jun 1.
6
Face Recognition Using the SR-CNN Model.基于 SR-CNN 模型的人脸识别
Sensors (Basel). 2018 Dec 3;18(12):4237. doi: 10.3390/s18124237.
7
EAC-Net: Deep Nets with Enhancing and Cropping for Facial Action Unit Detection.EAC-Net:用于面部动作单元检测的增强和裁剪的深度网络。
IEEE Trans Pattern Anal Mach Intell. 2018 Nov;40(11):2583-2596. doi: 10.1109/TPAMI.2018.2791608. Epub 2018 Jan 10.
8
Probabilistic Elastic Part Model: A Pose-Invariant Representation for Real-World Face Verification.概率弹性部件模型:用于真实世界人脸验证的姿态不变表示。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):918-930. doi: 10.1109/TPAMI.2017.2695183. Epub 2017 Apr 18.
9
Blur and illumination robust face recognition via set-theoretic characterization.基于集合论刻画的抗模糊和抗光照人脸识别。
IEEE Trans Image Process. 2013 Apr;22(4):1362-72. doi: 10.1109/TIP.2012.2228498. Epub 2012 Nov 29.
10
Multi-task pose-invariant face recognition.多任务不变姿态人脸识别。
IEEE Trans Image Process. 2015 Mar;24(3):980-93. doi: 10.1109/TIP.2015.2390959. Epub 2015 Jan 12.

引用本文的文献

1
LittleFaceNet: A Small-Sized Face Recognition Method Based on RetinaFace and AdaFace.小脸网:一种基于视网膜脸和自适应脸的小型人脸识别方法。
J Imaging. 2025 Jan 13;11(1):24. doi: 10.3390/jimaging11010024.
2
Historical Blurry Video-Based Face Recognition.基于历史模糊视频的人脸识别。
J Imaging. 2024 Sep 20;10(9):236. doi: 10.3390/jimaging10090236.
3
Machine learning approach for the prediction of macrosomia.用于预测巨大儿的机器学习方法。
Vis Comput Ind Biomed Art. 2024 Aug 27;7(1):22. doi: 10.1186/s42492-024-00172-9.
4
Contextual emotion detection in images using deep learning.使用深度学习进行图像中的情境情感检测。
Front Artif Intell. 2024 Jun 17;7:1386753. doi: 10.3389/frai.2024.1386753. eCollection 2024.
5
Improved likelihood ratios for face recognition in surveillance video by multimodal feature pairing.通过多模态特征配对提高监控视频中人脸识别的似然比。
Forensic Sci Int Synerg. 2024 Feb 29;8:100458. doi: 10.1016/j.fsisyn.2024.100458. eCollection 2024.
6
Heterogeneous Fusion of Camera and mmWave Radar Sensor of Optimizing Convolutional Neural Networks for Parking Meter System.相机和毫米波雷达传感器的异构融合优化卷积神经网络的停车计费系统。
Sensors (Basel). 2023 Apr 21;23(8):4159. doi: 10.3390/s23084159.
7
A deep ensemble learning method for single finger-vein identification.一种用于单手指静脉识别的深度集成学习方法。
Front Neurorobot. 2023 Jan 11;16:1065099. doi: 10.3389/fnbot.2022.1065099. eCollection 2022.
8
Applying the Properties of Neurons in Machine Learning: A Brain-like Neural Model with Interactive Stimulation for Data Classification.神经元特性在机器学习中的应用:一种具有交互式刺激的数据分类类脑神经模型。
Brain Sci. 2022 Sep 3;12(9):1191. doi: 10.3390/brainsci12091191.
9
A deep neural network for the classification of epileptic seizures using hierarchical attention mechanism.一种使用分层注意力机制进行癫痫发作分类的深度神经网络。
Soft comput. 2022;26(11):5389-5397. doi: 10.1007/s00500-022-07122-8. Epub 2022 Apr 16.
10
Glass Refraction Distortion Object Detection via Abstract Features.基于抽象特征的玻璃折射失真目标检测。
Comput Intell Neurosci. 2022 Mar 24;2022:5456818. doi: 10.1155/2022/5456818. eCollection 2022.