• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多任务学习策略与伪标签:人脸识别、面部地标检测和头部姿势估计。

Multitask Learning Strategy with Pseudo-Labeling: Face Recognition, Facial Landmark Detection, and Head Pose Estimation.

机构信息

School of Electrical and Electronic Engineering, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea.

School of Computer Science and Engineering, Kunsan National University, 558 Daehak-ro, Gunsan 54150, Jeollabuk-do, Republic of Korea.

出版信息

Sensors (Basel). 2024 May 18;24(10):3212. doi: 10.3390/s24103212.

DOI:10.3390/s24103212
PMID:38794068
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11125148/
Abstract

Most facial analysis methods perform well in standardized testing but not in real-world testing. The main reason is that training models cannot easily learn various human features and background noise, especially for facial landmark detection and head pose estimation tasks with limited and noisy training datasets. To alleviate the gap between standardized and real-world testing, we propose a pseudo-labeling technique using a face recognition dataset consisting of various people and background noise. The use of our pseudo-labeled training dataset can help to overcome the lack of diversity among the people in the dataset. Our integrated framework is constructed using complementary multitask learning methods to extract robust features for each task. Furthermore, introducing pseudo-labeling and multitask learning improves the face recognition performance by enabling the learning of pose-invariant features. Our method achieves state-of-the-art (SOTA) or near-SOTA performance on the AFLW2000-3D and BIWI datasets for facial landmark detection and head pose estimation, with competitive face verification performance on the IJB-C test dataset for face recognition. We demonstrate this through a novel testing methodology that categorizes cases as soft, medium, and hard based on the pose values of IJB-C. The proposed method achieves stable performance even when the dataset lacks diverse face identifications.

摘要

大多数面部分析方法在标准化测试中表现良好,但在实际测试中表现不佳。主要原因是训练模型难以轻松学习各种人类特征和背景噪声,特别是对于面部地标检测和头部姿势估计任务,这些任务的训练数据集有限且存在噪声。为了缩小标准化测试和实际测试之间的差距,我们提出了一种使用包含各种人和背景噪声的人脸识别数据集的伪标签技术。使用我们的伪标签训练数据集可以帮助克服数据集中人员多样性不足的问题。我们的集成框架使用互补的多任务学习方法构建,以提取每个任务的稳健特征。此外,引入伪标签和多任务学习可以通过学习不变特征来提高人脸识别性能。我们的方法在 AFLW2000-3D 和 BIWI 数据集上的面部地标检测和头部姿势估计方面实现了最先进(SOTA)或接近 SOTA 的性能,在 IJB-C 测试数据集上的人脸识别方面具有竞争力的人脸验证性能。我们通过一种新的测试方法证明了这一点,该方法根据 IJB-C 的姿势值将情况分为软、中、硬三类。即使数据集缺乏多样化的人脸识别,该方法也能保持稳定的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/a48aadc1c282/sensors-24-03212-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/ad6a80882ec6/sensors-24-03212-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/d907a6d92b28/sensors-24-03212-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/db3fae50159f/sensors-24-03212-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/ee14bc5126bb/sensors-24-03212-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/df21fc20f4c7/sensors-24-03212-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/4282657b4047/sensors-24-03212-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/5e7f08ba5701/sensors-24-03212-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/f023c35926c4/sensors-24-03212-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/26b096ff5144/sensors-24-03212-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/bed516f74d13/sensors-24-03212-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/a48aadc1c282/sensors-24-03212-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/ad6a80882ec6/sensors-24-03212-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/d907a6d92b28/sensors-24-03212-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/db3fae50159f/sensors-24-03212-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/ee14bc5126bb/sensors-24-03212-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/df21fc20f4c7/sensors-24-03212-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/4282657b4047/sensors-24-03212-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/5e7f08ba5701/sensors-24-03212-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/f023c35926c4/sensors-24-03212-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/26b096ff5144/sensors-24-03212-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/bed516f74d13/sensors-24-03212-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/a48aadc1c282/sensors-24-03212-g011.jpg

相似文献

1
Multitask Learning Strategy with Pseudo-Labeling: Face Recognition, Facial Landmark Detection, and Head Pose Estimation.多任务学习策略与伪标签:人脸识别、面部地标检测和头部姿势估计。
Sensors (Basel). 2024 May 18;24(10):3212. doi: 10.3390/s24103212.
2
Task-Oriented Feature-Fused Network With Multivariate Dataset for Joint Face Analysis.面向任务的多变量数据集特征融合网络联合人脸分析
IEEE Trans Cybern. 2020 Mar;50(3):1292-1305. doi: 10.1109/TCYB.2019.2917049. Epub 2019 Jun 5.
3
Multi-Task Convolutional Neural Network for Pose-Invariant Face Recognition.多任务卷积神经网络的姿态不变人脸识别。
IEEE Trans Image Process. 2018 Feb;27(2):964-975. doi: 10.1109/TIP.2017.2765830.
4
Face-from-Depth for Head Pose Estimation on Depth Images.基于深度图像的人脸朝向估计
IEEE Trans Pattern Anal Mach Intell. 2020 Mar;42(3):596-609. doi: 10.1109/TPAMI.2018.2885472. Epub 2018 Dec 7.
5
A Symmetrical Siamese Network Framework With Contrastive Learning for Pose-Robust Face Recognition.基于对比学习的对称连体网络框架的姿态鲁棒人脸识别。
IEEE Trans Image Process. 2023;32:5652-5663. doi: 10.1109/TIP.2023.3322593. Epub 2023 Oct 17.
6
Head Pose Estimation through Keypoints Matching between Reconstructed 3D Face Model and 2D Image.基于重建 3D 人脸模型和 2D 图像关键点匹配的头部姿势估计
Sensors (Basel). 2021 Mar 6;21(5):1841. doi: 10.3390/s21051841.
7
Differential 3D Facial Recognition: Adding 3D to Your State-of-the-Art 2D Method.三维差分人脸识别:为您的最先进二维方法添加三维技术。
IEEE Trans Pattern Anal Mach Intell. 2020 Jul;42(7):1582-1593. doi: 10.1109/TPAMI.2020.2986951. Epub 2020 Apr 13.
8
HeadFusion: 360 Head Pose Tracking Combining 3D Morphable Model and 3D Reconstruction.HeadFusion:结合 3D 可变形模型和 3D 重建的 360 度头部姿势跟踪。
IEEE Trans Pattern Anal Mach Intell. 2018 Nov;40(11):2653-2667. doi: 10.1109/TPAMI.2018.2841403. Epub 2018 May 29.
9
CapsField: Light Field-Based Face and Expression Recognition in the Wild Using Capsule Routing.基于胶囊路由的野外光场人脸和表情识别
IEEE Trans Image Process. 2021;30:2627-2642. doi: 10.1109/TIP.2021.3054476. Epub 2021 Feb 5.
10
FASHE: A FrActal Based Strategy for Head Pose Estimation.FASHE:一种基于分形的头部姿势估计策略。
IEEE Trans Image Process. 2021;30:3192-3203. doi: 10.1109/TIP.2021.3059409. Epub 2021 Feb 25.

引用本文的文献

1
Enhancing 3D Face Recognition: Achieving Significant Gains via 2D-Aided Generative Augmentation.增强三维人脸识别:通过二维辅助生成增强实现显著提升。
Sensors (Basel). 2025 Aug 14;25(16):5049. doi: 10.3390/s25165049.

本文引用的文献

1
Face Alignment in Full Pose Range: A 3D Total Solution.全姿态范围内的面部对齐:一种三维整体解决方案。
IEEE Trans Pattern Anal Mach Intell. 2019 Jan;41(1):78-92. doi: 10.1109/TPAMI.2017.2778152. Epub 2017 Nov 28.
2
Multi-PIE.多姿态、光照和表情数据库
Proc Int Conf Autom Face Gesture Recognit. 2010 May 1;28(5):807-813. doi: 10.1016/j.imavis.2009.08.002.