Multitask Learning Strategy with Pseudo-Labeling: Face Recognition, Facial Landmark Detection, and Head Pose Estimation.

Affiliations

School of Electrical and Electronic Engineering, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea.

School of Computer Science and Engineering, Kunsan National University, 558 Daehak-ro, Gunsan 54150, Jeollabuk-do, Republic of Korea.

Publication information

Sensors (Basel). 2024 May 18;24(10):3212. doi: 10.3390/s24103212.

DOI:10.3390/s24103212

PMID:38794068

Full text link: https://pmc.ncbi.nlm.nih.gov/articles/PMC11125148/

Abstract

Most facial analysis methods perform well in standardized testing but not in real-world testing. The main reason is that training models cannot easily learn various human features and background noise, especially for facial landmark detection and head pose estimation tasks with limited and noisy training datasets. To alleviate the gap between standardized and real-world testing, we propose a pseudo-labeling technique using a face recognition dataset consisting of various people and background noise. The use of our pseudo-labeled training dataset can help to overcome the lack of diversity among the people in the dataset. Our integrated framework is constructed using complementary multitask learning methods to extract robust features for each task. Furthermore, introducing pseudo-labeling and multitask learning improves the face recognition performance by enabling the learning of pose-invariant features. Our method achieves state-of-the-art (SOTA) or near-SOTA performance on the AFLW2000-3D and BIWI datasets for facial landmark detection and head pose estimation, with competitive face verification performance on the IJB-C test dataset for face recognition. We demonstrate this through a novel testing methodology that categorizes cases as soft, medium, and hard based on the pose values of IJB-C. The proposed method achieves stable performance even when the dataset lacks diverse face identifications.
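The pose-based test split mentioned in the abstract (soft, medium, and hard cases) can be illustrated with a minimal sketch. The abstract does not give the actual cut-offs or the exact pose criterion, so everything below is an assumption: the thresholds `SOFT_MAX_DEG` and `MEDIUM_MAX_DEG`, the choice of the largest absolute Euler angle as the criterion, and the function names `pose_difficulty` and `group_by_difficulty` are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict

# Hypothetical thresholds in degrees; the abstract does not state the real cut-offs.
SOFT_MAX_DEG = 30.0
MEDIUM_MAX_DEG = 60.0


def pose_difficulty(yaw, pitch, roll):
    """Assign a difficulty bucket from head pose angles given in degrees.

    Assumption: the bucket is decided by the largest absolute Euler angle.
    """
    extreme = max(abs(yaw), abs(pitch), abs(roll))
    if extreme <= SOFT_MAX_DEG:
        return "soft"
    if extreme <= MEDIUM_MAX_DEG:
        return "medium"
    return "hard"


def group_by_difficulty(samples):
    """Group sample ids by bucket.

    `samples` is an iterable of (sample_id, (yaw, pitch, roll)) pairs.
    """
    groups = defaultdict(list)
    for sample_id, (yaw, pitch, roll) in samples:
        groups[pose_difficulty(yaw, pitch, roll)].append(sample_id)
    return dict(groups)
```

With a split like this, verification accuracy could be reported per bucket, which is one way to check whether features stay stable as pose becomes more extreme.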

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a5/11125148/ad6a80882ec6/sensors-24-03212-g001.jpg

Similar articles

Task-Oriented Feature-Fused Network With Multivariate Dataset for Joint Face Analysis.

IEEE Trans Cybern. 2020 Mar;50(3):1292-1305. doi: 10.1109/TCYB.2019.2917049. Epub 2019 Jun 5.

Multi-Task Convolutional Neural Network for Pose-Invariant Face Recognition.

IEEE Trans Image Process. 2018 Feb;27(2):964-975. doi: 10.1109/TIP.2017.2765830.

Face-from-Depth for Head Pose Estimation on Depth Images.

IEEE Trans Pattern Anal Mach Intell. 2020 Mar;42(3):596-609. doi: 10.1109/TPAMI.2018.2885472. Epub 2018 Dec 7.

A Symmetrical Siamese Network Framework With Contrastive Learning for Pose-Robust Face Recognition.

IEEE Trans Image Process. 2023;32:5652-5663. doi: 10.1109/TIP.2023.3322593. Epub 2023 Oct 17.

Head Pose Estimation through Keypoints Matching between Reconstructed 3D Face Model and 2D Image.

Sensors (Basel). 2021 Mar 6;21(5):1841. doi: 10.3390/s21051841.

Differential 3D Facial Recognition: Adding 3D to Your State-of-the-Art 2D Method.

IEEE Trans Pattern Anal Mach Intell. 2020 Jul;42(7):1582-1593. doi: 10.1109/TPAMI.2020.2986951. Epub 2020 Apr 13.

HeadFusion: 360 Head Pose Tracking Combining 3D Morphable Model and 3D Reconstruction.

IEEE Trans Pattern Anal Mach Intell. 2018 Nov;40(11):2653-2667. doi: 10.1109/TPAMI.2018.2841403. Epub 2018 May 29.

CapsField: Light Field-Based Face and Expression Recognition in the Wild Using Capsule Routing.

IEEE Trans Image Process. 2021;30:2627-2642. doi: 10.1109/TIP.2021.3054476. Epub 2021 Feb 5.

FASHE: A FrActal Based Strategy for Head Pose Estimation.

IEEE Trans Image Process. 2021;30:3192-3203. doi: 10.1109/TIP.2021.3059409. Epub 2021 Feb 25.

Cited by

Enhancing 3D Face Recognition: Achieving Significant Gains via 2D-Aided Generative Augmentation.

Sensors (Basel). 2025 Aug 14;25(16):5049. doi: 10.3390/s25165049.

References

Face Alignment in Full Pose Range: A 3D Total Solution.

IEEE Trans Pattern Anal Mach Intell. 2019 Jan;41(1):78-92. doi: 10.1109/TPAMI.2017.2778152. Epub 2017 Nov 28.

Multi-PIE.

Proc Int Conf Autom Face Gesture Recognit. 2010 May 1;28(5):807-813. doi: 10.1016/j.imavis.2009.08.002.
