事件流中的人脸（FES）：用于事件相机的带注释人脸数据集

Faces in Event Streams (FES): An Annotated Face Dataset for Event Cameras.

作者信息

Bissarinova Ulzhan, Rakhimzhanova Tomiris, Kenzhebalin Daulet, Varol Huseyin Atakan

机构信息

Institute of Smart Systems and Artificial Intelligence, Nazarbayev University, Astana 010000, Kazakhstan.

出版信息

Sensors (Basel). 2024 Feb 22;24(5):1409. doi: 10.3390/s24051409.

DOI:10.3390/s24051409

PMID:38474947

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10935361/

Abstract

The use of event-based cameras in computer vision is a growing research direction. However, despite the existing research on face detection using the event camera, a substantial gap persists in the availability of a large dataset featuring annotations for faces and facial landmarks on event streams, thus hampering the development of applications in this direction. In this work, we address this issue by publishing the first large and varied dataset (Faces in Event Streams) with a duration of 689 min for face and facial landmark detection in direct event-based camera outputs. In addition, this article presents 12 models trained on our dataset to predict bounding box and facial landmark coordinates with an mAP score of more than 90%. We also performed a demonstration of real-time detection with an event-based camera using our models.

摘要

基于事件的相机在计算机视觉中的应用是一个不断发展的研究方向。然而，尽管已有关于使用事件相机进行面部检测的研究，但在事件流上具有面部和面部地标注释的大型数据集的可用性方面仍存在很大差距，从而阻碍了该方向应用的发展。在这项工作中，我们通过发布第一个大型多样的数据集（事件流中的面部）来解决这个问题，该数据集时长689分钟，用于直接基于事件的相机输出中的面部和面部地标检测。此外，本文展示了在我们的数据集上训练的12个模型，这些模型用于预测边界框和面部地标坐标，平均精度均值（mAP）得分超过90%。我们还使用我们的模型进行了基于事件相机的实时检测演示。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ea7/10935361/52a4017d8b72/sensors-24-01409-g001.jpg

相似文献

Faces in Event Streams (FES): An Annotated Face Dataset for Event Cameras.

Sensors (Basel). 2024 Feb 22;24(5):1409. doi: 10.3390/s24051409.

Development of Real-Time Landmark-Based Emotion Recognition CNN for Masked Faces.

Sensors (Basel). 2022 Nov 11;22(22):8704. doi: 10.3390/s22228704.

Three-D Wide Faces (3DWF): Facial Landmark Detection and 3D Reconstruction over a New RGB⁻D Multi-Camera Dataset.

Sensors (Basel). 2019 Mar 4;19(5):1103. doi: 10.3390/s19051103.

CIFAR10-DVS: An Event-Stream Dataset for Object Classification.

Front Neurosci. 2017 May 30;11:309. doi: 10.3389/fnins.2017.00309. eCollection 2017.

Real-time face & eye tracking and blink detection using event cameras.

Neural Netw. 2021 Sep;141:87-97. doi: 10.1016/j.neunet.2021.03.019. Epub 2021 Mar 27.

Backlight and dim space object detection based on a novel event camera.

PeerJ Comput Sci. 2024 Jul 12;10:e2192. doi: 10.7717/peerj-cs.2192. eCollection 2024.

Event-Based Robotic Grasping Detection With Neuromorphic Vision Sensor and Event-Grasping Dataset.

Front Neurorobot. 2020 Oct 8;14:51. doi: 10.3389/fnbot.2020.00051. eCollection 2020.

Face Pose Alignment with Event Cameras.

Sensors (Basel). 2020 Dec 10;20(24):7079. doi: 10.3390/s20247079.

Joint Multi-view Face Alignment in the Wild.

IEEE Trans Image Process. 2019 Feb 13. doi: 10.1109/TIP.2019.2899267.

Visual and Thermal Image Processing for Facial Specific Landmark Detection to Infer Emotions in a Child-Robot Interaction.

Sensors (Basel). 2019 Jun 26;19(13):2844. doi: 10.3390/s19132844.

本文引用的文献

Isolated single sound lip-reading using a frame-based camera and event-based camera.

Front Artif Intell. 2023 Jan 11;5:1070964. doi: 10.3389/frai.2022.1070964. eCollection 2022.

Facial imaging to screen for fetal alcohol spectrum disorder: A scoping review.

Alcohol Clin Exp Res. 2022 Jul;46(7):1166-1180. doi: 10.1111/acer.14875. Epub 2022 Jun 6.

Multispectral Face Recognition Using Transfer Learning with Adaptation of Domain Specific Units.

Sensors (Basel). 2021 Jul 1;21(13):4520. doi: 10.3390/s21134520.

Real-time face & eye tracking and blink detection using event cameras.

Neural Netw. 2021 Sep;141:87-97. doi: 10.1016/j.neunet.2021.03.019. Epub 2021 Mar 27.

Face Pose Alignment with Event Cameras.

Sensors (Basel). 2020 Dec 10;20(24):7079. doi: 10.3390/s20247079.

Event-Based Face Detection and Tracking Using the Dynamics of Eye Blinks.

Front Neurosci. 2020 Jul 27;14:587. doi: 10.3389/fnins.2020.00587. eCollection 2020.

Event-Based Vision: A Survey.

IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):154-180. doi: 10.1109/TPAMI.2020.3008413. Epub 2021 Dec 7.

High Speed and High Dynamic Range Video with an Event Camera.

IEEE Trans Pattern Anal Mach Intell. 2021 Jun;43(6):1964-1980. doi: 10.1109/TPAMI.2019.2963386. Epub 2021 May 11.

What Is a Face? Critical Features for Face Detection.

Perception. 2019 May;48(5):437-446. doi: 10.1177/0301006619838734. Epub 2019 Apr 2.

The fusiform face area: a module in human extrastriate cortex specialized for face perception.

J Neurosci. 1997 Jun 1;17(11):4302-11. doi: 10.1523/JNEUROSCI.17-11-04302.1997.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

事件流中的人脸（FES）：用于事件相机的带注释人脸数据集

Faces in Event Streams (FES): An Annotated Face Dataset for Event Cameras.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献