基于 RGB-IR 相机的 3D 注视估计。

3D Gaze Estimation Using RGB-IR Cameras.

机构信息

The Department of Information Systems, University of Haifa, Mount Carmel, Haifa 3498838, Israel.

出版信息

Sensors (Basel). 2022 Dec 29;23(1):381. doi: 10.3390/s23010381.

DOI:10.3390/s23010381

PMID:36616978

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9823916/

Abstract

In this paper, we present a framework for 3D gaze estimation intended to identify the user's focus of attention in a corneal imaging system. The framework uses a headset that consists of three cameras, a scene camera and two eye cameras: an IR camera and an RGB camera. The IR camera is used to continuously and reliably track the pupil and the RGB camera is used to acquire corneal images of the same eye. Deep learning algorithms are trained to detect the pupil in IR and RGB images and to compute a per user 3D model of the eye in real time. Once the 3D model is built, the 3D gaze direction is computed starting from the eyeball center and passing through the pupil center to the outside world. This model can also be used to transform the pupil position detected in the IR image into its corresponding position in the RGB image and to detect the gaze direction in the corneal image. This technique circumvents the problem of pupil detection in RGB images, which is especially difficult and unreliable when the scene is reflected in the corneal images. In our approach, the auto-calibration process is transparent and unobtrusive. Users do not have to be instructed to look at specific objects to calibrate the eye tracker. They need only to act and gaze normally. The framework was evaluated in a user study in realistic settings and the results are promising. It achieved a very low 3D gaze error (2.12°) and very high accuracy in acquiring corneal images (intersection over union-IoU = 0.71). The framework may be used in a variety of real-world mobile scenarios (indoors, indoors near windows and outdoors) with high accuracy.

摘要

在本文中，我们提出了一种用于 3D 注视估计的框架，旨在识别角膜成像系统中用户的关注焦点。该框架使用一个由三个摄像头组成的头戴式设备，包括一个场景摄像头和两个眼摄像头：一个红外摄像头和一个 RGB 摄像头。红外摄像头用于连续、可靠地跟踪瞳孔，而 RGB 摄像头用于获取同一眼睛的角膜图像。深度学习算法经过训练可用于检测红外和 RGB 图像中的瞳孔，并实时计算每个用户的眼部 3D 模型。一旦建立了 3D 模型，就可以从眼球中心开始，通过瞳孔中心到外部世界计算 3D 注视方向。该模型还可以用于将在红外图像中检测到的瞳孔位置转换为其在 RGB 图像中的对应位置，并检测角膜图像中的注视方向。该技术解决了在 RGB 图像中检测瞳孔的问题，当场景反射在角膜图像中时，该问题尤其困难且不可靠。在我们的方法中，自动校准过程是透明且不引人注目的。用户无需被指示看向特定对象来校准眼动追踪器。他们只需正常行动和注视即可。该框架在现实设置中的用户研究中进行了评估，结果很有前景。它实现了非常低的 3D 注视误差（2.12°）和非常高的角膜图像采集精度（交并比-IoU = 0.71）。该框架可用于各种现实世界的移动场景（室内、室内靠近窗户和室外），具有很高的精度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ece/9823916/c0e0c665af3b/sensors-23-00381-g001.jpg

相似文献

3D Gaze Estimation Using RGB-IR Cameras.基于 RGB-IR 相机的 3D 注视估计。

Sensors (Basel). 2022 Dec 29;23(1):381. doi: 10.3390/s23010381.

Calibration-Free Mobile Eye-Tracking Using Corneal Imaging.基于角膜成像的无标定移动眼动追踪。

Sensors (Basel). 2024 Feb 15;24(4):1237. doi: 10.3390/s24041237.

A free geometry model-independent neural eye-gaze tracking system.一个自由几何模型独立的神经眼动追踪系统。

J Neuroeng Rehabil. 2012 Nov 16;9:82. doi: 10.1186/1743-0003-9-82.

Head-free, remote eye-gaze detection system based on pupil-corneal reflection method with easy calibration using two stereo-calibrated video cameras.基于瞳孔角膜反射法的免头戴、远程眼动追踪系统，使用两个经过立体标定的摄像机，可轻松进行标定。

IEEE Trans Biomed Eng. 2013 Oct;60(10):2952-60. doi: 10.1109/TBME.2013.2266478. Epub 2013 Jun 6.

Estimation of Gaze Detection Accuracy Using the Calibration Information-Based Fuzzy System.基于校准信息的模糊系统对注视检测准确率的估计

Sensors (Basel). 2016 Jan 5;16(1):60. doi: 10.3390/s16010060.

High-Accuracy 3D Gaze Estimation with Efficient Recalibration for Head-Mounted Gaze Tracking Systems.高效重标定的高精度 3D 注视估计用于头戴式注视跟踪系统。

Sensors (Basel). 2022 Jun 8;22(12):4357. doi: 10.3390/s22124357.

General theory of remote gaze estimation using the pupil center and corneal reflections.利用瞳孔中心和角膜反射进行远程注视估计的一般理论。

IEEE Trans Biomed Eng. 2006 Jun;53(6):1124-33. doi: 10.1109/TBME.2005.863952.

Noise estimation for head-mounted 3D binocular eye tracking using Pupil Core eye-tracking goggles.使用瞳孔核心眼动追踪护目镜对头戴式 3D 双目眼动追踪进行噪声估计。

Behav Res Methods. 2024 Jan;56(1):53-79. doi: 10.3758/s13428-023-02150-0. Epub 2023 Jun 27.

Long-Range Gaze Tracking System for Large Movements.用于大动作的远距离注视跟踪系统

IEEE Trans Biomed Eng. 2013 Dec;60(12):3432-40. doi: 10.1109/TBME.2013.2266413. Epub 2013 Jun 6.

A real-time gaze position estimation method based on a 3-D eye model.一种基于三维眼睛模型的实时注视位置估计方法。

IEEE Trans Syst Man Cybern B Cybern. 2007 Feb;37(1):199-212. doi: 10.1109/tsmcb.2006.883426.

引用本文的文献

Dual Focus-3D: A Hybrid Deep Learning Approach for Robust 3D Gaze Estimation.双焦点3D：一种用于稳健3D注视估计的混合深度学习方法。

Sensors (Basel). 2025 Jun 30;25(13):4086. doi: 10.3390/s25134086.

Calibration-Free Mobile Eye-Tracking Using Corneal Imaging.基于角膜成像的无标定移动眼动追踪。

Sensors (Basel). 2024 Feb 15;24(4):1237. doi: 10.3390/s24041237.

Eye-Gaze Controlled Wheelchair Based on Deep Learning.基于深度学习的眼控轮椅。

Sensors (Basel). 2023 Jul 7;23(13):6239. doi: 10.3390/s23136239.

Gaze Estimation Based on Convolutional Structure and Sliding Window-Based Attention Mechanism.基于卷积结构和基于滑动窗口的注意力机制的注视估计

Sensors (Basel). 2023 Jul 7;23(13):6226. doi: 10.3390/s23136226.

Computer Vision in Human Analysis: From Face and Body to Clothes.计算机视觉在人体分析中的应用：从人脸和人体到衣物。

Sensors (Basel). 2023 Jun 6;23(12):5378. doi: 10.3390/s23125378.

本文引用的文献

DeepVOG: Open-source pupil segmentation and gaze estimation in neuroscience using deep learning.DeepVOG：利用深度学习在神经科学中进行开源瞳孔分割和注视估计。

J Neurosci Methods. 2019 Aug 1;324:108307. doi: 10.1016/j.jneumeth.2019.05.016. Epub 2019 Jun 6.

MPIIGaze: Real-World Dataset and Deep Appearance-Based Gaze Estimation.马克斯·普朗克智能系统研究所注视数据集：真实世界数据集与基于深度外观的注视估计

IEEE Trans Pattern Anal Mach Intell. 2019 Jan;41(1):162-175. doi: 10.1109/TPAMI.2017.2778103. Epub 2017 Nov 28.

Deep Learning-Based Gaze Detection System for Automobile Drivers Using a NIR Camera Sensor.基于深度学习的汽车驾驶员凝视检测系统，使用近红外相机传感器。

Sensors (Basel). 2018 Feb 3;18(2):456. doi: 10.3390/s18020456.

Application of eye tracking in medicine: A survey, research issues and challenges.眼动追踪在医学中的应用：综述、研究问题与挑战。

Comput Med Imaging Graph. 2018 Apr;65:176-190. doi: 10.1016/j.compmedimag.2017.04.006. Epub 2017 May 30.

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN：基于区域建议网络的实时目标检测。

IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.

Centration axis in refractive surgery.屈光手术中的中心轴

Eye Vis (Lond). 2015 Feb 24;2:4. doi: 10.1186/s40662-015-0014-6. eCollection 2015.

Application of eye-tracking in the testing of drivers: A review of research.眼动追踪技术在驾驶员测试中的应用：研究综述

Int J Occup Med Environ Health. 2015;28(6):941-54. doi: 10.13075/ijomeh.1896.00317.

SET: a pupil detection method using sinusoidal approximation.SET：一种使用正弦近似的瞳孔检测方法。

Front Neuroeng. 2015 Apr 9;8:4. doi: 10.3389/fneng.2015.00004. eCollection 2015.

Variations in eyeball diameters of the healthy adults.健康成年人眼球直径的差异。

J Ophthalmol. 2014;2014:503645. doi: 10.1155/2014/503645. Epub 2014 Nov 5.

Gaze and eye-tracking solutions for psychological research.用于心理学研究的注视和眼动追踪解决方案。

Cogn Process. 2012 Aug;13 Suppl 1:S261-5. doi: 10.1007/s10339-012-0499-z.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于 RGB-IR 相机的 3D 注视估计。

3D Gaze Estimation Using RGB-IR Cameras.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献