• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于低质量网络摄像头图像的特定人注视估计。

Person-Specific Gaze Estimation from Low-Quality Webcam Images.

机构信息

Department of Applied Informatics, Silesian University of Technology, 44-100 Gliwice, Poland.

Faculty of Computer and Information Science, University of Ljubljana, Večna Pot 113, SI-1000 Ljubljana, Slovenia.

出版信息

Sensors (Basel). 2023 Apr 20;23(8):4138. doi: 10.3390/s23084138.

DOI:10.3390/s23084138
PMID:37112478
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10147084/
Abstract

Gaze estimation is an established research problem in computer vision. It has various applications in real life, from human-computer interactions to health care and virtual reality, making it more viable for the research community. Due to the significant success of deep learning techniques in other computer vision tasks-for example, image classification, object detection, object segmentation, and object tracking-deep learning-based gaze estimation has also received more attention in recent years. This paper uses a convolutional neural network (CNN) for person-specific gaze estimation. The person-specific gaze estimation utilizes a single model trained for one individual user, contrary to the commonly-used generalized models trained on multiple people's data. We utilized only low-quality images directly collected from a standard desktop webcam, so our method can be applied to any computer system equipped with such a camera without additional hardware requirements. First, we used the web camera to collect a dataset of face and eye images. Then, we tested different combinations of CNN parameters, including the learning and dropout rates. Our findings show that building a person-specific eye-tracking model produces better results with a selection of good hyperparameters when compared to universal models that are trained on multiple users' data. In particular, we achieved the best results for the left eye with 38.20 MAE (Mean Absolute Error) in pixels, the right eye with 36.01 MAE, both eyes combined with 51.18 MAE, and the whole face with 30.09 MAE, which is equivalent to approximately 1.45 degrees for the left eye, 1.37 degrees for the right eye, 1.98 degrees for both eyes combined, and 1.14 degrees for full-face images.

摘要

注视估计是计算机视觉中的一个成熟研究问题。它在现实生活中有各种应用,从人机交互到医疗保健和虚拟现实,因此对于研究社区来说更具可行性。由于深度学习技术在其他计算机视觉任务中的巨大成功——例如,图像分类、目标检测、目标分割和目标跟踪——基于深度学习的注视估计近年来也受到了更多关注。本文使用卷积神经网络(CNN)进行特定于人的注视估计。特定于人的注视估计使用针对单个用户训练的单个模型,而不是针对多人数据训练的常用通用模型。我们仅使用直接从标准台式网络摄像头收集的低质量图像,因此我们的方法可以应用于任何配备此类摄像头的计算机系统,而无需额外的硬件要求。首先,我们使用网络摄像头收集了一组人脸和眼部图像数据集。然后,我们测试了 CNN 参数的不同组合,包括学习率和辍学率。我们的研究结果表明,与针对多个用户数据训练的通用模型相比,构建特定于人的眼动追踪模型并选择良好的超参数可以产生更好的结果。特别是,我们在左眼获得了 38.20 MAE(平均绝对误差)的最佳结果,在右眼获得了 36.01 MAE,双眼组合获得了 51.18 MAE,整个面部获得了 30.09 MAE,这相当于左眼约为 1.45 度,右眼为 1.37 度,双眼组合为 1.98 度,全脸图像为 1.14 度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e7/10147084/559ffd527e96/sensors-23-04138-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e7/10147084/c73d8567a7ad/sensors-23-04138-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e7/10147084/34d7fc39dfc5/sensors-23-04138-g003a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e7/10147084/559ffd527e96/sensors-23-04138-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e7/10147084/c73d8567a7ad/sensors-23-04138-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e7/10147084/34d7fc39dfc5/sensors-23-04138-g003a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e7/10147084/559ffd527e96/sensors-23-04138-g004.jpg

相似文献

1
Person-Specific Gaze Estimation from Low-Quality Webcam Images.基于低质量网络摄像头图像的特定人注视估计。
Sensors (Basel). 2023 Apr 20;23(8):4138. doi: 10.3390/s23084138.
2
An integrated neural network model for eye-tracking during human-computer interaction.一种用于人机交互过程中眼动追踪的集成神经网络模型。
Math Biosci Eng. 2023 Jun 21;20(8):13974-13988. doi: 10.3934/mbe.2023622.
3
Webcam-based gaze estimation for computer screen interaction.基于网络摄像头的用于计算机屏幕交互的注视估计
Front Robot AI. 2024 Apr 2;11:1369566. doi: 10.3389/frobt.2024.1369566. eCollection 2024.
4
LiteGaze: Neural architecture search for efficient gaze estimation.LiteGaze:用于高效注视估计的神经结构搜索。
PLoS One. 2023 May 1;18(5):e0284814. doi: 10.1371/journal.pone.0284814. eCollection 2023.
5
Hybrid Eye-Tracking on a Smartphone with CNN Feature Extraction and an Infrared 3D Model.智能手机上的混合眼动追踪:基于 CNN 特征提取和红外 3D 模型。
Sensors (Basel). 2020 Jan 19;20(2):543. doi: 10.3390/s20020543.
6
When I Look into Your Eyes: A Survey on Computer Vision Contributions for Human Gaze Estimation and Tracking.当我凝视你的双眼:计算机视觉在人类视线估计和追踪中的应用综述。
Sensors (Basel). 2020 Jul 3;20(13):3739. doi: 10.3390/s20133739.
7
Multiview Multitask Gaze Estimation With Deep Convolutional Neural Networks.基于深度卷积神经网络的多视图多任务注视估计。
IEEE Trans Neural Netw Learn Syst. 2019 Oct;30(10):3010-3023. doi: 10.1109/TNNLS.2018.2865525. Epub 2018 Sep 3.
8
Model-Based 3D Gaze Estimation Using a TOF Camera.基于模型的使用飞行时间相机的3D注视估计
Sensors (Basel). 2024 Feb 6;24(4):1070. doi: 10.3390/s24041070.
9
Improved Feature-Based Gaze Estimation Using Self-Attention Module and Synthetic Eye Images.基于自注意力模块和合成眼图像的改进特征注视估计。
Sensors (Basel). 2022 May 26;22(11):4026. doi: 10.3390/s22114026.
10
Convolutional Neural Network-Based Technique for Gaze Estimation on Mobile Devices.基于卷积神经网络的移动设备注视估计技术
Front Artif Intell. 2022 Jan 26;4:796825. doi: 10.3389/frai.2021.796825. eCollection 2021.

引用本文的文献

1
Vision toolkit part 1. Neurophysiological foundations and experimental paradigms in eye-tracking research: a review.视觉工具包第1部分。眼动追踪研究中的神经生理学基础与实验范式:综述。
Front Physiol. 2025 Jun 19;16:1571534. doi: 10.3389/fphys.2025.1571534. eCollection 2025.

本文引用的文献

1
MPIIGaze: Real-World Dataset and Deep Appearance-Based Gaze Estimation.马克斯·普朗克智能系统研究所注视数据集:真实世界数据集与基于深度外观的注视估计
IEEE Trans Pattern Anal Mach Intell. 2019 Jan;41(1):162-175. doi: 10.1109/TPAMI.2017.2778103. Epub 2017 Nov 28.
2
Eye Movements During Everyday Behavior Predict Personality Traits.日常行为中的眼动可预测人格特质。
Front Hum Neurosci. 2018 Apr 13;12:105. doi: 10.3389/fnhum.2018.00105. eCollection 2018.