• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种针对大旋转头部姿态的新型眼部中心定位方法。

A Novel Eye Center Localization Method for Head Poses With Large Rotations.

作者信息

Hsu Wei-Yen, Chung Chi-Jui

出版信息

IEEE Trans Image Process. 2021;30:1369-1381. doi: 10.1109/TIP.2020.3044209. Epub 2020 Dec 23.

DOI:10.1109/TIP.2020.3044209
PMID:33332268
Abstract

Eye localization is undoubtedly crucial to acquiring large amounts of information. It not only helps people improve their understanding of others but is also a technology that enables machines to better understand humans. Although studies have reported satisfactory accuracy for frontal faces or head poses at limited angles, large head rotations generate numerous defects (e.g., disappearance of the eye), and existing methods are not effective enough to accurately localize eye centers. Therefore, this study makes three contributions to address these limitations. First, we propose a novel complete representation (CR) pipeline that can flexibly learn and generate two complete representations, namely the CR-center and CR-region, of the same identity. We also propose two novel eye center localization methods. This first method employs geometric transformation to estimate the rotational difference between two faces and an unknown-localization strategy for accurate transformation of the CR-center. The second method is based on image translation learning and uses the CR-region to train the generative adversarial network, which can then accurately generate and localize eye centers. Five image databases are employed to verify the proposed methods, and tests reveal that compared with existing methods, the proposed method can more accurately and robustly localize eye centers in challenging images, such as those showing considerable head rotation (both yaw rotation of -67.5° to +67.5° and roll rotation of +120° to -120°), complete occlusion of both eyes, poor illumination in addition to head rotation, head pose changes in the dark, and various gaze interaction.

摘要

眼睛定位对于获取大量信息无疑至关重要。它不仅有助于人们增进对他人的理解,也是一项能让机器更好地理解人类的技术。尽管已有研究报告称在有限角度下正面人脸或头部姿势的准确率令人满意,但大幅度的头部旋转会产生许多缺陷(例如眼睛消失),并且现有方法在准确确定眼睛中心位置方面效果不够理想。因此,本研究为解决这些局限性做出了三点贡献。首先,我们提出了一种新颖的完整表示(CR)管道,它可以灵活地学习并生成同一身份的两种完整表示,即CR中心和CR区域。我们还提出了两种新颖的眼睛中心定位方法。第一种方法采用几何变换来估计两张脸之间的旋转差异,并采用一种未知定位策略来精确转换CR中心。第二种方法基于图像平移学习,并使用CR区域来训练生成对抗网络,该网络随后可以准确地生成并定位眼睛中心。我们使用了五个图像数据库来验证所提出的方法,测试结果表明,与现有方法相比,所提出的方法能够在具有挑战性的图像中更准确、更稳健地定位眼睛中心,这些具有挑战性的图像包括那些头部有大幅旋转(偏航旋转范围为-67.5°至+67.5°,翻滚旋转范围为+120°至-120°)、双眼完全遮挡、除头部旋转外光照不佳、黑暗中头部姿势变化以及各种注视交互的图像。

相似文献

1
A Novel Eye Center Localization Method for Head Poses With Large Rotations.一种针对大旋转头部姿态的新型眼部中心定位方法。
IEEE Trans Image Process. 2021;30:1369-1381. doi: 10.1109/TIP.2020.3044209. Epub 2020 Dec 23.
2
Combining head pose and eye location information for gaze estimation.结合头部姿势和眼睛位置信息进行注视估计。
IEEE Trans Image Process. 2012 Feb;21(2):802-15. doi: 10.1109/TIP.2011.2162740. Epub 2011 Jul 22.
3
Combined influence of vergence and eye position on three-dimensional vestibulo-ocular reflex in the monkey.双眼会聚和眼位对猴子三维前庭眼反射的联合影响。
J Neurophysiol. 2002 Nov;88(5):2368-76. doi: 10.1152/jn.00796.2001.
4
Three-dimensional vector analysis of the human vestibuloocular reflex in response to high-acceleration head rotations. I. Responses in normal subjects.人体前庭眼反射对高加速度头部旋转反应的三维矢量分析。I. 正常受试者的反应。
J Neurophysiol. 1996 Dec;76(6):4009-20. doi: 10.1152/jn.1996.76.6.4009.
5
Three-dimensional vector analysis of the human vestibuloocular reflex in response to high-acceleration head rotations. II. responses in subjects with unilateral vestibular loss and selective semicircular canal occlusion.人类前庭眼反射对高加速度头部旋转反应的三维矢量分析。II. 单侧前庭丧失和选择性半规管阻塞受试者的反应。
J Neurophysiol. 1996 Dec;76(6):4021-30. doi: 10.1152/jn.1996.76.6.4021.
6
Three-dimensional organization of otolith-ocular reflexes in rhesus monkeys. I. Linear acceleration responses during off-vertical axis rotation.恒河猴耳石-眼反射的三维组织。I. 非垂直轴旋转期间的线性加速度反应。
J Neurophysiol. 1996 Jun;75(6):2405-24. doi: 10.1152/jn.1996.75.6.2405.
7
Noncommutative control in the rotational vestibuloocular reflex.旋转性前庭眼反射中的非交换控制
J Neurophysiol. 2008 Jan;99(1):96-111. doi: 10.1152/jn.00804.2007. Epub 2007 Nov 7.
8
Representation Learning by Rotating Your Faces.旋转人脸进行表示学习。
IEEE Trans Pattern Anal Mach Intell. 2019 Dec;41(12):3007-3021. doi: 10.1109/TPAMI.2018.2868350. Epub 2018 Sep 3.
9
Accurate eye center location through invariant isocentric patterns.通过不变等中心模式实现准确的眼睛中心定位。
IEEE Trans Pattern Anal Mach Intell. 2012 Sep;34(9):1785-98. doi: 10.1109/TPAMI.2011.251.
10
Behavior of the human translational vestibulo-ocular reflex during simultaneous head translation and rotation.人类平移性前庭眼反射在头部同时进行平移和旋转时的表现。
J Vestib Res. 2014;24(5-6):329-33. doi: 10.3233/VES-140522.

引用本文的文献

1
Bppv nystagmus signals diagnosis framework based on deep learning.基于深度学习的良性阵发性位置性眩晕眼震信号诊断框架
Phys Eng Sci Med. 2025 May 13. doi: 10.1007/s13246-025-01542-0.
2
An Effective Algorithm to Analyze the Optokinetic Nystagmus Waveforms from a Low-Cost Eye Tracker.一种用于分析低成本眼动仪记录的视动性眼震波形的有效算法。
Healthcare (Basel). 2022 Jul 10;10(7):1281. doi: 10.3390/healthcare10071281.
3
A Fast and Effective System for Detection of Neonatal Jaundice with a Dynamic Threshold White Balance Algorithm.一种基于动态阈值白平衡算法的快速高效新生儿黄疸检测系统。
Healthcare (Basel). 2021 Aug 16;9(8):1052. doi: 10.3390/healthcare9081052.