Suppr超能文献

结合头部姿势和眼睛位置信息进行注视估计。

Combining head pose and eye location information for gaze estimation.

机构信息

Intelligent Systems Lab, Amsterdam, University of Amsterdam, 1098 XH Amsterdam, The Netherlands.

出版信息

IEEE Trans Image Process. 2012 Feb;21(2):802-15. doi: 10.1109/TIP.2011.2162740. Epub 2011 Jul 22.

Abstract

Head pose and eye location for gaze estimation have been separately studied in numerous works in the literature. Previous research shows that satisfactory accuracy in head pose and eye location estimation can be achieved in constrained settings. However, in the presence of nonfrontal faces, eye locators are not adequate to accurately locate the center of the eyes. On the other hand, head pose estimation techniques are able to deal with these conditions; hence, they may be suited to enhance the accuracy of eye localization. Therefore, in this paper, a hybrid scheme is proposed to combine head pose and eye location information to obtain enhanced gaze estimation. To this end, the transformation matrix obtained from the head pose is used to normalize the eye regions, and in turn, the transformation matrix generated by the found eye location is used to correct the pose estimation procedure. The scheme is designed to enhance the accuracy of eye location estimations, particularly in low-resolution videos, to extend the operative range of the eye locators, and to improve the accuracy of the head pose tracker. These enhanced estimations are then combined to obtain a novel visual gaze estimation system, which uses both eye location and head information to refine the gaze estimates. From the experimental results, it can be derived that the proposed unified scheme improves the accuracy of eye estimations by 16% to 23%. Furthermore, it considerably extends its operating range by more than 15° by overcoming the problems introduced by extreme head poses. Moreover, the accuracy of the head pose tracker is improved by 12% to 24%. Finally, the experimentation on the proposed combined gaze estimation system shows that it is accurate (with a mean error between 2° and 5°) and that it can be used in cases where classic approaches would fail without imposing restraints on the position of the head.

摘要

头部姿势和眼睛位置在文献中有大量的研究工作。先前的研究表明,在受限环境下,头部姿势和眼睛位置的估计可以达到令人满意的精度。然而,在面对非正面人脸时,眼睛定位器无法准确地定位眼睛的中心。另一方面,头部姿势估计技术能够处理这些情况;因此,它们可能适合于提高眼睛定位的准确性。因此,在本文中,提出了一种混合方案,将头部姿势和眼睛位置信息结合起来,以获得增强的注视估计。为此,使用从头部姿势获得的变换矩阵来归一化眼睛区域,并且反过来,使用找到的眼睛位置生成的变换矩阵来校正姿势估计过程。该方案旨在提高眼睛位置估计的准确性,特别是在低分辨率视频中,扩展眼睛定位器的操作范围,并提高头部姿势跟踪器的准确性。然后,将这些增强的估计值结合起来,获得一种新的视觉注视估计系统,该系统使用眼睛位置和头部信息来改进注视估计值。从实验结果可以得出,所提出的统一方案将眼睛估计的精度提高了 16%到 23%。此外,通过克服由极端头部姿势引入的问题,它将操作范围扩展了 15°以上。此外,头部姿势跟踪器的准确性提高了 12%到 24%。最后,对所提出的组合注视估计系统的实验表明,它是准确的(平均误差在 2°到 5°之间),并且可以在经典方法由于没有对头部位置施加限制而失败的情况下使用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验