Mercier Julien, Ertz Olivier, Bocher Erwan
MEI, School of Engineering and Management Vaud, HES-SO, Switzerland.
Lab-STICC, UMR 6285, CNRS, Université Bretagne Sud, Vannes, France.
J Eye Mov Res. 2024 Apr 29;17(3). doi: 10.16910/jemr.17.3.3. eCollection 2024.
Mobile eye tracking captures egocentric vision and is well-suited for naturalistic studies. However, its data are noisy, especially when acquired outdoors with multiple participants over several sessions. Area-of-interest analysis on moving targets is difficult because (a) the camera and objects move nonlinearly and may disappear from and reappear in the scene, and (b) off-the-shelf analysis tools are limited to linearly moving objects. As a result, researchers resort to time-consuming manual annotation, which limits the use of mobile eye tracking in naturalistic studies. We introduce a method based on a fine-tuned Vision Transformer (ViT) model for classifying frames with overlaid gaze markers. After fine-tuning a model for three epochs on a manually labelled training set comprising 1.98% (7,845 frames) of our entire data, our model reached 99.34% accuracy as evaluated on hold-out data. We used the method to quantify participants' dwell time on a tablet during the outdoor user test of a mobile augmented reality application for biodiversity education. We discuss the benefits and limitations of our approach and its potential to be applied in other contexts.
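The pipeline the abstract describes can be sketched as a binary image-classification fine-tuning loop. The sketch below is a minimal illustration, not the authors' implementation: it assumes the HuggingFace `transformers` library, uses a deliberately shrunken randomly initialized ViT (the paper fine-tunes a full pretrained model, e.g. via `ViTForImageClassification.from_pretrained`), and stands in random tensors for the labelled scene-camera frames.

```python
# Hypothetical sketch of the frame-classification setup: fine-tune a Vision
# Transformer to label scene-camera frames as "gaze on the area of interest"
# vs. "not". Model size, hyperparameters, and data here are illustrative
# assumptions, not the authors' exact configuration.
import torch
from transformers import ViTConfig, ViTForImageClassification

# Tiny ViT so the sketch runs quickly; replace with a pretrained checkpoint
# for real use.
config = ViTConfig(
    image_size=224,
    patch_size=32,
    hidden_size=64,
    num_hidden_layers=2,
    num_attention_heads=4,
    intermediate_size=128,
    num_labels=2,  # binary: gaze marker on the tablet or not
)
model = ViTForImageClassification(config)

# Stand-in batch: two video frames with overlaid gaze markers (random here;
# in the paper these are manually labelled frames from the recordings).
pixel_values = torch.randn(2, 3, 224, 224)
labels = torch.tensor([0, 1])

optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)
model.train()
for _ in range(3):  # the paper reports fine-tuning for three epochs
    outputs = model(pixel_values=pixel_values, labels=labels)
    outputs.loss.backward()
    optimizer.step()
    optimizer.zero_grad()

# Inference: one class score pair per frame; dwell time then follows from
# counting consecutive positive frames at the known video frame rate.
model.eval()
with torch.no_grad():
    logits = model(pixel_values=pixel_values).logits
print(logits.shape)  # one (2-class) logit vector per input frame
```

In this framing, per-frame predictions are aggregated over the video timeline, so a participant's dwell time on the tablet is simply the number of positively classified frames divided by the frame rate.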