Le Hoang H, Nguyen Duy M H, Bhatti Omair Shahzad, Kopácsi László, Ngo Thinh P, Nguyen Binh T, Barz Michael, Sonntag Daniel
Interactive Machine Learning Department, German Research Center for Artificial Intelligence (DFKI), 66123, Saarbrücken, Germany.
Mathematics and Computer Science Department, University of Science, VNU-HCM, Ho Chi Minh City, Vietnam.
Sci Rep. 2025 Apr 23;15(1):14192. doi: 10.1038/s41598-025-94593-y.
Comprehending how humans process visual information in dynamic settings is crucial for psychology and for designing user-centered interactions. While mobile eye-tracking systems that combine egocentric video and gaze signals can offer valuable insights, manual analysis of these recordings is time-intensive. In this work, we present a novel human-centered learning algorithm designed for automated object recognition in mobile eye-tracking settings. Our approach seamlessly integrates an object detector with a spatial relation-aware inductive message-passing network (I-MPN), harnessing node profile information and capturing object correlations. These mechanisms enable us to learn embedding functions that generalize to new object viewing angles, facilitating rapid adaptation and efficient reasoning in dynamic contexts as users navigate their environment. In experiments on three distinct video sequences, our interactive method shows significant performance improvements over fixed training/testing algorithms, even when trained on considerably fewer annotated samples collected through user feedback. Furthermore, we demonstrate high efficiency in the data annotation process and surpass prior interactive methods that use complete object detectors, combine detectors with convolutional networks, or employ interactive video segmentation.
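To make the architecture concrete, the following is a minimal sketch of one spatial relation-aware inductive message-passing layer over detected objects, in the spirit of the I-MPN described above. It is not the authors' implementation: the class name SpatialMessagePassing, the GraphSAGE-style concat-and-project update, and the 4-dimensional relative-box edge features are all illustrative assumptions. Nodes are object detections (appearance features plus box geometry), and edges carry relative spatial offsets, so aggregation is conditioned on spatial relations; because no per-node embeddings are learned, the layer remains inductive and can be applied to objects and viewpoints unseen during training.

import torch
import torch.nn as nn

class SpatialMessagePassing(nn.Module):
    # Illustrative sketch, not the paper's code: one inductive
    # message-passing layer whose messages are conditioned on the
    # relative spatial geometry of each pair of detected objects.
    def __init__(self, node_dim: int, edge_dim: int = 4, hidden: int = 64):
        super().__init__()
        # Message MLP: conditions each neighbor's feature on the
        # relative box offsets (dx, dy, dw, dh) stored on the edge.
        self.msg = nn.Sequential(
            nn.Linear(node_dim + edge_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden),
        )
        # Update: combine a node's own profile with aggregated messages
        # (concat-then-project keeps the layer inductive, since no
        # per-node parameters are learned).
        self.update = nn.Sequential(
            nn.Linear(node_dim + hidden, node_dim), nn.ReLU(),
        )

    def forward(self, x, edge_index, edge_attr):
        # x: (N, node_dim) node features per detected object
        # edge_index: (2, E) source/destination node indices
        # edge_attr: (E, edge_dim) relative spatial offsets per edge
        src, dst = edge_index
        m = self.msg(torch.cat([x[src], edge_attr], dim=-1))  # (E, hidden)
        agg = torch.zeros(x.size(0), m.size(1), device=x.device)
        agg.index_add_(0, dst, m)  # sum incoming messages per node
        return self.update(torch.cat([x, agg], dim=-1))

# Toy usage: three detected objects, fully connected without self-loops.
if __name__ == "__main__":
    x = torch.randn(3, 32)  # per-object appearance/geometry features
    edge_index = torch.tensor([[0, 0, 1, 1, 2, 2],
                               [1, 2, 0, 2, 0, 1]])
    edge_attr = torch.randn(edge_index.size(1), 4)  # relative box offsets
    layer = SpatialMessagePassing(node_dim=32)
    print(layer(x, edge_index, edge_attr).shape)  # torch.Size([3, 32])

In an interactive setting like the one the abstract describes, a lightweight layer of this kind can be retrained quickly on the small batches of corrections a user provides, which is what makes detector-plus-graph approaches attractive compared with retraining a full object detector.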