Erel Yotam, Shannon Katherine Adams, Chu Junyi, Scott Kim, Struhl Melissa Kline, Cao Peng, Tan Xincheng, Hart Peter, Raz Gal, Piccolo Sabrina, Mei Catherine, Potter Christine, Jaffe-Dax Sagi, Lew-Williams Casey, Tenenbaum Joshua, Fairchild Katherine, Bermano Amit, Liu Shari
The Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv-Yafo, Israel.
Department of Psychology, Stanford University, Stanford, California.
Adv Methods Pract Psychol Sci. 2023 Apr-Jun;6(2). doi: 10.1177/25152459221147250. Epub 2023 Apr 18.
Technological advances in psychological research have enabled large-scale studies of human behavior and streamlined pipelines for automatic processing of data. However, studies of infants and children have not fully reaped these benefits because the behaviors of interest, such as gaze duration and direction, still have to be extracted from video through a laborious process of manual annotation, even when these data are collected online. Recent advances in computer vision raise the possibility of automated annotation of these video data. In this article, we built on a system for automatic gaze annotation in young children, iCatcher, by engineering improvements and then training and testing the system (referred to hereafter as iCatcher+) on three data sets with substantial video and participant variability (214 videos collected in U.S. lab and field sites, 143 videos collected in Senegal field sites, and 265 videos collected via webcams in homes; participant age range = 4 months-3.5 years). When trained on each of these data sets, iCatcher+ performed with near human-level accuracy on held-out videos at distinguishing "LEFT" versus "RIGHT" and "ON" versus "OFF" looking behavior across all data sets. This high performance was achieved at the level of individual frames, experimental trials, and study videos; held across participant demographics (e.g., age, race/ethnicity), participant behavior (e.g., movement, head position), and video characteristics (e.g., luminance); and generalized to a fourth, entirely held-out online data set. We close by discussing the next steps required to fully automate the life cycle of online infant and child behavioral studies, representing a key step toward enabling robust and high-throughput developmental research.
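To make concrete the kind of frame-level and trial-level measures the abstract refers to, the following is a minimal sketch of how per-frame gaze labels (e.g., "left", "right", "away") can be scored against human annotations and aggregated into trial-level looking times. The function names (frame_level_accuracy, trial_looking_time), the label vocabulary, and the 30-fps frame rate are illustrative assumptions for this sketch, not the iCatcher+ API or the authors' evaluation code.

```python
# Illustrative sketch: scoring per-frame gaze labels against human annotations
# and aggregating them into a trial-level looking-time measure. All names and
# values here are assumptions, not part of iCatcher+.

from collections import Counter

FPS = 30  # assumed video frame rate


def frame_level_accuracy(model_labels, human_labels):
    """Proportion of frames on which model and human annotations agree."""
    assert len(model_labels) == len(human_labels), "label sequences must align"
    agree = sum(m == h for m, h in zip(model_labels, human_labels))
    return agree / len(model_labels)


def trial_looking_time(labels, on_labels=("left", "right"), fps=FPS):
    """Total seconds within one trial that the child is coded as looking at the screen."""
    on_frames = sum(1 for lab in labels if lab in on_labels)
    return on_frames / fps


# Example: one short trial annotated by the model and by a human coder.
model = ["left"] * 20 + ["away"] * 5 + ["right"] * 15
human = ["left"] * 22 + ["away"] * 3 + ["right"] * 15

print(f"frame-level agreement: {frame_level_accuracy(model, human):.2%}")
print(f"model looking time:    {trial_looking_time(model):.2f} s")
print(f"human looking time:    {trial_looking_time(human):.2f} s")
print("model label counts:", Counter(model))
```

In this toy example the two coders disagree on only 2 of 40 frames, so frame-level agreement is high and the trial-level looking times differ by a fraction of a second, which is the sense in which frame-level accuracy propagates to trial- and video-level measures.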