• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度神经网络检测眼神交流与人类专家一样准确。

Detection of eye contact with deep neural networks is as accurate as human experts.

作者信息

Chong Eunji, Clark-Whitney Elysha, Southerland Audrey, Stubbs Elizabeth, Miller Chanel, Ajodan Eliana L, Silverman Melanie R, Lord Catherine, Rozga Agata, Jones Rebecca M, Rehg James M

机构信息

School of Interactive Computing, Georgia Institute of Technology, Atlanta, USA.

Center for Autism and the Developing Brain, Weill Cornell Medicine, New York, USA.

出版信息

Nat Commun. 2020 Dec 14;11(1):6386. doi: 10.1038/s41467-020-19712-x.

DOI:10.1038/s41467-020-19712-x
PMID:33318484
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7736573/
Abstract

Eye contact is among the most primary means of social communication used by humans. Quantification of eye contact is valuable as a part of the analysis of social roles and communication skills, and for clinical screening. Estimating a subject's looking direction is a challenging task, but eye contact can be effectively captured by a wearable point-of-view camera which provides a unique viewpoint. While moments of eye contact from this viewpoint can be hand-coded, such a process tends to be laborious and subjective. In this work, we develop a deep neural network model to automatically detect eye contact in egocentric video. It is the first to achieve accuracy equivalent to that of human experts. We train a deep convolutional network using a dataset of 4,339,879 annotated images, consisting of 103 subjects with diverse demographic backgrounds. 57 subjects have a diagnosis of Autism Spectrum Disorder. The network achieves overall precision of 0.936 and recall of 0.943 on 18 validation subjects, and its performance is on par with 10 trained human coders with a mean precision 0.918 and recall 0.946. Our method will be instrumental in gaze behavior analysis by serving as a scalable, objective, and accessible tool for clinicians and researchers.

摘要

眼神交流是人类使用的最主要的社交沟通方式之一。眼神交流的量化作为社会角色和沟通技巧分析的一部分以及临床筛查手段,具有重要价值。估计受试者的注视方向是一项具有挑战性的任务,但可穿戴式视角摄像头能够有效捕捉眼神交流,该摄像头提供了独特的视角。虽然从这个视角的眼神交流瞬间可以手动编码,但这样的过程往往既费力又主观。在这项工作中,我们开发了一种深度神经网络模型,用于自动检测以自我为中心的视频中的眼神交流。这是首个达到与人类专家相当准确率的模型。我们使用一个包含4339879张标注图像的数据集训练了一个深度卷积网络,该数据集由103名具有不同人口统计学背景的受试者组成。其中57名受试者被诊断患有自闭症谱系障碍。该网络在18名验证受试者上实现了0.936的总体精度和0.943的召回率,其性能与10名经过训练的人类编码员相当,平均精度为0.918,召回率为0.946。我们的方法将作为一种可扩展、客观且易于使用的工具,为临床医生和研究人员的注视行为分析提供帮助。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db5/7736573/0e6266a0e781/41467_2020_19712_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db5/7736573/a623e6988a04/41467_2020_19712_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db5/7736573/98cfb773ec5b/41467_2020_19712_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db5/7736573/755649a24b8d/41467_2020_19712_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db5/7736573/c48898ee2d40/41467_2020_19712_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db5/7736573/0e6266a0e781/41467_2020_19712_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db5/7736573/a623e6988a04/41467_2020_19712_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db5/7736573/98cfb773ec5b/41467_2020_19712_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db5/7736573/755649a24b8d/41467_2020_19712_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db5/7736573/c48898ee2d40/41467_2020_19712_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db5/7736573/0e6266a0e781/41467_2020_19712_Fig5_HTML.jpg

相似文献

1
Detection of eye contact with deep neural networks is as accurate as human experts.利用深度神经网络检测眼神交流与人类专家一样准确。
Nat Commun. 2020 Dec 14;11(1):6386. doi: 10.1038/s41467-020-19712-x.
2
MABAL: a Novel Deep-Learning Architecture for Machine-Assisted Bone Age Labeling.MABAL:一种用于机器辅助骨龄标注的新型深度学习架构。
J Digit Imaging. 2018 Aug;31(4):513-519. doi: 10.1007/s10278-018-0053-3.
3
A spectrogram image based intelligent technique for automatic detection of autism spectrum disorder from EEG.基于声谱图图像的智能技术,用于从 EEG 中自动检测自闭症谱系障碍。
PLoS One. 2021 Jun 25;16(6):e0253094. doi: 10.1371/journal.pone.0253094. eCollection 2021.
4
Automated cardiovascular magnetic resonance image analysis with fully convolutional networks.基于全卷积网络的自动化心血管磁共振图像分析。
J Cardiovasc Magn Reson. 2018 Sep 14;20(1):65. doi: 10.1186/s12968-018-0471-x.
5
Video-audio neural network ensemble for comprehensive screening of autism spectrum disorder in young children.视频-音频神经网络集成用于全面筛查幼儿自闭症谱系障碍。
PLoS One. 2024 Oct 3;19(10):e0308388. doi: 10.1371/journal.pone.0308388. eCollection 2024.
6
Computer-Aided Screening of Autism Spectrum Disorder: Eye-Tracking Study Using Data Visualization and Deep Learning.自闭症谱系障碍的计算机辅助筛查:使用数据可视化和深度学习的眼动追踪研究
JMIR Hum Factors. 2021 Oct 25;8(4):e27706. doi: 10.2196/27706.
7
Hybrid deep transfer learning-based early diagnosis of autism spectrum disorder using scalogram representation of electroencephalography signals.基于脑电图信号的频谱图表示的混合深度迁移学习自闭症谱系障碍早期诊断。
Med Biol Eng Comput. 2024 Feb;62(2):495-503. doi: 10.1007/s11517-023-02959-z. Epub 2023 Nov 8.
8
Automated tool detection with deep learning for monitoring kinematics and eye-hand coordination in microsurgery.用于显微手术中运动学和眼手协调监测的深度学习自动工具检测
Comput Biol Med. 2022 Feb;141:105121. doi: 10.1016/j.compbiomed.2021.105121. Epub 2021 Dec 11.
9
A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.一种使用域转移深度卷积神经网络的新型端到端生物医学图像分类器。
Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.
10
Deep-SAGA: a deep-learning-based system for automatic gaze annotation from eye-tracking data.深度 SAGA:一种基于深度学习的眼动追踪数据自动注视点标注系统。
Behav Res Methods. 2023 Apr;55(3):1372-1391. doi: 10.3758/s13428-022-01833-4. Epub 2022 Jun 1.

引用本文的文献

1
Reliably quantifying the severity of social symptoms in children with autism using ASDSpeech.使用ASD语音可靠地量化自闭症儿童社交症状的严重程度。
Transl Psychiatry. 2025 Jan 18;15(1):14. doi: 10.1038/s41398-025-03233-6.
2
Real-Time Robotic Presentation Skill Scoring Using Multi-Model Analysis and Fuzzy Delphi-Analytic Hierarchy Process.基于多模型分析和模糊 Delphi-层次分析法的实时机器人演示技能评分。
Sensors (Basel). 2023 Dec 5;23(24):9619. doi: 10.3390/s23249619.
3
Augmented reality-based affective training for improving care communication skill and empathy.

本文引用的文献

1
Computer Vision Analysis for Quantification of Autism Risk Behaviors.用于量化自闭症风险行为的计算机视觉分析
IEEE Trans Affect Comput. 2021 Jan-Mar;12(1):215-226. doi: 10.1109/taffc.2018.2868196. Epub 2018 Sep 3.
2
Stationary and ambulatory attention patterns are differentially associated with early temperamental risk for socioemotional problems: Preliminary evidence from a multimodal eye-tracking investigation.静息和活动注意力模式与早期社会情感问题的气质风险相关:来自多模态眼动追踪研究的初步证据。
Dev Psychopathol. 2019 Aug;31(3):971-988. doi: 10.1017/S0954579419000427. Epub 2019 May 17.
3
Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network.
基于增强现实的情感训练,提高护理沟通技巧和同理心。
PLoS One. 2023 Jul 10;18(7):e0288175. doi: 10.1371/journal.pone.0288175. eCollection 2023.
4
Deep learning enables satellite-based monitoring of large populations of terrestrial mammals across heterogeneous landscape.深度学习使基于卫星的对不同景观中大量陆地哺乳动物的监测成为可能。
Nat Commun. 2023 May 27;14(1):3072. doi: 10.1038/s41467-023-38901-y.
5
Eye gaze metrics for skill assessment and feedback in kidney stone surgery.眼球追踪指标在肾结石手术中的技能评估和反馈。
Int J Comput Assist Radiol Surg. 2023 Jun;18(6):1127-1134. doi: 10.1007/s11548-023-02901-6. Epub 2023 May 18.
6
Objective Measurement of Social Gaze and Smile Behaviors in Children with Suspected Autism Spectrum Disorder During Administration of the Autism Diagnostic Observation Schedule, 2nd Edition.在使用《自闭症诊断观察量表(第二版)》对疑似自闭症谱系障碍儿童进行评估期间,对其社交注视和微笑行为进行客观测量。
J Autism Dev Disord. 2024 Jun;54(6):2124-2137. doi: 10.1007/s10803-023-05990-z. Epub 2023 Apr 27.
7
Classifying Autism From Crowdsourced Semistructured Speech Recordings: Machine Learning Model Comparison Study.从众包半结构化语音记录中分类自闭症:机器学习模型比较研究。
JMIR Pediatr Parent. 2022 Apr 14;5(2):e35406. doi: 10.2196/35406.
8
Toward an Attentive Robotic Architecture: Learning-Based Mutual Gaze Estimation in Human-Robot Interaction.迈向一种专注的机器人架构:人机交互中基于学习的相互注视估计
Front Robot AI. 2022 Mar 7;9:770165. doi: 10.3389/frobt.2022.770165. eCollection 2022.
9
Computational approaches to understanding interaction and development.计算方法理解相互作用和发展。
Adv Child Dev Behav. 2022;62:191-230. doi: 10.1016/bs.acdb.2021.12.002. Epub 2022 Feb 12.
10
Technical Challenges for Smooth Interaction With Seniors With Dementia: Lessons From Humanitude™.与老年痴呆症患者顺利互动的技术挑战:源自Humanitude™的经验教训
Front Robot AI. 2021 Jun 2;8:650906. doi: 10.3389/frobt.2021.650906. eCollection 2021.
使用深度神经网络在动态心电图中进行心脏病学家级别的心律失常检测和分类。
Nat Med. 2019 Jan;25(1):65-69. doi: 10.1038/s41591-018-0268-3. Epub 2019 Jan 7.
4
Deep neural network improves fracture detection by clinicians.深度学习神经网络可帮助临床医生提高骨折检出率。
Proc Natl Acad Sci U S A. 2018 Nov 6;115(45):11591-11596. doi: 10.1073/pnas.1806905115. Epub 2018 Oct 22.
5
MPIIGaze: Real-World Dataset and Deep Appearance-Based Gaze Estimation.马克斯·普朗克智能系统研究所注视数据集:真实世界数据集与基于深度外观的注视估计
IEEE Trans Pattern Anal Mach Intell. 2019 Jan;41(1):162-175. doi: 10.1109/TPAMI.2017.2778103. Epub 2017 Nov 28.
6
Computer vision analysis captures atypical attention in toddlers with autism.计算机视觉分析捕捉到自闭症幼儿的非典型注意力。
Autism. 2019 Apr;23(3):619-628. doi: 10.1177/1362361318766247. Epub 2018 Mar 29.
7
Using dual eye tracking to uncover personal gaze patterns during social interaction.利用双眼追踪技术揭示社交互动中个人的注视模式。
Sci Rep. 2018 Mar 9;8(1):4271. doi: 10.1038/s41598-018-22726-7.
8
A survey on deep learning in medical image analysis.深度学习在医学图像分析中的应用研究综述。
Med Image Anal. 2017 Dec;42:60-88. doi: 10.1016/j.media.2017.07.005. Epub 2017 Jul 26.
9
The Human Fetus Preferentially Engages with Face-like Visual Stimuli.人类胎儿更倾向于与类脸视觉刺激互动。
Curr Biol. 2017 Jun 19;27(12):1825-1828.e3. doi: 10.1016/j.cub.2017.05.044. Epub 2017 Jun 9.
10
Hand-Eye Coordination Predicts Joint Attention.手眼协调预测共同注意。
Child Dev. 2017 Nov;88(6):2060-2078. doi: 10.1111/cdev.12730. Epub 2017 Feb 10.