• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

面对面互动中语音驱动的注视

Speech Driven Gaze in a Face-to-Face Interaction.

作者信息

Arslan Aydin Ülkü, Kalkan Sinan, Acartürk Cengiz

机构信息

Cognitive Science Department, Middle East Technical University, Ankara, Turkey.

Computer Engineering Department, Middle East Technical University, Ankara, Turkey.

出版信息

Front Neurorobot. 2021 Mar 4;15:598895. doi: 10.3389/fnbot.2021.598895. eCollection 2021.

DOI:10.3389/fnbot.2021.598895
PMID:33746729
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7970197/
Abstract

Gaze and language are major pillars in multimodal communication. Gaze is a non-verbal mechanism that conveys crucial social signals in face-to-face conversation. However, compared to language, gaze has been less studied as a communication modality. The purpose of the present study is 2-fold: (i) to investigate gaze direction (i.e., aversion and face gaze) and its relation to speech in a face-to-face interaction; and (ii) to propose a computational model for multimodal communication, which predicts gaze direction using high-level speech features. Twenty-eight pairs of participants participated in data collection. The experimental setting was a mock job interview. The eye movements were recorded for both participants. The speech data were annotated by ISO 24617-2 Standard for Dialogue Act Annotation, as well as manual tags based on previous social gaze studies. A comparative analysis was conducted by Convolutional Neural Network (CNN) models that employed specific architectures, namely, VGGNet and ResNet. The results showed that the frequency and the duration of gaze differ significantly depending on the role of participant. Moreover, the ResNet models achieve higher than 70% accuracy in predicting gaze direction.

摘要

注视和语言是多模态交流的主要支柱。注视是一种非语言机制,在面对面交谈中传达关键的社交信号。然而,与语言相比,注视作为一种交流方式的研究较少。本研究的目的有两个:(i)在面对面互动中研究注视方向(即回避和面部注视)及其与言语的关系;(ii)提出一种多模态交流的计算模型,该模型使用高级语音特征预测注视方向。28对参与者参与了数据收集。实验场景是模拟求职面试。记录了两位参与者的眼动。语音数据根据ISO 24617-2对话行为标注标准以及基于先前社会注视研究的手动标签进行标注。采用特定架构(即VGGNet和ResNet)的卷积神经网络(CNN)模型进行了对比分析。结果表明,注视的频率和持续时间因参与者的角色不同而有显著差异。此外,ResNet模型在预测注视方向方面的准确率高于70%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/f58175c6b880/fnbot-15-598895-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/774eba262db9/fnbot-15-598895-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/18b7942283b5/fnbot-15-598895-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/fc08ebf273e4/fnbot-15-598895-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/4a10d25b251e/fnbot-15-598895-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/4ff94942cd14/fnbot-15-598895-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/3e09cec3217e/fnbot-15-598895-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/a404a0eb2308/fnbot-15-598895-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/9bd96f2bf777/fnbot-15-598895-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/54a746113966/fnbot-15-598895-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/f58175c6b880/fnbot-15-598895-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/774eba262db9/fnbot-15-598895-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/18b7942283b5/fnbot-15-598895-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/fc08ebf273e4/fnbot-15-598895-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/4a10d25b251e/fnbot-15-598895-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/4ff94942cd14/fnbot-15-598895-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/3e09cec3217e/fnbot-15-598895-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/a404a0eb2308/fnbot-15-598895-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/9bd96f2bf777/fnbot-15-598895-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/54a746113966/fnbot-15-598895-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/7970197/f58175c6b880/fnbot-15-598895-g0010.jpg

相似文献

1
Speech Driven Gaze in a Face-to-Face Interaction.面对面互动中语音驱动的注视
Front Neurorobot. 2021 Mar 4;15:598895. doi: 10.3389/fnbot.2021.598895. eCollection 2021.
2
MAGiC: A Multimodal Framework for Analysing Gaze in Dyadic Communication.MAGiC:用于分析二元交流中注视的多模态框架。
J Eye Mov Res. 2018 Nov 12;11(6). doi: 10.16910/jemr.11.6.2.
3
Timing of gazes in child dialogues: a time-course analysis of requests and back channelling in referential communication.儿童对话中的注视时间:参照交流中请求和反馈的时间进程分析。
Int J Lang Commun Disord. 2012 Jul-Aug;47(4):373-83. doi: 10.1111/j.1460-6984.2012.00151.x. Epub 2012 Mar 5.
4
Objective eye-gaze behaviour during face-to-face communication with proficient alaryngeal speakers: a preliminary study.面对面交流中使用人工发声器的熟练失音者的客观眼球注视行为:初步研究。
Int J Lang Commun Disord. 2011 Sep-Oct;46(5):535-49. doi: 10.1111/j.1460-6984.2011.00005.x. Epub 2011 Mar 7.
5
Gaze aversion to stuttered speech: a pilot study investigating differential visual attention to stuttered and fluent speech.对口吃言语的回避注视:一项对视口吃和流畅言语的差异视觉注意的初步研究。
Int J Lang Commun Disord. 2010 Mar-Apr;45(2):133-44. doi: 10.3109/13682820902763951.
6
How does the topic of conversation affect verbal exchange and eye gaze? A comparison between typical development and high-functioning autism.话题如何影响言语交流和目光注视?正常发展与高功能自闭症的比较。
Neuropsychologia. 2010 Jul;48(9):2730-9. doi: 10.1016/j.neuropsychologia.2010.05.020. Epub 2010 May 21.
7
Gaze aversion in conversational settings: An investigation based on mock job interview.对话场景中的目光回避:基于模拟求职面试的调查
J Eye Mov Res. 2021 May 19;14(1). doi: 10.16910/jemr.14.1.1.
8
Multimodal Communication in Aphasia: Perception and Production of Co-speech Gestures During Face-to-Face Conversation.失语症中的多模态交流:面对面交谈中伴随言语的手势的感知与产生
Front Hum Neurosci. 2018 Jun 14;12:200. doi: 10.3389/fnhum.2018.00200. eCollection 2018.
9
Effects of being watched on eye gaze and facial displays of typical and autistic individuals during conversation.交谈过程中被注视对典型个体和自闭症个体的眼神注视及面部表情的影响。
Autism. 2021 Jan;25(1):210-226. doi: 10.1177/1362361320951691. Epub 2020 Aug 27.
10
Using dual eye tracking to uncover personal gaze patterns during social interaction.利用双眼追踪技术揭示社交互动中个人的注视模式。
Sci Rep. 2018 Mar 9;8(1):4271. doi: 10.1038/s41598-018-22726-7.

本文引用的文献

1
MAGiC: A Multimodal Framework for Analysing Gaze in Dyadic Communication.MAGiC:用于分析二元交流中注视的多模态框架。
J Eye Mov Res. 2018 Nov 12;11(6). doi: 10.16910/jemr.11.6.2.
2
Eye tracking in Educational Science: Theoretical frameworks and research agendas.教育科学中的眼动追踪:理论框架与研究议程。
J Eye Mov Res. 2017 Feb 4;10(1). doi: 10.16910/jemr.10.1.3.
3
Deep Neural Networks as Scientific Models.深度神经网络作为科学模型。
Trends Cogn Sci. 2019 Apr;23(4):305-317. doi: 10.1016/j.tics.2019.01.009. Epub 2019 Feb 19.
4
Using dual eye tracking to uncover personal gaze patterns during social interaction.利用双眼追踪技术揭示社交互动中个人的注视模式。
Sci Rep. 2018 Mar 9;8(1):4271. doi: 10.1038/s41598-018-22726-7.
5
Speaking and Listening with the Eyes: Gaze Signaling during Dyadic Interactions.用眼睛交谈与倾听:二元互动中的注视信号
PLoS One. 2015 Aug 26;10(8):e0136905. doi: 10.1371/journal.pone.0136905. eCollection 2015.
6
The origin of human multi-modal communication.人类多模态交流的起源。
Philos Trans R Soc Lond B Biol Sci. 2014 Sep 19;369(1651):20130302. doi: 10.1098/rstb.2013.0302.
7
The processing of speech, gesture, and action during language comprehension.语言理解过程中语音、手势和动作的处理。
Psychon Bull Rev. 2015 Apr;22(2):517-23. doi: 10.3758/s13423-014-0681-7.
8
From gaze cueing to dual eye-tracking: novel approaches to investigate the neural correlates of gaze in social interaction.从注视线索到双眼追踪:研究社交互动中注视的神经相关性的新方法。
Neurosci Biobehav Rev. 2013 Dec;37(10 Pt 2):2516-28. doi: 10.1016/j.neubiorev.2013.07.017. Epub 2013 Aug 5.
9
Standardization of automated analyses of oculomotor fixation and saccadic behaviors.眼动注视和扫视行为自动分析的标准化
IEEE Trans Biomed Eng. 2010 Nov;57(11). doi: 10.1109/TBME.2010.2057429. Epub 2010 Jul 26.
10
Eye tracking in infancy research.婴儿期研究中的眼动追踪
Dev Neuropsychol. 2010;35(1):1-19. doi: 10.1080/87565640903325758.