Semizer Yelda, Rosenholtz Ruth
Department of Humanities and Social Sciences, New Jersey Institute of Technology, Newark, NJ, USA.
NVIDIA, Westford, MA, USA.
Cogn Res Princ Implic. 2025 Jul 9;10(1):40. doi: 10.1186/s41235-025-00643-4.
The use of video conferencing tools has become increasingly common recently. The visual displays in these tools are highly complex, being composed of multiple faces with varying image quality and lighting conditions. On top of this, users have the ability to choose their own backgrounds. Some choose simple artificial backgrounds, some appear in front of a real or simulated room, and some use something more abstract. How do these choices affect the user's ability to use the tool, for example, finding the current speaker or a reaction symbol? Vision science can certainly provide answers to these questions; however, most search studies use simple displays with a uniform background, or more recently, real-world scenes. How does what we know about search generalize to these more complex displays? The current study sought to examine how our understanding of visual search applies to well-controlled video conferencing displays. Specifically, we investigated the effect of display clutter (i.e., background complexity and variability) on perceptual tasks relevant for video conferencing. In an eye-tracking set-up, participants searched either for the speaker whose image was highlighted (Experiment 1) or for a reaction symbol (raised-hand) embedded on one of the attendees' background. Results showed a significant effect of background complexity and variability, suggesting that search performance declined as the display clutter increased. Image-based analysis showed that the choice of backgrounds mediated these effects, suggesting that some virtual backgrounds were not optimal for perceptual processes.
视频会议工具的使用近来愈发普遍。这些工具中的视觉显示极为复杂,由多个面部组成,图像质量和光照条件各不相同。除此之外,用户能够选择自己的背景。一些人选择简单的虚拟背景,一些人出现在真实或模拟的房间前,还有一些人使用更抽象的背景。这些选择如何影响用户使用该工具的能力,比如找到当前发言者或一个反应符号?视觉科学肯定能为这些问题提供答案;然而,大多数搜索研究使用具有统一背景的简单显示,或者最近使用真实世界场景。我们对搜索的了解如何推广到这些更复杂的显示中?当前的研究旨在考察我们对视觉搜索的理解如何应用于控制良好的视频会议显示。具体而言,我们研究了显示杂乱(即背景复杂性和可变性)对与视频会议相关的感知任务的影响。在一个眼动追踪设置中,参与者要么搜索图像被突出显示的发言者(实验1),要么搜索嵌入在其中一位参会者背景上的反应符号(举手)。结果显示背景复杂性和可变性有显著影响,表明随着显示杂乱增加,搜索性能下降。基于图像的分析表明背景的选择介导了这些影响,这表明一些虚拟背景对感知过程并非最优。