Hector Research Institute of Education Sciences and Psychology, University of Tübingen, Europastraße 6, 72072, Tübingen, Germany.
Human-Computer Interaction, University of Tübingen, Sand 14, 72076, Tübingen, Germany.
Sci Rep. 2024 May 29;14(1):12329. doi: 10.1038/s41598-024-60370-6.
Mental rotation is the ability to rotate mental representations of objects in space. Shepard and Metzler's shape-matching tasks, frequently used to test mental rotation, involve presenting pictorial representations of 3D objects. This stimulus material has raised questions regarding the ecological validity of the test for mental rotation with actual visual 3D objects. To systematically investigate differences in mental rotation with pictorial and visual stimuli, we compared data of university students from a virtual reality experiment. Comparing both conditions within subjects, we found higher accuracy and faster reaction times for 3D visual figures. We expected eye tracking to reveal differences in participants' stimulus processing and mental rotation strategies induced by the visual differences. We statistically compared fixations (locations), saccades (directions), pupil changes, and head movements. Supplementary Shapley values of a Gradient Boosting Decision Tree algorithm were analyzed, which correctly classified the two conditions using eye and head movements. The results indicated that with visual 3D figures, the encoding of spatial information was less demanding, and participants may have used egocentric transformations and perspective changes. Moreover, participants showed eye movements associated with more holistic processing for visual 3D figures and more piecemeal processing for pictorial 2D figures.
心理旋转是在空间中旋转物体心理表象的能力。谢巴德和梅茨勒的形状匹配任务经常被用来测试心理旋转,涉及呈现三维物体的图像表示。这种刺激材料引发了关于使用实际视觉三维物体进行心理旋转测试的生态有效性的问题。为了系统地研究心理旋转在图像和视觉刺激之间的差异,我们比较了来自虚拟现实实验的大学生的数据。在被试内比较两种条件,我们发现 3D 视觉图形的准确性更高,反应时间更快。我们预计眼动追踪将揭示参与者的刺激处理和由视觉差异引起的心理旋转策略的差异。我们对注视点(位置)、扫视(方向)、瞳孔变化和头部运动进行了统计学比较。还分析了梯度提升决策树算法的 Shapley 值补充,该算法使用眼动和头部运动正确地对两种情况进行了分类。结果表明,对于视觉三维图形,空间信息的编码要求较低,参与者可能使用了自我中心转换和视角变化。此外,参与者表现出与视觉三维图形更整体的处理相关的眼动,与图像二维图形更零碎的处理相关的眼动。