Bradshaw Abigail R, McGettigan Carolyn
Department of Speech, Hearing and Phonetic Sciences, University College London, London, UK.
Department of Experimental Psychology, University of Oxford, Oxford, UK.
Q J Exp Psychol (Hove). 2021 Aug;74(8):1344-1359. doi: 10.1177/1747021821999663. Epub 2021 Mar 21.
Recent research suggests that reinforcement learning may underlie trait formation in social interactions with faces. The current study investigated whether the same learning mechanisms could be engaged for trait learning from voices. On each trial of a training phase, participants (n = 192) chose from pairs of human or slot machine targets that varied in the (1) reward value and (2) generosity of their payouts. Targets were either auditory (voices or tones; Experiment 1) or visual (faces or icons; Experiment 2) and were presented sequentially before payout feedback. A test phase measured participant choice behaviour, and a post-test recorded their target preference ratings. For auditory targets, only reward had a significant effect on target choices, although preference ratings were higher for more generous humans and slot machines. For visual targets, findings from previous studies were replicated: participants learned about both generosity and reward, but generosity was prioritised in the human condition. These findings provide one of the first demonstrations of reinforcement learning of reward with auditory stimuli in a social learning task, but suggest that the use of auditory targets does alter learning in this paradigm. Conversely, reinforcement learning of reward and trait information with visual stimuli remains intact even when sequential presentation introduces a delay in feedback.