Stevenage Sarah V, Singh Lucy, Dixey Pru
School of Psychology, University of Southampton, Southampton SO17 1BJ, UK.
Brain Sci. 2023 Feb 19;13(2):358. doi: 10.3390/brainsci13020358.
Vocal identity processing depends on the ability to tell apart two instances of different speakers whilst also being able to tell together two instances of the same speaker. Whilst previous research has examined these voice processing capabilities under relatively common listening conditions, it has not yet tested the limits of these capabilities. Here, two studies are presented that employ challenging listening tasks to determine just how good we are at these voice processing tasks. In Experiment 1, 54 university students were asked to distinguish between very similar sounding, yet different speakers (celebrity targets and their impersonators). Participants completed a 'Same/Different' task and a 'Which is the Celebrity?' task for pairs of speakers, and a 'Real or Not?' task for individual speakers. In Experiment 2, a separate group of 40 university students was asked to pair very different sounding instances of the same speakers (speaking and singing). Participants were presented with an array of voice clips and completed a 'Pairs Task' as a variant of the more traditional voice sorting task. The results of Experiment 1 suggested that significantly more mistakes were made when distinguishing celebrity targets from their impersonators than when distinguishing the same targets from control voices. Nevertheless, listeners were significantly better than chance in all three tasks despite the challenge. Similarly, the results of Experiment 2 suggested that it was significantly more difficult to pair singing and speaking clips than to pair two speaking clips, particularly when the speakers were unfamiliar. Again, however, the performance was significantly above zero, and was again better than chance in a cautious comparison. Taken together, the results suggest that vocal identity processing is a highly adaptable task, assisted by familiarity with the speaker. However, the fact that performance remained above chance in all tasks suggests that we had not reached the limit of our listeners' capability, despite the considerable listening challenges introduced. We conclude that voice processing is far better than previous research might have presumed.