Kanber Elise, Lally Clare, Razin Raha, Rosi Victor, Garrido Lúcia, Lavan Nadine, McGettigan Carolyn
Department of Speech, Hearing and Phonetic Sciences, UCL, Chandler House, 2 Wakefield Street, London WC1N 1PF, UK.
Department of Speech, Hearing and Phonetic Sciences, UCL, Chandler House, 2 Wakefield Street, London WC1N 1PF, UK; Department Experimental Psychology, UCL, 26 Bedford Way, London WC1H 0AP, UK.
Curr Biol. 2025 May 19;35(10):2424-2432.e6. doi: 10.1016/j.cub.2025.03.081. Epub 2025 Apr 18.
The human voice is highly flexible, allowing for diverse expression during communication, but presents perceptual challenges through large acoustic variability. The ability to recognize an individual person's voice depends on the listener's ability to overcome this within-speaker variability to extract a single identity percept. Previous work has found that this process is greatly assisted by familiarity, with evidence suggesting that more extensive and varied exposure to a voice is associated with the formation of a more robust mental representation of it. Here, we used functional magnetic resonance imaging (fMRI) with representational similarity analysis to characterize how personal familiarity with a voice is reflected in neural representations. We measured and compared brain responses with voices of differing familiarity-a personally familiar voice, a voice familiarized through lab training, and a new (untrained) voice-while listeners identified these voices from naturally varying, spontaneous speech clips. Personally familiar voices elicited brain response patterns in voice-, face-, and person-selective corticesthat showed higher within- and between-speaker dissimilarity, compared with lower-familiarity lab-trained and untrained voices. These findings indicated that representations for the sounds of personally familiar voices are better resolved from each other in the brain, and they align with other research reporting intelligibility advantages for speech produced by familiar talkers. Overall, our findings suggest that extensive and varied exposure to personally familiar voices results in the development of finer-grained representations of those voices, which cannot be achieved via short-term lab training.
人类的声音具有高度的灵活性,在交流过程中能够实现多样的表达,但由于其声学变异性大,也带来了感知上的挑战。识别一个人的声音的能力取决于听众克服说话者内部变异性以提取单一身份感知的能力。先前的研究发现,熟悉度对这一过程有很大帮助,有证据表明,对一种声音进行更广泛、更多样化的接触与形成更稳固的心理表征相关。在这里,我们使用功能磁共振成像(fMRI)和表征相似性分析来表征对一种声音的个人熟悉度如何在神经表征中得到体现。我们测量并比较了听众从自然变化的自发语音片段中识别不同熟悉度的声音(个人熟悉的声音、通过实验室训练熟悉的声音和新的(未训练的)声音)时的大脑反应。与熟悉度较低的实验室训练声音和未训练声音相比,个人熟悉的声音在语音、面部和人物选择性皮层中引发的大脑反应模式显示出更高的说话者内部和说话者之间的差异。这些发现表明,大脑中对个人熟悉声音的表征彼此之间的分辨更好,并且与其他关于熟悉的说话者产生的语音具有可懂度优势的研究结果一致。总体而言,我们的研究结果表明,对个人熟悉的声音进行广泛、多样的接触会导致对这些声音形成更精细的表征,而这是通过短期实验室训练无法实现的。