Suppr超能文献

说话人内和说话人间的声学嗓音变化。

Acoustic voice variation within and between speakers.

机构信息

Department of Head and Neck Surgery, UCLA School of Medicine, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA.

Department of Linguistics, University of California, Los Angeles, 3125 Campbell Hall, Box 951543, Los Angeles, California 90095-1543, USA.

出版信息

J Acoust Soc Am. 2019 Sep;146(3):1568. doi: 10.1121/1.5125134.

Abstract

Little is known about the nature or extent of everyday variability in voice quality. This paper describes a series of principal component analyses to explore within- and between-talker acoustic variation and the extent to which they conform to expectations derived from current models of voice perception. Based on studies of faces and cognitive models of speaker recognition, the authors hypothesized that a few measures would be important across speakers, but that much of within-speaker variability would be idiosyncratic. Analyses used multiple sentence productions from 50 female and 50 male speakers of English, recorded over three days. Twenty-six acoustic variables from a psychoacoustic model of voice quality were measured every 5 ms on vowels and approximants. Across speakers the balance between higher harmonic amplitudes and inharmonic energy in the voice accounted for the most variance (females = 20%, males = 22%). Formant frequencies and their variability accounted for an additional 12% of variance across speakers. Remaining variance appeared largely idiosyncratic, suggesting that the speaker-specific voice space is different for different people. Results further showed that voice spaces for individuals and for the population of talkers have very similar acoustic structures. Implications for prototype models of voice perception and recognition are discussed.

摘要

关于日常语音质量的变化性质或程度,人们知之甚少。本文描述了一系列主成分分析,以探索说话者内和说话者间的声学变化,以及它们在多大程度上符合当前语音感知模型的预期。基于对人脸和说话者识别认知模型的研究,作者假设一些指标在说话者之间很重要,但大多数说话者内的变化是特质的。分析使用了来自 50 名女性和 50 名男性英语说话者的三天内多次句子产生的数据,对元音和近音进行了每 5 毫秒的 26 个声学变量的测量。在说话者之间,声音中较高谐波振幅与非谐波能量之间的平衡解释了最大的方差(女性= 20%,男性= 22%)。共振峰频率及其可变性占说话者间方差的 12%。其余的方差似乎主要是特质的,这表明不同的人具有不同的特定于说话者的声音空间。结果还表明,个体和说话者群体的声音空间具有非常相似的声学结构。讨论了对语音感知和识别原型模型的影响。

相似文献

2
Acoustic voice variation in spontaneous speech.自发言语中的语音变化。
J Acoust Soc Am. 2022 May;151(5):3462. doi: 10.1121/10.0011471.

引用本文的文献

1
CoVox: A dataset of contrasting vocalizations.CoVox:一个包含对比发声的数据集。
Behav Res Methods. 2025 Apr 11;57(5):142. doi: 10.3758/s13428-025-02664-9.
2
Effects of laryngeal manipulations on voice gender perception.喉部操作对嗓音性别感知的影响。
Interspeech. 2022 Sep;2022:1856-1860. doi: 10.21437/interspeech.2022-10815.
4
Foreign language talker identification does not generalize to new talkers.外语说话者识别不能推广到新的说话者。
Psychon Bull Rev. 2025 Apr;32(2):941-950. doi: 10.3758/s13423-024-02598-x. Epub 2024 Oct 23.
10
The own-voice benefit for word recognition in early bilinguals.早期双语者中母语语音对单词识别的益处。
Front Psychol. 2022 Sep 2;13:901326. doi: 10.3389/fpsyg.2022.901326. eCollection 2022.

本文引用的文献

4
Understanding the mechanisms of familiar voice-identity recognition in the human brain.理解人类大脑中熟悉声音识别的机制。
Neuropsychologia. 2018 Jul 31;116(Pt B):179-193. doi: 10.1016/j.neuropsychologia.2018.03.039. Epub 2018 Mar 31.
6
Learning faces from variability.从变异性中学习面部特征。
Q J Exp Psychol (Hove). 2017 May;70(5):897-905. doi: 10.1080/17470218.2015.1136656. Epub 2016 Mar 7.
7
The Scree Test For The Number Of Factors.因子数量的碎石检验
Multivariate Behav Res. 1966 Apr 1;1(2):245-76. doi: 10.1207/s15327906mbr0102_10.
8
Recognizing and identifying people: A neuropsychological review.识人和认人:一项神经心理学综述。
Cortex. 2016 Feb;75:132-150. doi: 10.1016/j.cortex.2015.11.023. Epub 2015 Dec 25.
9
Exemplar variance supports robust learning of facial identity.示例方差支持对面部身份的稳健学习。
J Exp Psychol Hum Percept Perform. 2015 Jun;41(3):577-81. doi: 10.1037/xhp0000049. Epub 2015 Apr 13.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验