• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基频、共振峰频率、非周期性和频谱水平对语音性别的感知影响。

Influences of fundamental frequency, formant frequencies, aperiodicity, and spectrum level on the perception of voice gender.

作者信息

Skuk Verena G, Schweinberger Stefan R

出版信息

J Speech Lang Hear Res. 2014 Feb;57(1):285-96. doi: 10.1044/1092-4388(2013/12-0314).

DOI:10.1044/1092-4388(2013/12-0314)
PMID:23882002
Abstract

PURPOSE

To determine the relative importance of acoustic parameters (fundamental frequency [F0], formant frequencies [FFs], aperiodicity, and spectrum level [SL]) on voice gender perception, the authors used a novel parameter-morphing approach that, unlike spectral envelope shifting, allows the application of nonuniform scale factors to transform formants and more direct comparison of parameter impact.

METHOD

In each of 2 experiments, 16 listeners with normal hearing (8 female, 8 male) classified voice gender for morphs between female and male speakers, using syllable tokens from 2 male-female speaker pairs. Morphs varied single acoustic parameters (Experiment 1) or selected combinations (Experiment 2), keeping residual parameters androgynous, as determined in a baseline experiment.

RESULTS

The strongest cue related to gender perception was F0, followed by FF and SL. Aperiodicity did not systematically influence gender perception. Morphing F0 and FF in conjunction produced convincing changes in perceived gender-changes that were equivalent to those for Full morphs interpolating all parameters. Despite the importance of F0, morphing FF and SL in combination produced effective changes in voice gender perception.

CONCLUSIONS

The most important single parameters for gender perception are, in order, F0, FF, and SL. At the same time, F0 and vocal tract resonances have a comparable impact on voice gender perception.

摘要

目的

为了确定声学参数(基频[F0]、共振峰频率[FFs]、非周期性和频谱水平[SL])在语音性别感知中的相对重要性,作者采用了一种新颖的参数变形方法,与频谱包络移动不同,该方法允许应用非均匀比例因子来变换共振峰,并能更直接地比较参数的影响。

方法

在两个实验中,16名听力正常的受试者(8名女性,8名男性)使用来自两对男女说话者的音节样本,对女性和男性说话者之间的变形语音进行性别分类。在一个基线实验中确定保持残余参数中性的情况下,变形语音改变单个声学参数(实验1)或选定的参数组合(实验2)。

结果

与性别感知相关的最强线索是F0,其次是FF和SL。非周期性并未系统地影响性别感知。同时改变F0和FF会使感知到的性别产生令人信服的变化——这些变化等同于对所有参数进行插值的完全变形所产生的变化。尽管F0很重要,但同时改变FF和SL也会在语音性别感知上产生有效的变化。

结论

性别感知最重要的单个参数依次为F0、FF和SL。同时,F0和声道共振对语音性别感知有相当的影响。

相似文献

1
Influences of fundamental frequency, formant frequencies, aperiodicity, and spectrum level on the perception of voice gender.基频、共振峰频率、非周期性和频谱水平对语音性别的感知影响。
J Speech Lang Hear Res. 2014 Feb;57(1):285-96. doi: 10.1044/1092-4388(2013/12-0314).
2
Parameter-Specific Morphing Reveals Contributions of Timbre and Fundamental Frequency Cues to the Perception of Voice Gender and Age in Cochlear Implant Users.特定参数变形揭示了音色和基频线索对人工耳蜗使用者语音性别和年龄感知的贡献。
J Speech Lang Hear Res. 2020 Sep 15;63(9):3155-3175. doi: 10.1044/2020_JSLHR-20-00026. Epub 2020 Sep 3.
3
Role of timbre and fundamental frequency in voice gender adaptation.音色和基频在语音性别适应中的作用。
J Acoust Soc Am. 2015 Aug;138(2):1180-93. doi: 10.1121/1.4927696.
4
Speaking fundamental frequency and vowel formant frequencies: effects on perception of gender.基频和元音共振峰频率的发声:对性别感知的影响。
J Voice. 2013 Sep;27(5):556-66. doi: 10.1016/j.jvoice.2012.11.008. Epub 2013 Feb 13.
5
Training listeners to report the acoustic correlate of formant-frequency scaling using synthetic voices.训练听众使用合成语音报告共振峰频率缩放的声学相关物。
J Acoust Soc Am. 2013 Feb;133(2):1065-77. doi: 10.1121/1.4773858.
6
Relationship between fundamental and formant frequencies in voice preference.语音偏好中基频与共振峰频率之间的关系。
J Acoust Soc Am. 2007 Aug;122(2):EL35-43. doi: 10.1121/1.2719045.
7
Influence of voice properties on vowel perception depends on speaker context.语音特性对元音感知的影响取决于说话者背景。
J Acoust Soc Am. 2022 Aug;152(2):820. doi: 10.1121/10.0013363.
8
Vocal fundamental and formant frequencies affect perceptions of speaker cooperativeness.嗓音基频和共振峰频率会影响对说话者合作性的感知。
Q J Exp Psychol (Hove). 2016;69(9):1657-75. doi: 10.1080/17470218.2015.1091484. Epub 2015 Nov 24.
9
The relative contributions of speaking fundamental frequency and formant frequencies to gender identification based on isolated vowels.基于孤立元音的基频和共振峰频率在性别识别中的相对贡献。
J Voice. 2005 Dec;19(4):544-54. doi: 10.1016/j.jvoice.2004.10.006.
10
Gender Perception After Raising Vowel Fundamental and Formant Frequencies: Considerations for Oral Resonance Research.提高元音基频和共振峰频率后的性别感知:口腔共振研究的考量
J Voice. 2018 Sep;32(5):592-601. doi: 10.1016/j.jvoice.2017.06.023. Epub 2017 Aug 24.

引用本文的文献

1
Effects of laryngeal manipulations on voice gender perception.喉部操作对嗓音性别感知的影响。
Interspeech. 2022 Sep;2022:1856-1860. doi: 10.21437/interspeech.2022-10815.
2
Validation of the Language ENvironment Analysis (LENA) Automated Speech Processing Algorithm Labels for Adult and Child Segments in a Sample of Families From India.印度家庭样本中成人和儿童片段的语言环境分析(LENA)自动语音处理算法标签的验证
J Speech Lang Hear Res. 2025 Jan 2;68(1):40-53. doi: 10.1044/2024_JSLHR-24-00099. Epub 2024 Dec 5.
3
Word and Gender Identification in the Speech of Transgender Individuals.
跨性别者言语中的词汇与性别识别
J Voice. 2024 Jul 16. doi: 10.1016/j.jvoice.2024.06.007.
4
Principal dimensions of voice production and their role in vocal expression.发声的主要维度及其在声音表达中的作用。
J Acoust Soc Am. 2024 Jul 1;156(1):278-283. doi: 10.1121/10.0027913.
5
Social evaluative implications of sensory adaptation to human voices.对人类声音的感官适应的社会评价意义。
R Soc Open Sci. 2024 Mar 27;11:231348. doi: 10.1098/rsos.231348. eCollection 2024 Mar.
6
Artifact removal by template subtraction enables recordings of the frequency following response in cochlear-implant users.通过模板相减去除伪迹使得在人工耳蜗使用者中记录频率跟随反应成为可能。
Sci Rep. 2024 Mar 14;14(1):6158. doi: 10.1038/s41598-024-56047-9.
7
Evaluating speech-in-speech perception via a humanoid robot.通过人形机器人评估语音中语音的感知。
Front Neurosci. 2024 Feb 9;18:1293120. doi: 10.3389/fnins.2024.1293120. eCollection 2024.
8
Use of a humanoid robot for auditory psychophysical testing.使用类人机器人进行听觉心理物理学测试。
PLoS One. 2023 Dec 13;18(12):e0294328. doi: 10.1371/journal.pone.0294328. eCollection 2023.
9
The Jena Audiovisual Stimuli of Morphed Emotional Pseudospeech (JAVMEPS): A database for emotional auditory-only, visual-only, and congruent and incongruent audiovisual voice and dynamic face stimuli with varying voice intensities.《耶拿情感伪语音变声视听刺激库(JAVMEPS)》:一个包含情感纯听觉、纯视觉以及与声音强度变化的语音和动态面部相匹配和不匹配的视听声音刺激的数据库。
Behav Res Methods. 2024 Aug;56(5):5103-5115. doi: 10.3758/s13428-023-02249-4. Epub 2023 Oct 11.
10
Recognition of Sentences With Complex Syntax in Speech Babble by Adolescents With Normal Hearing or Cochlear Implants.正常听力青少年和人工耳蜗植入青少年识别言语噪声中复杂句法的句子。
J Speech Lang Hear Res. 2023 Mar 7;66(3):1110-1135. doi: 10.1044/2022_JSLHR-22-00407. Epub 2023 Feb 9.