声学测量能否预测语音中的性别感知？

Can acoustic measurements predict gender perception in the voice?

机构信息

Department of Human Development, Universidade Estadual de Campinas-UNICAMP, Campinas, Brazil.

Department of Speech and Language Pathology, Federal University of Paraíba-UFPB, João Pessoa, Brazil.

出版信息

PLoS One. 2024 Nov 14;19(11):e0310794. doi: 10.1371/journal.pone.0310794. eCollection 2024.

DOI:10.1371/journal.pone.0310794

Abstract

PURPOSE

To determine if there is an association between vocal gender presentation and the gender and context of the listener.

METHOD

Quantitative and transversal study. 47 speakers of Brazilian Portuguese of different genders were recorded. Recordings included sustained vowel emission, connected speech, and the expressive recital of a poem. Subsequently, four scripts were used in Praat to extract 16 acoustic measurements related to prosody. Voices underwent Auditory-Perceptual Assessment (APA) of the gender presentation by 236 people [65 speech and language pathologist (SLP) with experience in the area of the voice (SLP), 101 cisgender people (CG), and 70 transgender and non-binary people (TNB)]. Gender presentation was evaluated by visual analogue scale. Agreement analyses were executed among quantitative variables and multiple linear regression models were generated to predict APA, taking the judge context/gender and speaker gender into consideration.

RESULTS

Acoustic analysis revealed that cis and transgender women had higher median fundamental frequency (fo) values than other genders. Cisgender women exhibited greater breathiness, while cisgender men showed more vocal quality deviations. In terms of APA, significant differences were observed among judge groups: SLP judged vowel samples differently from other groups, and TNB judged speech samples differently (both p<0.001). The predictive measures for the APA varied based on the sample type, speaker gender, and judge group. For vowel samples, only SLP judges had predictive measures (fo and ABI Jitter) for cisgender speakers. In number counting samples, predictive measures for cisgender speakers included fomed and HNR for CG judges, and fomed for both SLP and TNB judges. For transgender and non-binary speakers, predictive measures were fomed for CG and SLP judges, and fomed, CPPs, and ABI for TNB judges. In the poem recital task, predictive measures for cisgender speakers were fomed and HNR for both SLP and CG judges, with additional measures of cvint and sr for CG judges, and fomed, HNR, cvint, and fopeakwidth for TNB judges. For transgender and non-binary speakers, the predictive measures included a wider range of acoustic features such as fomed, fosd, sr, fomin, emph, HNR, Shimmer, and fo peakwidth for SLP judges, and fomed, fosd, sr, fomax, emph, HNR, and Shimmer for CG judges, while TNB judges considered fomed, sr, emph, fosd, Shimmer, HNR, Jitter, and fomax.

CONCLUSIONS

There is an association between the perception of gender presentation in the voice and the gender or context of the listener and the speaker. Transgender and non-binary judges diverged to a higher degree from cisgender and SLP judges. Compared to the evaluation of cisgender speakers, all judge groups used a greater number of acoustic measurements when analyzing the speech of transgender and non-binary individuals in the poem recital samples.

摘要

目的

确定嗓音性别表现与听众的性别和语境之间是否存在关联。

方法

定量和横向研究。记录了来自不同性别的 47 名巴西葡萄牙语说话者的持续性元音发音、连贯言语和富有表现力的诗歌朗诵。随后，使用 Praat 中的四个脚本提取与韵律相关的 16 个声学测量值。由 236 人（65 名具有嗓音领域经验的言语语言病理学家（SLP）、101 名顺性别者（CG）和 70 名跨性别和非二进制者（TNB））进行听觉感知评估（APA）来评估声音的性别表现。性别表现通过视觉模拟量表进行评估。对定量变量进行了一致性分析，并生成了多元线性回归模型，以考虑到法官的性别和语境/说话者的性别来预测 APA。

结果

声学分析表明，顺性别女性和跨性别女性的基频（fo）中位数值高于其他性别。顺性别女性表现出更高的呼吸音，而顺性别男性表现出更多的嗓音质量偏差。在 APA 方面，法官群体之间存在显著差异：SLP 法官对元音样本的判断与其他群体不同，TNB 法官对言语样本的判断也不同（均<0.001）。APA 的预测指标因样本类型、说话者性别和法官群体而异。对于元音样本，只有 SLP 法官对顺性别说话者具有预测指标（fo 和 ABI Jitter）。在数字计数样本中，顺性别说话者的预测指标包括 CG 法官的 fomed 和 HNR，以及 SLP 和 TNB 法官的 fomed。对于跨性别和非二进制说话者，CG 和 SLP 法官的预测指标包括 fomed，而 TNB 法官的预测指标包括 fomed、CPPs 和 ABI。在诗歌朗诵任务中，顺性别说话者的预测指标包括 SLP 和 CG 法官的 fomed 和 HNR，CG 法官的 cvint 和 sr，以及 TNB 法官的 fomed、HNR、cvint 和 fopeakwidth。对于跨性别和非二进制说话者，预测指标包括更广泛的声学特征，如 SLP 法官的 fomed、fosd、sr、fomin、emph、HNR、Shimmer 和 fo peakwidth，以及 CG 法官的 fomed、fosd、sr、 fomax、emph、HNR 和 Shimmer，而 TNB 法官则考虑了 fomed、sr、emph、fosd、Shimmer、HNR、Jitter 和 fomax。

结论

嗓音性别表现的感知与听众和说话者的性别或语境之间存在关联。跨性别和非二进制法官与顺性别和 SLP 法官的分歧更大。与顺性别说话者的评估相比，所有法官群体在分析诗歌朗诵样本中跨性别和非二进制个体的言语时，使用了更多的声学测量值。

相似文献

Can acoustic measurements predict gender perception in the voice?声学测量能否预测语音中的性别感知？

PLoS One. 2024 Nov 14;19(11):e0310794. doi: 10.1371/journal.pone.0310794. eCollection 2024.

Auditory-Perceptual Assessment and Acoustic Analysis of Gender Expression in the Voice.嗓音中性别表达的听觉感知评估与声学分析

J Voice. 2024 Feb 8. doi: 10.1016/j.jvoice.2023.12.024.

Prosodic Differences in the Voices of Transgender and Cisgender Women: Self-Perception of Voice - An Auditory and Acoustic Analysis.跨性别女性和 cisgender 女性声音的韵律差异：对声音的自我感知 - 听觉和声学分析。

J Voice. 2024 Jul;38(4):844-857. doi: 10.1016/j.jvoice.2021.12.020. Epub 2022 Feb 5.

Acoustic Predictors of Gender Attribution, Masculinity-Femininity, and Vocal Naturalness Ratings Amongst Transgender and Cisgender Speakers.跨性别者和顺性别者中性别归因、男性气质-女性气质及嗓音自然度评分的声学预测因素

J Voice. 2020 Mar;34(2):300.e11-300.e26. doi: 10.1016/j.jvoice.2018.10.002. Epub 2018 Nov 28.

Perceptual-Auditory and Acoustical Analysis of the Voices of Transgender Women.跨性别女性声音的感知听觉与声学分析

J Voice. 2018 Sep;32(5):602-608. doi: 10.1016/j.jvoice.2017.07.003. Epub 2017 Sep 28.

Perceptual-Auditory and Acoustic Analysis of Breathiness in Cis and Transgender Men and Women.顺性别和跨性别男性与女性呼吸声的感知听觉与声学分析。

J Voice. 2024 Mar 30. doi: 10.1016/j.jvoice.2024.02.015.

Cepstral analysis of hypokinetic and ataxic voices: correlations with perceptual and other acoustic measures.运动减退性和共济失调性嗓音的谐波倒谱分析：与感知及其他声学指标的相关性

J Voice. 2014 Nov;28(6):673-80. doi: 10.1016/j.jvoice.2014.01.013. Epub 2014 May 16.

Effects of Fundamental Frequency, Vocal Intensity, Sample Duration, and Vowel Context in Cepstral and Spectral Measures of Dysphonic Voices.基频、发声强度、样本时长及元音语境对嗓音障碍语音的倒谱和频谱测量的影响

J Speech Lang Hear Res. 2020 May 22;63(5):1326-1339. doi: 10.1044/2020_JSLHR-19-00049. Epub 2020 Apr 29.

Evaluation of Acoustic Analyses of Voice in Nonoptimized Conditions.非优化条件下嗓音声学分析评估。

J Speech Lang Hear Res. 2020 Dec 14;63(12):3991-3999. doi: 10.1044/2020_JSLHR-20-00212. Epub 2020 Nov 13.

Acoustic Perturbation Measures Improve with Increasing Vocal Intensity in Individuals With and Without Voice Disorders.无论是否患有嗓音障碍，随着嗓音强度增加，声学微扰指标均有所改善。

J Voice. 2018 Mar;32(2):162-168. doi: 10.1016/j.jvoice.2017.04.008. Epub 2017 May 18.

引用本文的文献

An acoustic model of speech dysprosody in patients with Parkinson's disease.帕金森病患者言语韵律障碍的声学模型。

Front Hum Neurosci. 2025 Apr 28;19:1566274. doi: 10.3389/fnhum.2025.1566274. eCollection 2025.

本文引用的文献

Auditory-Perceptual Assessment and Acoustic Analysis of Gender Expression in the Voice.嗓音中性别表达的听觉感知评估与声学分析

J Voice. 2024 Feb 8. doi: 10.1016/j.jvoice.2023.12.024.

Gender Attributions by Cisgender and Gender Diverse Listeners Rating Vowels, Reading, and Monologues.顺性别和性别多元听众对元音、朗读和独白的性别归因。

J Voice. 2023 Nov 14. doi: 10.1016/j.jvoice.2023.09.011.

Reducing the GAP between science and clinic: lessons from academia and professional practice - part A: perceptual-auditory judgment of vocal quality, acoustic vocal signal analysis and voice self-assessment.缩小科学与临床之间的差距：学术和专业实践的经验教训 - 第 A 部分：声音质量的知觉 - 听觉判断、声学嗓音信号分析和嗓音自我评估。

Codas. 2022 Aug 1;34(5):e20210240. doi: 10.1590/2317-1782/20212021240pt. eCollection 2022.

Meta-Analysis on the Validity of the Acoustic Voice Quality Index.基于声学嗓音质量指数的有效性的元分析

J Voice. 2024 Nov;38(6):1527.e1-1527.e19. doi: 10.1016/j.jvoice.2022.04.022. Epub 2022 Jun 23.

Acoustic Voice Quality Index (AVQI) in the Measurement of Voice Quality: A Systematic Review and Meta-Analysis.嗓音声学质量指数（AVQI）在嗓音质量测量中的应用：系统评价和荟萃分析。

J Voice. 2024 Sep;38(5):1055-1069. doi: 10.1016/j.jvoice.2022.03.018. Epub 2022 Apr 20.

Effect of Anchor Term on Auditory-Perceptual Ratings of Feminine and Masculine Speakers.锚定词对女性和男性说话者听觉感知评分的影响。

J Speech Lang Hear Res. 2022 Jun 8;65(6):2064-2080. doi: 10.1044/2022_JSLHR-21-00476. Epub 2022 Apr 22.

Perceived Gender and Client Satisfaction in Transgender Voice Work: Comparing Self and Listener Rating Scales across a Training Program.跨性别者嗓音工作中的感知性别与客户满意度：培训计划中自我评估和听众评估量表的比较。

Folia Phoniatr Logop. 2022;74(5):364-379. doi: 10.1159/000521226. Epub 2021 Nov 30.

The Influence of Linguistic Bias Upon Speech-Language Pathologists' Attitudes Toward Clinical Scenarios Involving Nonstandard Dialects of English.语言偏见对言语语言病理学家对待涉及非标准英语方言临床场景态度的影响。

Am J Speech Lang Pathol. 2021 Sep 23;30(5):1973-1989. doi: 10.1044/2021_AJSLP-20-00382. Epub 2021 Aug 31.

Proposal of the vocal attendance protocol and vocal redesignation program in the services of the transsexualizing process.跨性别者服务中的发声出席协议和发声再指定方案的提案。

Codas. 2021 Apr 12;33(1):e20190188. doi: 10.1590/2317-1782/20202019188. eCollection 2021.

Deep Learning for Voice Gender Identification: Proof-of-concept for Gender-Affirming Voice Care.深度学习在语音性别识别中的应用：用于性别肯定型嗓音护理的概念验证。

Laryngoscope. 2021 May;131(5):E1611-E1615. doi: 10.1002/lary.29281. Epub 2020 Nov 21.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

声学测量能否预测语音中的性别感知？

Can acoustic measurements predict gender perception in the voice?

机构信息

出版信息

PURPOSE

METHOD

RESULTS

CONCLUSIONS

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献