• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Perceptual Error Analysis of Human and Synthesized Voices.

作者信息

Englert Marina, Madazio Glaucya, Gielow Ingrid, Lucero Jorge, Behlau Mara

机构信息

Universidade Federal de São Paulo, São Paulo, Brazil; Centro de Estudos da Voz-CEV, São Paulo, Brazil.

Centro de Estudos da Voz-CEV, São Paulo, Brazil.

出版信息

J Voice. 2017 Jul;31(4):516.e5-516.e18. doi: 10.1016/j.jvoice.2016.12.015. Epub 2017 Jan 12.

DOI:10.1016/j.jvoice.2016.12.015
PMID:28089485
Abstract

OBJECTIVE/HYPOTHESIS: To assess the quality of synthesized voices through listeners' skills in discriminating human and synthesized voices.

STUDY DESIGN

Prospective study.

METHODS

Eighteen human voices with different types and degrees of deviation (roughness, breathiness, and strain, with three degrees of deviation: mild, moderate, and severe) were selected by three voice specialists. Synthesized samples with the same deviations of human voices were produced by the VoiceSim system. The manipulated parameters were vocal frequency perturbation (roughness), additive noise (breathiness), increasing tension, subglottal pressure, and decreasing vocal folds separation (strain). Two hundred sixty-nine listeners were divided in three groups: voice specialist speech language pathologists (V-SLPs), general clinician SLPs (G-SLPs), and naive listeners (NLs). The SLP listeners also indicated the type and degree of deviation.

RESULTS

The listeners misclassified 39.3% of the voices, both synthesized (42.3%) and human (36.4%) samples (P = 0.001). V-SLPs presented the lowest error percentage considering the voice nature (34.6%); G-SLPs and NLs identified almost half of the synthesized samples as human (46.9%, 45.6%). The male voices were more susceptible for misidentification. The synthesized breathy samples generated a greater perceptual confusion. The samples with severe deviation seemed to be more susceptible for errors. The synthesized female deviations were correctly classified. The male breathiness and strain were identified as roughness.

CONCLUSION

VoiceSim produced stimuli very similar to the voices of patients with dysphonia. V-SLPs had a better ability to classify human and synthesized voices. VoiceSim is better to simulate vocal breathiness and female deviations; the male samples need adjustment.

摘要

相似文献

1
Perceptual Error Analysis of Human and Synthesized Voices.
J Voice. 2017 Jul;31(4):516.e5-516.e18. doi: 10.1016/j.jvoice.2016.12.015. Epub 2017 Jan 12.
2
Perceptual Error Identification of Human and Synthesized Voices.人类声音与合成声音的感知错误识别
J Voice. 2016 Sep;30(5):639.e17-23. doi: 10.1016/j.jvoice.2015.07.017. Epub 2015 Aug 31.
3
Auditory Perception of Roughness and Breathiness by Dysphonic Women.嗓音粗糙和气息声感知的研究:嗓音障碍女性的感知。
J Voice. 2024 Sep;38(5):1249.e1-1249.e18. doi: 10.1016/j.jvoice.2022.01.005. Epub 2022 Jan 23.
4
The Influence of Native Language on Auditory-Perceptual Evaluation of Vocal Samples Completed by Brazilian and Canadian SLPs.母语对巴西和加拿大语言病理学家完成的语音样本听觉感知评估的影响。
J Voice. 2017 Mar;31(2):258.e1-258.e5. doi: 10.1016/j.jvoice.2016.05.021. Epub 2016 Jul 11.
5
Effect of Auditory-Perceptual Training With Natural Voice Anchors on Vocal Quality Evaluation.基于自然语音基准的听觉感知训练对嗓音质量评估的影响
J Voice. 2019 Mar;33(2):220-225. doi: 10.1016/j.jvoice.2017.10.020. Epub 2018 Jan 10.
6
Performance of the phonatory deviation diagram in the evaluation of rough and breathy synthesized voices.发声偏差图在粗糙和呼吸声合成语音评估中的性能
Braz J Otorhinolaryngol. 2018 Jul-Aug;84(4):460-472. doi: 10.1016/j.bjorl.2017.05.012. Epub 2017 Jul 5.
7
The role of listener experience on Consensus Auditory-perceptual Evaluation of Voice (CAPE-V) ratings of postthyroidectomy voice.听众体验在甲状腺切除术后嗓音的共识性听觉感知评估(CAPE-V)评分中的作用。
Am J Speech Lang Pathol. 2010 Aug;19(3):248-58. doi: 10.1044/1058-0360(2010/09-0012). Epub 2010 May 19.
8
Predictive Factors of Listeners' Attitudes Related to Dysphonic Voices in Native Brazilian Portuguese.巴西葡萄牙语母语者对嗓音障碍语音听众态度的预测因素
J Voice. 2025 May;39(3):849.e9-849.e25. doi: 10.1016/j.jvoice.2022.11.028. Epub 2022 Dec 13.
9
Severity of voice disorders: integration of perceptual and acoustic data in dysphonic patients.嗓音障碍的严重程度:发音障碍患者感知和声学数据的整合
Codas. 2014 Sep-Oct;26(5):382-8. doi: 10.1590/2317-1782/20142013033.
10
Auditory-perceptual Assessment of Healthy and Disordered Voices Using the Voice Deviation Scale.嗓音障碍患者的听觉感知评估——嗓音障碍严重度量表的应用
J Voice. 2024 May;38(3):654-659. doi: 10.1016/j.jvoice.2021.10.017. Epub 2021 Dec 11.