• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类声音与合成声音的感知错误识别

Perceptual Error Identification of Human and Synthesized Voices.

作者信息

Englert Marina, Madazio Glaucya, Gielow Ingrid, Lucero Jorge, Behlau Mara

机构信息

Department of Speech Language Pathology and Audiology, Universidade Federal de São Paulo, São Paulo, Brazil; Voice Department, Centro de Estudos da Voz-CEV, São Paulo, Brazil.

Voice Department, Centro de Estudos da Voz-CEV, São Paulo, Brazil.

出版信息

J Voice. 2016 Sep;30(5):639.e17-23. doi: 10.1016/j.jvoice.2015.07.017. Epub 2015 Aug 31.

DOI:10.1016/j.jvoice.2015.07.017
PMID:26337775
Abstract

OBJECTIVES/HYPOTHESIS: To verify the discriminatory ability of human and synthesized voice samples.

STUDY DESIGN

This is a prospective study.

METHODS

A total of 70 subjects, 20 voice specialist speech-language pathologists (V-SLPs), 20 general SLPs (G-SLPs), and 30 naive listeners (NLs) participated of a listening task that was simply to classify the stimuli as human or synthesized. Samples of 36 voices, 18 human and 18 synthesized vowels, male and female (9 each), with different type and degree of deviation, were presented with 50% of repetition to verify intrarater consistency. Human voices were collected from a vocal clinic database. Voice disorders were simulated by perturbations of vocal frequency, jitter (roughness), additive noise (breathiness) and by increasing tension and decreasing separation of the vocal folds (strain).

RESULTS

The average amount of error considering all groups was 37.8%, 31.9% for V-SLP, 39.3% for G-SLP, and 40.8% for NL. V-SLP had smaller mean percentage error for synthesized (24.7%), breathy (36.7%), synthesized breathy (30.8%), and tense (25%) and female (27.5%) voices. G-SLP and NL presented equal mean percentage error for all voices classification. All groups together presented no difference on the mean percentage error between human and synthesized voices (P value = 0.452).

CONCLUSIONS

The quality of synthesized samples was very high. V-SLP presented a lower amount of error, which allows us to infer that auditory training assists on vocal analysis tasks.

摘要

目的/假设:验证人类语音样本和合成语音样本的辨别能力。

研究设计

这是一项前瞻性研究。

方法

共有70名受试者参与了一项听力任务,其中包括20名嗓音专科言语语言病理学家(V-SLP)、20名普通言语语言病理学家(G-SLP)和30名普通听众(NL),任务仅仅是将刺激声音分类为人类语音或合成语音。呈现了36个语音样本,其中18个是人类语音,18个是合成元音,包括男性和女性(各9个),具有不同类型和程度的偏差,以50%的重复率呈现以验证评分者内部一致性。人类语音从一个嗓音诊所数据库中收集。通过改变嗓音频率、抖动(粗糙度)、加性噪声(呼吸声)以及增加声带张力和减小声带间距(紧张度)来模拟嗓音障碍。

结果

考虑所有组的平均错误率为37.8%,V-SLP为31.9%,G-SLP为39.3%,NL为40.8%。V-SLP在合成语音(24.7%)、呼吸声语音(36.7%)、合成呼吸声语音(30.8%)、紧张语音(25%)和女性语音(27.5%)方面的平均百分比错误较小。G-SLP和NL在所有语音分类中的平均百分比错误相同。所有组在人类语音和合成语音之间的平均百分比错误上没有差异(P值 = 0.452)。

结论

合成样本的质量非常高。V-SLP的错误率较低,这使我们能够推断听觉训练有助于嗓音分析任务。

相似文献

1
Perceptual Error Identification of Human and Synthesized Voices.人类声音与合成声音的感知错误识别
J Voice. 2016 Sep;30(5):639.e17-23. doi: 10.1016/j.jvoice.2015.07.017. Epub 2015 Aug 31.
2
Perceptual Error Analysis of Human and Synthesized Voices.
J Voice. 2017 Jul;31(4):516.e5-516.e18. doi: 10.1016/j.jvoice.2016.12.015. Epub 2017 Jan 12.
3
Auditory Perception of Roughness and Breathiness by Dysphonic Women.嗓音粗糙和气息声感知的研究:嗓音障碍女性的感知。
J Voice. 2024 Sep;38(5):1249.e1-1249.e18. doi: 10.1016/j.jvoice.2022.01.005. Epub 2022 Jan 23.
4
Auditory-perceptual Evaluation of Normal and Dysphonic Voices Using the Voice Deviation Scale.使用嗓音偏差量表对正常嗓音和发声障碍嗓音进行听觉-感知评估。
J Voice. 2017 Jan;31(1):67-71. doi: 10.1016/j.jvoice.2016.01.004. Epub 2016 Feb 9.
5
Validation of the Acoustic Voice Quality Index in the Lithuanian Language.立陶宛语声学语音质量指数的验证。
J Voice. 2017 Mar;31(2):257.e1-257.e11. doi: 10.1016/j.jvoice.2016.06.002. Epub 2016 Jul 15.
6
Effect of Auditory-Perceptual Training With Natural Voice Anchors on Vocal Quality Evaluation.基于自然语音基准的听觉感知训练对嗓音质量评估的影响
J Voice. 2019 Mar;33(2):220-225. doi: 10.1016/j.jvoice.2017.10.020. Epub 2018 Jan 10.
7
The Influence of Native Language on Auditory-Perceptual Evaluation of Vocal Samples Completed by Brazilian and Canadian SLPs.母语对巴西和加拿大语言病理学家完成的语音样本听觉感知评估的影响。
J Voice. 2017 Mar;31(2):258.e1-258.e5. doi: 10.1016/j.jvoice.2016.05.021. Epub 2016 Jul 11.
8
Perceptual and Quantitative Assessment of Dysphonia Across Vowel Categories.在不同元音类别下的嗓音障碍的感知和定量评估。
J Voice. 2019 Jul;33(4):473-481. doi: 10.1016/j.jvoice.2017.12.018. Epub 2018 May 24.
9
Effect of Performance Time of the Semi-Occluded Vocal Tract Exercises in Dysphonic Children.半闭塞声道练习执行时间对嗓音障碍儿童的影响。
J Voice. 2017 May;31(3):329-335. doi: 10.1016/j.jvoice.2016.05.011. Epub 2016 Sep 19.
10
Severity of voice disorders: integration of perceptual and acoustic data in dysphonic patients.嗓音障碍的严重程度:发音障碍患者感知和声学数据的整合
Codas. 2014 Sep-Oct;26(5):382-8. doi: 10.1590/2317-1782/20142013033.

引用本文的文献

1
Performance of the phonatory deviation diagram in the evaluation of rough and breathy synthesized voices.发声偏差图在粗糙和呼吸声合成语音评估中的性能
Braz J Otorhinolaryngol. 2018 Jul-Aug;84(4):460-472. doi: 10.1016/j.bjorl.2017.05.012. Epub 2017 Jul 5.