• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用语音合成技术测量嗓音质量。

Measuring vocal quality with speech synthesis.

作者信息

Gerratt B R, Kreiman J

机构信息

Division of Head and Neck Surgery, UCLA School of Medicine, Los Angeles, California 90095-1794, USA.

出版信息

J Acoust Soc Am. 2001 Nov;110(5 Pt 1):2560-6. doi: 10.1121/1.1409969.

DOI:10.1121/1.1409969
PMID:11757945
Abstract

Much previous research has demonstrated that listeners do not agree well when using traditional rating scales to measure pathological voice quality. Although these findings may indicate that listeners are inherently unable to agree in their perception of such complex auditory stimuli, another explanation implicates the particular measurement method-rating scale judgments-as the culprit. An alternative method of assessing quality-listener-mediated analysis-synthesis-was devised to assess this possibility. In this new approach, listeners explicitly compare synthetic and natural voice samples, and adjust speech synthesizer parameters to create auditory matches to voice stimuli. This method is designed to replace unstable internal standards for qualities like breathiness and roughness with externally presented stimuli, to overcome major hypothetical sources of disagreement in rating scale judgments. In a preliminary test of the reliability of this method, listeners were asked to adjust the signal-to-noise ratio for 12 synthetic pathological voices so that the resulting stimuli matched the natural target voices as well as possible For comparison to the synthesis judgments, listeners also judged the noisiness of the natural stimuli in a separate task using a traditional visual-analog rating scale. For 9 of the 12 voices, agreement among listeners was significantly (and substantially) greater for the synthesis task than for the rating scale task. Response variances for the two tasks did not differ for the remaining three voices. However, a second experiment showed that the synthesis settings that listeners selected for these three voices were within a difference limen, and therefore observed differences were perceptually insignificant. These results indicate that listeners can in fact agree in their perceptual assessments of voice quality, and that analysis-synthesis can measure perception reliably.

摘要

此前的许多研究表明,在使用传统评分量表来衡量病理性嗓音质量时,听众之间的意见不太一致。尽管这些发现可能表明听众在感知此类复杂听觉刺激方面天生就无法达成一致,但另一种解释将问题归咎于特定的测量方法——评分量表判断。为了评估这种可能性,设计了一种评估嗓音质量的替代方法——听众介导的分析合成法。在这种新方法中,听众明确比较合成语音样本和自然语音样本,并调整语音合成器参数以创建与语音刺激的听觉匹配。这种方法旨在用外部呈现的刺激取代诸如呼吸声和粗糙声等质量的不稳定内部标准,以克服评分量表判断中主要的假设性分歧来源。在对该方法可靠性的初步测试中,要求听众调整12个合成病理性嗓音的信噪比,以使生成的刺激尽可能与自然目标嗓音匹配。为了与合成判断进行比较,听众还在一项单独任务中使用传统的视觉模拟评分量表对自然刺激的嘈杂程度进行了判断。对于12个嗓音中的9个,听众在合成任务中的一致性明显(且显著)高于评分量表任务。对于其余三个嗓音,两项任务的反应方差没有差异。然而,第二项实验表明,听众为这三个嗓音选择的合成设置在辨别阈限之内,因此观察到的差异在感知上并不显著。这些结果表明,听众实际上能够在对嗓音质量的感知评估上达成一致,并且分析合成法能够可靠地测量感知。

相似文献

1
Measuring vocal quality with speech synthesis.利用语音合成技术测量嗓音质量。
J Acoust Soc Am. 2001 Nov;110(5 Pt 1):2560-6. doi: 10.1121/1.1409969.
2
Sources of listener disagreement in voice quality assessment.语音质量评估中听众意见不一致的来源。
J Acoust Soc Am. 2000 Oct;108(4):1867-76. doi: 10.1121/1.1289362.
3
The Perception of Breathiness in the Voices of Pediatric Speakers.小儿说话者声音中呼吸音的感知。
J Voice. 2019 Mar;33(2):204-213. doi: 10.1016/j.jvoice.2017.09.024. Epub 2017 Nov 20.
4
Perception of aperiodicity in pathological voice.病理性嗓音中非周期性的感知
J Acoust Soc Am. 2005 Apr;117(4 Pt 1):2201-11. doi: 10.1121/1.1858351.
5
When and why listeners disagree in voice quality assessment tasks.听众在嗓音质量评估任务中出现分歧的时间及原因。
J Acoust Soc Am. 2007 Oct;122(4):2354-64. doi: 10.1121/1.2770547.
6
Testing the reliability of Grade, Roughness and Breathiness scores by means of synthetic speech stimuli.通过合成语音刺激来测试等级、粗糙度和呼吸声评分的可靠性。
Logoped Phoniatr Vocol. 2015 Apr;40(1):5-13. doi: 10.3109/14015439.2013.837502. Epub 2013 Oct 11.
7
Comparing internal and external standards in voice quality judgments.比较语音质量判断中的内部标准和外部标准。
J Speech Hear Res. 1993 Feb;36(1):14-20. doi: 10.1044/jshr.3601.14.
8
The multidimensional nature of pathologic vocal quality.病理性嗓音质量的多维度特性。
J Acoust Soc Am. 1994 Sep;96(3):1291-302. doi: 10.1121/1.410277.
9
The effect of anchors and training on the reliability of perceptual voice evaluation.锚定和训练对感知语音评估可靠性的影响。
J Speech Lang Hear Res. 2002 Feb;45(1):111-26. doi: 10.1044/1092-4388(2002/009).
10
Perception of synthesized voice quality in connected speech by Cantonese speakers.粤语使用者对连贯语音中合成语音质量的感知。
J Acoust Soc Am. 2002 Sep;112(3 Pt 1):1091-101. doi: 10.1121/1.1500753.

引用本文的文献

1
The influence of listener experience, measurement scale and speech task on the reliability of auditory-perceptual evaluation of vocal quality.聆听者经验、测量尺度和言语任务对嗓音质量听觉感知评估可靠性的影响。
Codas. 2024 Apr 15;36(3):e20230175. doi: 10.1590/2317-1782/20232023175. eCollection 2024.
2
Improving Perceptual Speech Ratings: The Effects of Auditory Training on Judgments of Dysarthric Speech.提高感知性言语评估:听觉训练对构音障碍言语判断的影响。
J Speech Lang Hear Res. 2023 Nov 9;66(11):4236-4258. doi: 10.1044/2023_JSLHR-23-00322. Epub 2023 Sep 29.
3
Effects of Vibratory Source on Auditory-Perceptual and Bio-Inspired Computational Measures of Pediatric Voice Quality.
振动源对儿童嗓音质量的听觉感知及生物启发式计算指标的影响
J Voice. 2023 Sep 20. doi: 10.1016/j.jvoice.2023.08.016.
4
Comparative Analysis of Two Methods of Perceptual Voice Assessment.两种感知语音评估方法的比较分析
J Voice. 2023 Mar 10. doi: 10.1016/j.jvoice.2023.01.005.
5
Psychometric properties associated with perceived vocal roughness using a matching task.使用匹配任务评估感知声音粗糙的心理测量学特性。
J Acoust Soc Am. 2013 Oct;134(4):EL294-300. doi: 10.1121/1.4819183.
6
Identifying a comparison for matching rough voice quality.识别匹配粗糙音质的比较对象。
J Speech Lang Hear Res. 2012 Oct;55(5):1407-22. doi: 10.1044/1092-4388(2012/11-0160). Epub 2012 Feb 21.
7
Listener effort for highly intelligible tracheoesophageal speech.高清晰度食管气管语音的听者努力程度。
J Commun Disord. 2012 May-Jun;45(3):235-45. doi: 10.1016/j.jcomdis.2012.01.001. Epub 2012 Jan 20.
8
A model for the prediction of breathiness in vowels.用于预测元音浊音度的模型。
J Acoust Soc Am. 2011 Mar;129(3):1605-15. doi: 10.1121/1.3543993.
9
Integrated software for analysis and synthesis of voice quality.语音质量分析与合成的集成软件。
Behav Res Methods. 2010 Nov;42(4):1030-41. doi: 10.3758/BRM.42.4.1030.
10
Perceptual distances of breathy voice quality: a comparison of psychophysical methods.可感知的气息声嗓音质量差异:心理物理方法的比较。
J Voice. 2010 Mar;24(2):168-77. doi: 10.1016/j.jvoice.2008.08.002. Epub 2009 Jan 29.