Suppr超能文献

人工耳蜗植入者对自然产生和合成的普通话语音的可懂度。

Intelligibility of naturally produced and synthesized Mandarin speech by cochlear implant listeners.

机构信息

Department of Otolaryngology, Head and Neck Surgery, Beijing TongRen Hospital, Capital Medical University, Beijing 100730, People's Republic of China.

House Ear Institute, Los Angeles, California 90057, USA.

出版信息

J Acoust Soc Am. 2018 May;143(5):2886. doi: 10.1121/1.5037590.

Abstract

Mandarin is a tonal language, and it is important to preserve lexical tone information in synthesized speech. With natural speech, Chinese cochlear implant (CI) users have difficulty perceiving voice pitch cues important for lexical tone perception; it is unclear whether this difficulty persists in Mandarin synthesized speech. In this study, intelligibility of naturally produced and synthesized Mandarin speech was measured in Chinese CI listeners; intelligibility was also measured in a control group of normal-hearing (NH) listeners. Five synthesized voices were selected to represent different talker genders (male, female, child), speaking rates (normal, slow), and speaking styles (emotional, accent). The data showed that while modern Mandarin text-to-speech (TTS) systems can provide perfect speech intelligibility for NH listeners, overall intelligibility was much poorer for CI than for NH listeners. CI performance was significantly poorer with synthesized speech than with natural speech (p < 0.001). CI listeners were highly sensitive to the "extra-atypical" synthesized emotional and accented speech. Performance with each of the synthesized speech types was significantly correlated with performance with natural speech in CI users (p < 0.01 in all cases). While modern TTS systems offer educational and communication benefits to CI users and hearing-impaired individuals, the selection of synthesized voices should be carefully considered in education applications of TTS for hearing-impaired individuals, especially CI children, since poor intelligibility performance may affect language learning.

摘要

普通话是一种声调语言,在合成语音中保留词汇声调信息非常重要。对于自然语音,中国人工耳蜗(CI)使用者很难感知到对词汇声调感知很重要的语音音高线索;在普通话合成语音中,这种困难是否仍然存在还不清楚。在这项研究中,测量了中国 CI 使用者对自然产生和合成的普通话语音的可理解度;还在正常听力(NH)对照组中测量了可理解度。选择了五个合成语音来代表不同的说话者性别(男性、女性、儿童)、说话速度(正常、慢)和说话风格(情绪化、口音)。数据表明,虽然现代普通话文语转换(TTS)系统可以为 NH 听众提供完美的语音可理解度,但 CI 听众的整体可理解度要比 NH 听众差得多。与自然语音相比,CI 听众对合成语音的表现明显更差(p<0.001)。CI 听众对“额外非典型”的合成情绪化和口音化语音非常敏感。与自然语音相比,CI 用户对每种合成语音类型的表现都与自然语音的表现显著相关(p<0.01)。虽然现代 TTS 系统为 CI 用户和听力受损者提供了教育和沟通方面的好处,但在 TTS 为听力受损者的教育应用中,应仔细考虑合成语音的选择,尤其是对于 CI 儿童,因为较差的可理解度表现可能会影响语言学习。

相似文献

3
Auditory performance and speech intelligibility of Mandarin-speaking children implanted before age 5.
Int J Pediatr Otorhinolaryngol. 2014 May;78(5):799-803. doi: 10.1016/j.ijporl.2014.02.014. Epub 2014 Feb 20.
5
Melodic pitch perception and lexical tone perception in Mandarin-speaking cochlear implant users.
Ear Hear. 2015 Jan;36(1):102-10. doi: 10.1097/AUD.0000000000000086.
6
Effects of speaking style on speech intelligibility for Mandarin-speaking cochlear implant users.
J Acoust Soc Am. 2011 Jun;129(6):EL242-7. doi: 10.1121/1.3582148.
7
Speech intelligibility of Mandarin-speaking deaf children with cochlear implants.
Int J Pediatr Otorhinolaryngol. 2005 Apr;69(4):505-11. doi: 10.1016/j.ijporl.2004.10.017. Epub 2005 Jan 5.
8
Early prelingual auditory development and speech perception at 1-year follow-up in Mandarin-speaking children after cochlear implantation.
Int J Pediatr Otorhinolaryngol. 2011 Nov;75(11):1418-26. doi: 10.1016/j.ijporl.2011.08.005. Epub 2011 Sep 3.
9
Time-compression thresholds for Mandarin sentences in normal-hearing and cochlear implant listeners.
Hear Res. 2019 Mar 15;374:58-68. doi: 10.1016/j.heares.2019.01.011. Epub 2019 Jan 31.
10
Masking release with changing fundamental frequency: Electric acoustic stimulation resembles normal hearing subjects.
Hear Res. 2017 Jul;350:226-234. doi: 10.1016/j.heares.2017.05.004. Epub 2017 May 11.

本文引用的文献

1
Validation of list equivalency for Mandarin speech materials to use with cochlear implant listeners.
Int J Audiol. 2017;56(sup2):S31-S40. doi: 10.1080/14992027.2016.1204564. Epub 2016 Jul 14.
3
Voice emotion recognition by cochlear-implanted children and their normally-hearing peers.
Hear Res. 2015 Apr;322:151-62. doi: 10.1016/j.heares.2014.10.003. Epub 2014 Oct 16.
6
Development and validation of the Mandarin speech perception test.
J Acoust Soc Am. 2011 Jun;129(6):EL267-73. doi: 10.1121/1.3590739.
7
Effects of speaking style on speech intelligibility for Mandarin-speaking cochlear implant users.
J Acoust Soc Am. 2011 Jun;129(6):EL242-7. doi: 10.1121/1.3582148.
8
Vocal emotion recognition by normal-hearing listeners and cochlear implant users.
Trends Amplif. 2007 Dec;11(4):301-15. doi: 10.1177/1084713807305301.
9
Auditory frequency discrimination learning is affected by stimulus variability.
Percept Psychophys. 2005 May;67(4):691-8. doi: 10.3758/bf03193525.
10
Clear speech perception in acoustic and electric hearing.
J Acoust Soc Am. 2004 Oct;116(4 Pt 1):2374-83. doi: 10.1121/1.1787528.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验