National Institute of Education, Nanyang Technological University, Singapore.
Department of Communication Sciences and Disorders, University of Minnesota Duluth, Duluth, MN, USA.
Augment Altern Commun. 2023 Dec;39(4):208-218. doi: 10.1080/07434618.2023.2181213. Epub 2023 Mar 27.
Voice banking involves recording an inventory of sentences produced via natural speech. The recordings are used to create a synthetic text-to-speech voice that can be installed on speech-generating devices. This study highlights a minimally researched, clinically relevant issue surrounding the development and evaluation of Singaporean-accented English synthetic voices that were created using readily available voice banking software and hardware. Processes used to create seven unique synthetic voices that produce Singaporean-accented English, and the development of a custom Singaporean Colloquial English (SCE) recording inventory, are reviewed. The perspectives of adults who spoke SCE and banked their voices for this project are summarized and were generally positive. Finally, 100 adults familiar with SCE participated in an experiment that evaluated the intelligibility and naturalness of the Singaporean-accented synthetic voices, as well as the effect of the SCE custom inventory on listener preferences. The addition of the custom SCE inventory did not affect intelligibility or naturalness of the synthetic speech, and listeners tended to prefer the voice created with the SCE inventory when the stimulus was an SCE passage. The procedures used in this project may be helpful for interventionists who wish to create synthetic voices with accents that are not commercially available.
声库是指录制通过自然语音生成的句子的库存。这些录音被用来创建一个可以安装在语音生成设备上的合成语音。本研究突出了一个在开发和评估使用现成的声库软件和硬件创建的新加坡口音英语合成语音时,鲜为人知但具有临床意义的问题。本研究回顾了创建七个独特的产生新加坡口音英语的合成语音的过程,以及定制新加坡口语英语(SCE)录音库的开发过程。总结了参与该项目并将其声音存入声库的讲 SCE 的成年人的观点,总体上是积极的。最后,100 名熟悉 SCE 的成年人参与了一项实验,评估了新加坡口音的合成语音的可理解性和自然度,以及 SCE 定制库对听众偏好的影响。添加定制的 SCE 库存不会影响合成语音的可理解性或自然度,并且当刺激是 SCE 段落时,听众倾向于更喜欢使用 SCE 库存创建的语音。本项目中使用的程序可能对希望创建具有商业上不可用口音的合成语音的干预者有帮助。