一种基于听觉反馈的语音产生神经网络模型，该模型对发音系统大小和形状的发育变化具有鲁棒性。

An auditory-feedback-based neural network model of speech production that is robust to developmental changes in the size and shape of the articulatory system.

作者信息

Callan D E, Kent R D, Guenther F H, Vorperian H K

机构信息

ATR Human Information Processing Research Laboratories, Kyoto, Japan.

出版信息

J Speech Lang Hear Res. 2000 Jun;43(3):721-36. doi: 10.1044/jslhr.4303.721.

DOI:10.1044/jslhr.4303.721

PMID:10877441

Abstract

The purpose of this article is to demonstrate that self-produced auditory feedback is sufficient to train a mapping between auditory target space and articulator space under conditions in which the structures of speech production are undergoing considerable developmental restructuring. One challenge for competing theories that propose invariant constriction targets is that it is unclear what teaching signal could specify constriction location and degree so that a mapping between constriction target space and articulator space can be learned. It is predicted that a model trained by auditory feedback will accomplish speech goals, in auditory target space, by continuously learning to use different articulator configurations to adapt to the changing acoustic properties of the vocal tract during development. The Maeda articulatory synthesis part of the DIVA neural network model (Guenther et al., 1998) was modified to reflect the development of the vocal tract by using measurements taken from MR images of children. After training, the model was able to maintain the 11 English vowel targets in auditory planning space, utilizing varying articulator configurations, despite morphological changes that occur during development. The vocal-tract constriction pattern (derived from the vocal-tract area function) as well as the formant values varied during the course of development in correspondence with morphological changes in the structures involved with speech production. Despite changes in the acoustical properties of the vocal tract that occur during the course of development, the model was able to demonstrate motor-equivalent speech production under lip-restriction conditions. The model accomplished this in a self-organizing manner even though there was no prior experience with lip restriction during training.

摘要

本文的目的是证明，在言语产生结构正在经历相当大的发育重构的情况下，自我产生的听觉反馈足以训练听觉目标空间与发音器官空间之间的映射。对于提出不变收缩目标的竞争理论而言，一个挑战在于不清楚何种教学信号能够指定收缩位置和程度，以便能够学习收缩目标空间与发音器官空间之间的映射。据预测，通过听觉反馈训练的模型将在听觉目标空间中通过持续学习使用不同的发音器官配置来适应发育过程中声道不断变化的声学特性，从而实现言语目标。DIVA神经网络模型（Guenther等人，1998）的前田发音合成部分通过使用从儿童的磁共振图像测量得到的数据进行了修改，以反映声道的发育情况。训练后，尽管在发育过程中发生了形态变化，但该模型能够利用不同的发音器官配置在听觉规划空间中维持11个英语元音目标。声道收缩模式（源自声道面积函数）以及共振峰值在发育过程中随着与言语产生相关结构的形态变化而变化。尽管在发育过程中声道的声学特性发生了变化，但该模型能够在唇部受限条件下展示出运动等效的言语产生。即使在训练期间没有唇部受限的先验经验，该模型也以自组织的方式实现了这一点。

相似文献

An auditory-feedback-based neural network model of speech production that is robust to developmental changes in the size and shape of the articulatory system.一种基于听觉反馈的语音产生神经网络模型，该模型对发音系统大小和形状的发育变化具有鲁棒性。

J Speech Lang Hear Res. 2000 Jun;43(3):721-36. doi: 10.1044/jslhr.4303.721.

Acoustic measurements of articulator motions.发音器官运动的声学测量。

Phonetica. 1979;36(4-5):302-13. doi: 10.1159/000259968.

High-Resolution, Non-Invasive Imaging of Upper Vocal Tract Articulators Compatible with Human Brain Recordings.与人类脑记录兼容的上声道发音器官的高分辨率无创成像。

PLoS One. 2016 Mar 28;11(3):e0151327. doi: 10.1371/journal.pone.0151327. eCollection 2016.

Role of vocal tract morphology in speech development: perceptual targets and sensorimotor maps for synthesized French vowels from birth to adulthood.声道形态在语音发展中的作用：从出生到成年合成法语元音的感知目标和感觉运动图谱。

J Speech Lang Hear Res. 2004 Oct;47(5):1059-80. doi: 10.1044/1092-4388(2004/079).

The acoustical significance of tongue, lip, and larynx maneuvers in rounded palatal vowels.舌、唇和喉部动作在圆唇后元音中的声学意义。

J Acoust Soc Am. 1986 Aug;80(2):391-401. doi: 10.1121/1.394090.

Vocal tract normalization for midsagittal articulatory recovery with analysis-by-synthesis.基于合成分析的矢状面中部发音恢复的声道归一化

J Acoust Soc Am. 1999 Aug;106(2):1090-105. doi: 10.1121/1.427117.

Acquisition of vowel articulation in childhood investigated by acoustic-to-articulatory inversion.通过声学-发音反演研究儿童元音发音的习得。

Infant Behav Dev. 2017 Feb;46:178-193. doi: 10.1016/j.infbeh.2017.01.007. Epub 2017 Feb 20.

The relations between area functions and the acoustic signal.面积函数与声学信号之间的关系。

Phonetica. 1980;37(1-2):55-86. doi: 10.1159/000259983.

Dynamic consequences of differences in male and female vocal tract dimensions.男性和女性声道尺寸差异的动态影响。

J Acoust Soc Am. 2001 May;109(5 Pt 1):2153-64. doi: 10.1121/1.1356020.

Modeling the effect of palate shape on the articulatory-acoustics mapping.建立腭形对发音声学映射影响的模型。

J Acoust Soc Am. 2018 Jul;144(1):EL71. doi: 10.1121/1.5048043.

引用本文的文献

Auditory-motor adaptation and de-adaptation for speech depend more on time in the new environment than on the amount of practice.语音的听觉-运动适应和去适应更多地取决于在新环境中的时间，而非练习量。

Commun Psychol. 2025 Aug 18;3(1):127. doi: 10.1038/s44271-025-00304-8.

Speech sensorimotor relationships in francophone preschoolers and adults: Adaptation to real-time auditory feedback perturbations.法语学龄前儿童和成人的言语运动感觉关系：对实时听觉反馈干扰的适应。

PLoS One. 2024 Aug 22;19(8):e0306246. doi: 10.1371/journal.pone.0306246. eCollection 2024.

How to cut the pie is no piece of cake: Toward a process-oriented approach to assessment and diagnosis of speech sound disorders.如何分蛋糕并非易事：迈向言语语音障碍评估和诊断的过程导向方法。

Int J Lang Commun Disord. 2024 Nov-Dec;59(6):2158-2180. doi: 10.1111/1460-6984.12934. Epub 2023 Jul 22.

Neurocomputational modeling of speech motor development.言语运动发育的神经计算建模。

J Child Lang. 2023 Nov;50(6):1318-1335. doi: 10.1017/S0305000923000260. Epub 2023 Jun 20.

Effects of Gradual and Sudden Introduction of Perturbations on Adaptive Responses to Formant-Shift and Formant-Clamp Perturbations.渐变和突发扰动对频率调制和频率钳制扰动自适应响应的影响。

J Speech Lang Hear Res. 2023 May 9;66(5):1588-1599. doi: 10.1044/2023_JSLHR-21-00435. Epub 2023 Apr 14.

Pediatric Responses to Fundamental and Formant Frequency Altered Auditory Feedback: A Scoping Review.儿童对基频和共振峰频率改变的听觉反馈的反应：一项范围综述。

Front Hum Neurosci. 2022 May 17;16:858863. doi: 10.3389/fnhum.2022.858863. eCollection 2022.

Effect of auditory feedback on speech intelligibility of adults with cochlear implants.听觉反馈对人工耳蜗植入者言语可懂度的影响。

Eur Arch Otorhinolaryngol. 2022 Sep;279(9):4345-4351. doi: 10.1007/s00405-021-07189-3. Epub 2021 Nov 27.

Auditory feedback experience in the development of phonetic production: Evidence from preschoolers with cochlear implants and their normal-hearing peers.语音产生发展中的听觉反馈体验：来自植入人工耳蜗的学龄前儿童及其听力正常的同龄人证据。

J Acoust Soc Am. 2021 Sep;150(3):2256. doi: 10.1121/10.0005884.

Dissociated Development of Speech and Limb Sensorimotor Learning in Stuttering: Speech Auditory-motor Learning is Impaired in Both Children and Adults Who Stutter.口吃中言语与肢体感觉运动学习的分离式发展：口吃儿童和成人的言语听觉运动学习均受损。

Neuroscience. 2020 Dec 15;451:1-21. doi: 10.1016/j.neuroscience.2020.10.014. Epub 2020 Oct 20.

The Relation of Articulatory and Vocal Auditory-Motor Control in Typical Speakers.典型发音者的发音与发声听觉-运动控制的关系。

J Speech Lang Hear Res. 2020 Nov 13;63(11):3628-3642. doi: 10.1044/2020_JSLHR-20-00192. Epub 2020 Oct 20.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种基于听觉反馈的语音产生神经网络模型，该模型对发音系统大小和形状的发育变化具有鲁棒性。

An auditory-feedback-based neural network model of speech production that is robust to developmental changes in the size and shape of the articulatory system.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献