• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于语音产生中声源神经肌肉控制的深度学习

Deep Learning for Neuromuscular Control of Vocal Source for Voice Production.

作者信息

Palaparthi Anil, Alluri Rishi K, Titze Ingo R

机构信息

Utah Center for Vocology, University of Utah, Salt Lake City, UT 84112, USA.

School of Biological Sciences, University of Utah, Salt Lake City, UT 84112, USA.

出版信息

Appl Sci (Basel). 2024 Jan;14(2). doi: 10.3390/app14020769. Epub 2024 Jan 16.

DOI:10.3390/app14020769
PMID:39071945
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11281313/
Abstract

A computational neuromuscular control system that generates lung pressure and three intrinsic laryngeal muscle activations (cricothyroid, thyroarytenoid, and lateral cricoarytenoid) to control the vocal source was developed. In the current study, , a biophysical computational model of the vocal system was used as the physical plant. In the , a three-mass vocal fold model was used to simulate self-sustained vocal fold oscillation. A constant/ǝ/vowel was used for the vocal tract shape. The trachea was modeled after MRI measurements. The neuromuscular control system generates control parameters to achieve four acoustic targets (fundamental frequency, sound pressure level, normalized spectral centroid, and signal-to-noise ratio) and four somatosensory targets (vocal fold length, and longitudinal fiber stress in the three vocal fold layers). The deep-learning-based control system comprises one acoustic feedforward controller and two feedback (acoustic and somatosensory) controllers. Fifty thousand steady speech signals were generated using the for training the control system. The results demonstrated that the control system was able to generate the lung pressure and the three muscle activations such that the four acoustic and four somatosensory targets were reached with high accuracy. After training, the motor command corrections from the feedback controllers were minimal compared to the feedforward controller except for thyroarytenoid muscle activation.

摘要

开发了一种计算神经肌肉控制系统,该系统可产生肺压和三种喉内肌激活(环甲肌、甲杓肌和环杓侧肌)来控制声源。在当前研究中,使用了一个语音系统的生物物理计算模型作为物理对象。在该模型中,使用了一个三质量声带模型来模拟声带的自持振荡。使用恒定的/ǝ/元音来模拟声道形状。气管是根据MRI测量结果建模的。神经肌肉控制系统生成控制参数,以实现四个声学目标(基频、声压级、归一化频谱质心和信噪比)和四个体感目标(声带长度以及三层声带中的纵向纤维应力)。基于深度学习的控制系统包括一个声学前馈控制器和两个反馈(声学和体感)控制器。使用该模型生成了50000个稳定的语音信号来训练控制系统。结果表明,该控制系统能够产生肺压和三种肌肉激活,从而高精度地实现四个声学目标和四个体感目标。训练后,除了甲杓肌激活外,与前馈控制器相比,反馈控制器的运动指令校正最小。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/37c72f031879/nihms-1959452-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/bc34d5a3b229/nihms-1959452-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/b2dcc46aefd8/nihms-1959452-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/c9cf98f829a4/nihms-1959452-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/1d7ca2ab3f9f/nihms-1959452-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/4fa4e6d5e7c2/nihms-1959452-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/3bdd8d7a3e0a/nihms-1959452-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/7877f892b997/nihms-1959452-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/b62a734910e2/nihms-1959452-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/e0a90b6ffb69/nihms-1959452-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/2289c2f5d556/nihms-1959452-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/b188b63ec840/nihms-1959452-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/37c72f031879/nihms-1959452-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/bc34d5a3b229/nihms-1959452-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/b2dcc46aefd8/nihms-1959452-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/c9cf98f829a4/nihms-1959452-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/1d7ca2ab3f9f/nihms-1959452-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/4fa4e6d5e7c2/nihms-1959452-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/3bdd8d7a3e0a/nihms-1959452-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/7877f892b997/nihms-1959452-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/b62a734910e2/nihms-1959452-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/e0a90b6ffb69/nihms-1959452-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/2289c2f5d556/nihms-1959452-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/b188b63ec840/nihms-1959452-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6199/11281313/37c72f031879/nihms-1959452-f0012.jpg

相似文献

1
Deep Learning for Neuromuscular Control of Vocal Source for Voice Production.用于语音产生中声源神经肌肉控制的深度学习
Appl Sci (Basel). 2024 Jan;14(2). doi: 10.3390/app14020769. Epub 2024 Jan 16.
2
Mapping Thyroarytenoid and Cricothyroid Activations to Postural and Acoustic Features in a Fiber-Gel Model of the Vocal Folds.在声带纤维凝胶模型中,将甲杓肌和环甲肌激活与姿势及声学特征进行映射。
Appl Sci (Basel). 2019 Nov;9(21). doi: 10.3390/app9214671. Epub 2019 Nov 1.
3
Interactions of subglottal pressure and neuromuscular activation on fundamental frequency and intensity.声门下压力与神经肌肉激活对基频和强度的相互作用。
Laryngoscope. 2016 May;126(5):1123-30. doi: 10.1002/lary.25550. Epub 2016 Mar 12.
4
Effects of cricothyroid and thyroarytenoid interaction on voice control: Muscle activity, vocal fold biomechanics, flow, and acoustics.环甲肌和甲杓肌相互作用对嗓音控制的影响:肌肉活动、声带生物力学、流量和声学。
J Acoust Soc Am. 2021 Jul;150(1):29. doi: 10.1121/10.0005275.
5
Simulated effects of cricothyroid and thyroarytenoid muscle activation on adult-male vocal fold vibration.环甲肌和甲杓肌激活对成年男性声带振动的模拟效果。
J Acoust Soc Am. 2006 Jul;120(1):386-97. doi: 10.1121/1.2204442.
6
Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model.使用神经网络框架和语音产生模型从颈部表面振动估计声门下压力、声带碰撞压力和喉内肌激活。
Front Physiol. 2021 Sep 1;12:732244. doi: 10.3389/fphys.2021.732244. eCollection 2021.
7
Impact of Instructed Laryngeal Manipulation on Acoustic Measures of Voice-Preliminary Results.指导性喉部手法对嗓音声学指标的影响——初步结果
J Voice. 2023 Jan;37(1):143.e1-143.e11. doi: 10.1016/j.jvoice.2020.11.004. Epub 2020 Dec 4.
8
Effects of Vocal Training on Thyroarytenoid Muscle Neuromuscular Junctions and Myofibers in Young and Older Rats.嗓音训练对青年和老年大鼠甲状腺动肌神经肌肉接头和肌纤维的影响。
J Gerontol A Biol Sci Med Sci. 2021 Jan 18;76(2):244-252. doi: 10.1093/gerona/glaa173.
9
Three-dimensional posture changes of the vocal fold from paired intrinsic laryngeal muscles.来自成对喉内肌的声带三维姿势变化。
Laryngoscope. 2017 Mar;127(3):656-664. doi: 10.1002/lary.26145. Epub 2016 Jul 5.
10
An Acoustic-Signal-Based Preventive Program for University Lecturers' Vocal Health.基于声信号的高校教师嗓音保健预防方案
J Voice. 2020 Jan;34(1):88-99. doi: 10.1016/j.jvoice.2018.05.011. Epub 2018 Jul 31.

引用本文的文献

1
Bayesian inference of state feedback control parameters for fo perturbation responses in cerebellar ataxia.贝叶斯推断小脑共济失调中 fo 扰动响应的状态反馈控制参数。
PLoS Comput Biol. 2024 Oct 11;20(10):e1011986. doi: 10.1371/journal.pcbi.1011986. eCollection 2024 Oct.

本文引用的文献

1
LaDIVA: A neurocomputational model providing laryngeal motor control for speech acquisition and production.LaDIVA:一个提供喉运动控制的神经计算模型,用于语音获取和产生。
PLoS Comput Biol. 2022 Jun 23;18(6):e1010159. doi: 10.1371/journal.pcbi.1010159. eCollection 2022 Jun.
2
Mapping Thyroarytenoid and Cricothyroid Activations to Postural and Acoustic Features in a Fiber-Gel Model of the Vocal Folds.在声带纤维凝胶模型中,将甲杓肌和环甲肌激活与姿势及声学特征进行映射。
Appl Sci (Basel). 2019 Nov;9(21). doi: 10.3390/app9214671. Epub 2019 Nov 1.
3
Objective Assessment of Pathological Voice Using Artificial Intelligence Based on the GRBAS Scale.
基于 GRBAS 量表的人工智能在病理性嗓音评估中的客观评估。
J Voice. 2024 May;38(3):561-566. doi: 10.1016/j.jvoice.2021.11.021. Epub 2021 Dec 30.
4
A Deep Neural Network Based Glottal Flow Model for Predicting Fluid-Structure Interactions during Voice Production.一种基于深度神经网络的声门气流模型,用于预测语音产生过程中的流固相互作用。
Appl Sci (Basel). 2020 Jan 2;10(2). doi: 10.3390/app10020705. Epub 2020 Jan 19.
5
Voice Feature Selection to Improve Performance of Machine Learning Models for Voice Production Inversion.语音特征选择以提高语音产生反转机器学习模型的性能。
J Voice. 2023 Jul;37(4):479-485. doi: 10.1016/j.jvoice.2021.03.004. Epub 2021 Apr 11.
6
Analysis of Glottal Inverse Filtering in the Presence of Source-Filter Interaction.源-滤波器相互作用下的声门逆滤波分析
Speech Commun. 2020 Oct;123:98-108. doi: 10.1016/j.specom.2020.07.003. Epub 2020 Jul 24.
7
Contributions of Auditory and Somatosensory Feedback to Vocal Motor Control.听觉和体感反馈对发声运动控制的作用
J Speech Lang Hear Res. 2020 Jul 20;63(7):2039-2053. doi: 10.1044/2020_JSLHR-19-00296. Epub 2020 Jun 30.
8
Cognitive plausibility in voice-based AI health counselors.基于语音的人工智能健康顾问中的认知合理性。
NPJ Digit Med. 2020 May 15;3:72. doi: 10.1038/s41746-020-0278-7. eCollection 2020.
9
Changes of orofacial somatosensory attenuation during speech production.言语产生过程中口面部躯体感觉衰减的变化
Neurosci Lett. 2020 Jun 21;730:135045. doi: 10.1016/j.neulet.2020.135045. Epub 2020 May 13.
10
Estimation of vocal fold physiology from voice acoustics using machine learning.利用机器学习从语音声学估计声带生理机能。
J Acoust Soc Am. 2020 Mar;147(3):EL264. doi: 10.1121/10.0000927.