• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用神经网络框架和语音产生模型从颈部表面振动估计声门下压力、声带碰撞压力和喉内肌激活。

Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model.

作者信息

Ibarra Emiro J, Parra Jesús A, Alzamendi Gabriel A, Cortés Juan P, Espinoza Víctor M, Mehta Daryush D, Hillman Robert E, Zañartu Matías

机构信息

Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile.

School of Electrical Engineering, University of the Andes, Mérida, Venezuela.

出版信息

Front Physiol. 2021 Sep 1;12:732244. doi: 10.3389/fphys.2021.732244. eCollection 2021.

DOI:10.3389/fphys.2021.732244
PMID:34539451
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8440844/
Abstract

The ambulatory assessment of vocal function can be significantly enhanced by having access to physiologically based features that describe underlying pathophysiological mechanisms in individuals with voice disorders. This type of enhancement can improve methods for the prevention, diagnosis, and treatment of behaviorally based voice disorders. Unfortunately, the direct measurement of important vocal features such as subglottal pressure, vocal fold collision pressure, and laryngeal muscle activation is impractical in laboratory and ambulatory settings. In this study, we introduce a method to estimate these features during phonation from a neck-surface vibration signal through a framework that integrates a physiologically relevant model of voice production and machine learning tools. The signal from a neck-surface accelerometer is first processed using subglottal impedance-based inverse filtering to yield an estimate of the unsteady glottal airflow. Seven aerodynamic and acoustic features are extracted from the neck surface accelerometer and an optional microphone signal. A neural network architecture is selected to provide a mapping between the seven input features and subglottal pressure, vocal fold collision pressure, and cricothyroid and thyroarytenoid muscle activation. This non-linear mapping is trained solely with 13,000 Monte Carlo simulations of a voice production model that utilizes a symmetric triangular body-cover model of the vocal folds. The performance of the method was compared against laboratory data from synchronous recordings of oral airflow, intraoral pressure, microphone, and neck-surface vibration in 79 vocally healthy female participants uttering consecutive /pæ/ syllable strings at comfortable, loud, and soft levels. The mean absolute error and root-mean-square error for estimating the mean subglottal pressure were 191 Pa (1.95 cm HO) and 243 Pa (2.48 cm HO), respectively, which are comparable with previous studies but with the key advantage of not requiring subject-specific training and yielding more output measures. The validation of vocal fold collision pressure and laryngeal muscle activation was performed with synthetic values as reference. These initial results provide valuable insight for further vocal fold model refinement and constitute a proof of concept that the proposed machine learning method is a feasible option for providing physiologically relevant measures for laboratory and ambulatory assessment of vocal function.

摘要

通过获取基于生理的特征来描述嗓音障碍个体潜在的病理生理机制,可显著增强嗓音功能的动态评估。这种增强可以改善基于行为的嗓音障碍的预防、诊断和治疗方法。不幸的是,在实验室和动态环境中,直接测量重要的嗓音特征,如声门下压力、声带碰撞压力和喉肌激活是不切实际的。在本研究中,我们介绍了一种方法,通过一个整合了与生理相关的嗓音产生模型和机器学习工具的框架,从颈部表面振动信号中估计发声过程中的这些特征。首先使用基于声门下阻抗的逆滤波对来自颈部表面加速度计的信号进行处理,以产生不稳定声门气流的估计值。从颈部表面加速度计和一个可选的麦克风信号中提取七个空气动力学和声学特征。选择一种神经网络架构,以提供七个输入特征与声门下压力、声带碰撞压力以及环甲肌和甲杓肌激活之间的映射。这种非线性映射仅通过对一个利用声带对称三角体覆盖模型的嗓音产生模型进行13000次蒙特卡罗模拟来训练。将该方法的性能与79名嗓音健康的女性参与者在舒适、大声和轻声水平下连续发出/pæ/音节串时,同步记录的口腔气流、口腔内压力、麦克风和颈部表面振动的实验室数据进行了比较。估计平均声门下压力的平均绝对误差和均方根误差分别为191 Pa(1.95 cm HO)和243 Pa(2.48 cm HO),这与先前的研究相当,但关键优势在于不需要针对个体的训练,并且能产生更多的输出测量值。以合成值作为参考对声带碰撞压力和喉肌激活进行了验证。这些初步结果为进一步完善声带模型提供了有价值的见解,并构成了一个概念验证,即所提出的机器学习方法是为嗓音功能的实验室和动态评估提供与生理相关测量值的可行选择。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b947/8440844/13b1af5d3159/fphys-12-732244-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b947/8440844/85318e5a7950/fphys-12-732244-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b947/8440844/2376890e6181/fphys-12-732244-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b947/8440844/908a70e9b757/fphys-12-732244-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b947/8440844/e78ef23f3800/fphys-12-732244-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b947/8440844/13b1af5d3159/fphys-12-732244-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b947/8440844/85318e5a7950/fphys-12-732244-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b947/8440844/2376890e6181/fphys-12-732244-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b947/8440844/908a70e9b757/fphys-12-732244-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b947/8440844/e78ef23f3800/fphys-12-732244-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b947/8440844/13b1af5d3159/fphys-12-732244-g0005.jpg

相似文献

1
Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model.使用神经网络框架和语音产生模型从颈部表面振动估计声门下压力、声带碰撞压力和喉内肌激活。
Front Physiol. 2021 Sep 1;12:732244. doi: 10.3389/fphys.2021.732244. eCollection 2021.
2
Improved subglottal pressure estimation from neck-surface vibration in healthy speakers producing non-modal phonation.健康受试者发出非模态发声时,基于颈部表面振动改进声门下压力估计。
IEEE J Sel Top Signal Process. 2020 Feb;14(2):449-460. doi: 10.1109/jstsp.2019.2959267. Epub 2019 Dec 12.
3
Glottal Aerodynamics Estimated From Neck-Surface Vibration in Women With Phonotraumatic and Nonphonotraumatic Vocal Hyperfunction.从患有发音过度和非发音过度嗓音障碍的女性的颈部表面振动估计声门气流动力学。
J Speech Lang Hear Res. 2020 Sep 15;63(9):2861-2869. doi: 10.1044/2020_JSLHR-20-00189. Epub 2020 Aug 5.
4
Ambulatory Monitoring of Subglottal Pressure Estimated from Neck-Surface Vibration in Individuals with and without Voice Disorders.通过颈部表面振动估计声门下压力对有声带疾病和无声带疾病个体的动态监测
Appl Sci (Basel). 2022 Nov 1;12(21). doi: 10.3390/app122110692. Epub 2022 Oct 22.
5
Vocal Fold Dissipated Power in Females with Hyperfunctional Voice Disorders.功能性嗓音障碍女性的声带耗散功率
J Voice. 2024 Oct 18. doi: 10.1016/j.jvoice.2024.09.039.
6
Subglottal Impedance-Based Inverse Filtering of Voiced Sounds Using Neck Surface Acceleration.基于声门下阻抗的颈部表面加速度对浊音进行逆滤波
IEEE Trans Audio Speech Lang Process. 2013 Sep;21(9):1929-1939. doi: 10.1109/TASL.2013.2263138.
7
Estimation of Subglottal Pressure From Neck Surface Vibration in Patients With Voice Disorders.通过嗓音障碍患者颈部表面振动估计声门下压力
J Speech Lang Hear Res. 2020 Jul 20;63(7):2202-2218. doi: 10.1044/2020_JSLHR-19-00409. Epub 2020 Jul 1.
8
Bayesian estimation of vocal function measures using laryngeal high-speed videoendoscopy and glottal airflow estimates: An in vivo case study.使用喉高速视频内窥镜检查和声门气流估计进行嗓音功能测量的贝叶斯估计:一项体内病例研究。
J Acoust Soc Am. 2020 May;147(5):EL434. doi: 10.1121/10.0001276.
9
Magnitude of Neck-Surface Vibration as an Estimate of Subglottal Pressure During Modulations of Vocal Effort and Intensity in Healthy Speakers.在健康受试者发声力度和强度调节过程中,颈部表面振动幅度作为声门下压力估计值的研究
J Speech Lang Hear Res. 2017 Dec 20;60(12):3404-3416. doi: 10.1044/2017_JSLHR-S-17-0180.
10
Voice Feature Selection to Improve Performance of Machine Learning Models for Voice Production Inversion.语音特征选择以提高语音产生反转机器学习模型的性能。
J Voice. 2023 Jul;37(4):479-485. doi: 10.1016/j.jvoice.2021.03.004. Epub 2021 Apr 11.

引用本文的文献

1
Estimation of Physiological Vocal Features from Neck Surface Acceleration Signals Using Probabilistic Bayesian Neural Networks.使用概率贝叶斯神经网络从颈部表面加速度信号估计生理发声特征。
IEEE Trans Audio Speech Lang Process (2025). 2025;33:1576-1589. doi: 10.1109/taslpro.2025.3552938. Epub 2025 Apr 18.
2
EEG oscillations and related brain generators of phonation phases in long utterances.长话语发声阶段的脑电图振荡及相关脑发生器
Sci Rep. 2025 Aug 9;15(1):29150. doi: 10.1038/s41598-025-13901-8.
3
Subject-Specific Modeling by Domain Adaptation for the Estimation of Subglottal Pressure from Neck-Surface Acceleration Signals.

本文引用的文献

1
Triangular body-cover model of the vocal folds with coordinated activation of the five intrinsic laryngeal muscles.具有协同激活的五组喉内肌的声带的三角体覆盖模型。
J Acoust Soc Am. 2022 Jan;151(1):17. doi: 10.1121/10.0009169.
2
Improved subglottal pressure estimation from neck-surface vibration in healthy speakers producing non-modal phonation.健康受试者发出非模态发声时,基于颈部表面振动改进声门下压力估计。
IEEE J Sel Top Signal Process. 2020 Feb;14(2):449-460. doi: 10.1109/jstsp.2019.2959267. Epub 2019 Dec 12.
3
Bayesian Inference of Vocal Fold Material Properties from Glottal Area Waveforms Using a 2D Finite Element Model.
基于域适应的特定对象建模,用于从颈部表面加速度信号估计声门下压力
Biomed Signal Process Control. 2025 Aug;106. doi: 10.1016/j.bspc.2025.107681. Epub 2025 Feb 26.
4
Neural network-based estimation of biomechanical vocal fold parameters.基于神经网络的生物力学声带参数估计
Front Physiol. 2024 Feb 21;15:1282574. doi: 10.3389/fphys.2024.1282574. eCollection 2024.
5
Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Estimating Sample Size and Reducing Overfitting.迈向语音、语言和听力科学中的通用机器学习模型:估计样本量并减少过拟合。
J Speech Lang Hear Res. 2024 Mar 11;67(3):753-781. doi: 10.1044/2023_JSLHR-23-00273. Epub 2024 Feb 22.
6
Ambulatory Monitoring of Subglottal Pressure Estimated from Neck-Surface Vibration in Individuals with and without Voice Disorders.通过颈部表面振动估计声门下压力对有声带疾病和无声带疾病个体的动态监测
Appl Sci (Basel). 2022 Nov 1;12(21). doi: 10.3390/app122110692. Epub 2022 Oct 22.
7
Biomechanical mechanism of reduced aspiration by the Passy-Muir valve in tracheostomized patients following acquired brain injury: Evidences from subglottic pressure.帕西-缪尔阀降低获得性脑损伤后气管切开患者误吸的生物力学机制:来自声门下压力的证据
Front Neurosci. 2022 Oct 31;16:1004013. doi: 10.3389/fnins.2022.1004013. eCollection 2022.
8
Kalman Filter Implementation of Subglottal Impedance-Based Inverse Filtering to Estimate Glottal Airflow during Phonation.基于声门下阻抗的逆滤波的卡尔曼滤波器实现,用于估计发声过程中的声门气流。
Appl Sci (Basel). 2022 Jan;12(1). doi: 10.3390/app12010401. Epub 2021 Dec 31.
9
LaDIVA: A neurocomputational model providing laryngeal motor control for speech acquisition and production.LaDIVA:一个提供喉运动控制的神经计算模型,用于语音获取和产生。
PLoS Comput Biol. 2022 Jun 23;18(6):e1010159. doi: 10.1371/journal.pcbi.1010159. eCollection 2022 Jun.
10
Estimating subglottal pressure and vocal fold adduction from the produced voice in a single-subject study (L).在一项单主体研究中(L),从产生的声音估计声门下压和声门内收。
J Acoust Soc Am. 2022 Feb;151(2):1337. doi: 10.1121/10.0009616.
使用二维有限元模型从声门面积波形进行声带材料特性的贝叶斯推断
Appl Sci (Basel). 2019 Jul 1;9(13). doi: 10.3390/app9132735. Epub 2019 Jul 6.
4
Differences in Daily Voice Use Measures Between Female Patients With Nonphonotraumatic Vocal Hyperfunction and Matched Controls.非发声创伤性嗓音功能亢进女性患者与匹配对照组之间日常嗓音使用指标的差异。
J Speech Lang Hear Res. 2021 May 11;64(5):1457-1470. doi: 10.1044/2021_JSLHR-20-00538. Epub 2021 Apr 23.
5
Changes in a Daily Phonotrauma Index After Laryngeal Surgery and Voice Therapy: Implications for the Role of Daily Voice Use in the Etiology and Pathophysiology of Phonotraumatic Vocal Hyperfunction.喉手术后及嗓音治疗后每日发声创伤指数的变化:每日嗓音使用在发声创伤性嗓音功能亢进的病因学和病理生理学中的作用探讨
J Speech Lang Hear Res. 2020 Dec 14;63(12):3934-3944. doi: 10.1044/2020_JSLHR-20-00168. Epub 2020 Nov 16.
6
An Updated Theoretical Framework for Vocal Hyperfunction.发声亢进的更新理论框架
Am J Speech Lang Pathol. 2020 Nov 12;29(4):2254-2260. doi: 10.1044/2020_AJSLP-20-00104. Epub 2020 Oct 2.
7
Glottal Aerodynamics Estimated From Neck-Surface Vibration in Women With Phonotraumatic and Nonphonotraumatic Vocal Hyperfunction.从患有发音过度和非发音过度嗓音障碍的女性的颈部表面振动估计声门气流动力学。
J Speech Lang Hear Res. 2020 Sep 15;63(9):2861-2869. doi: 10.1044/2020_JSLHR-20-00189. Epub 2020 Aug 5.
8
Estimation of Subglottal Pressure From Neck Surface Vibration in Patients With Voice Disorders.通过嗓音障碍患者颈部表面振动估计声门下压力
J Speech Lang Hear Res. 2020 Jul 20;63(7):2202-2218. doi: 10.1044/2020_JSLHR-19-00409. Epub 2020 Jul 1.
9
Bayesian estimation of vocal function measures using laryngeal high-speed videoendoscopy and glottal airflow estimates: An in vivo case study.使用喉高速视频内窥镜检查和声门气流估计进行嗓音功能测量的贝叶斯估计:一项体内病例研究。
J Acoust Soc Am. 2020 May;147(5):EL434. doi: 10.1121/10.0001276.
10
Estimation of vocal fold physiology from voice acoustics using machine learning.利用机器学习从语音声学估计声带生理机能。
J Acoust Soc Am. 2020 Mar;147(3):EL264. doi: 10.1121/10.0000927.