Neural network-based estimation of biomechanical vocal fold parameters.

Authors

Donhauser Jonas, Tur Bogac, Döllinger Michael

Affiliation

Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany.

Publication information

Front Physiol. 2024 Feb 21;15:1282574. doi: 10.3389/fphys.2024.1282574. eCollection 2024.

DOI:10.3389/fphys.2024.1282574
PMID:38449783
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10916882/
Abstract

Vocal fold (VF) vibrations are the primary source of human phonation. High-speed video (HSV) endoscopy enables the computation of descriptive VF parameters for assessment of physiological properties of laryngeal dynamics, i.e., the vibration of the VFs. However, underlying biomechanical factors responsible for physiological and disordered VF vibrations cannot be accessed. In contrast, physically based numerical VF models reveal insights into the organ's oscillations, which remain inaccessible through endoscopy. To estimate biomechanical properties, previous research has fitted subglottal pressure-driven mass-spring-damper systems, as an inverse problem, to the HSV-recorded VF trajectories by global optimization of the numerical model. A neural network trained on the numerical model may be used as a substitute for computationally expensive optimization, yielding a fast-evaluating surrogate of the biomechanical inverse problem. This paper proposes a convolutional recurrent neural network (CRNN)-based architecture trained on regression of a physiologically based biomechanical six-mass model (6 MM). To compare with previous research, the prediction of the underlying biomechanical factor "subglottal pressure" was tested against 288 HSV porcine recordings. The contributions of this work are two-fold: first, the presented CRNN with the 6 MM handles multiple trajectories along the VFs, which allows for investigation of local changes in VF characteristics. Second, the network was trained to reproduce further important biomechanical model parameters like VF mass and stiffness on synthetic data. Unlike in previous work, the network in this study is therefore an entire surrogate of the inverse problem, which allowed for explicit computation of the fitted model using our approach.
The presented approach achieves a best-case mean absolute error (MAE) of 133 Pa (13.9%) in subglottal pressure prediction with 76.6% correlation on experimental data and a re-estimated fundamental frequency MAE of 15.9 Hz (9.9%). A detailed training analysis revealed subglottal pressure as the most learnable parameter. With the physiologically based model design and advances in fast parameter prediction, this work is a next step in biomechanical VF model fitting and the estimation of laryngeal kinematics.
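The error measures quoted in the abstract (MAE, a percentage error, and a correlation) can be illustrated with a minimal sketch. The abstract does not state how the percentage is normalized or which correlation coefficient is used, so the choices below (MAE divided by the mean measured value, and Pearson correlation) are assumptions, and the pressure values are made-up illustrative numbers, not data from the study.

```python
import math


def mean_absolute_error(y_true, y_pred):
    """Average of |prediction - target| over all samples."""
    return sum(abs(p - t) for t, p in zip(y_true, y_pred)) / len(y_true)


def relative_mae(y_true, y_pred):
    """MAE as a fraction of the mean target value (assumed normalization)."""
    mean_target = sum(y_true) / len(y_true)
    return mean_absolute_error(y_true, y_pred) / mean_target


def pearson_correlation(y_true, y_pred):
    """Pearson correlation coefficient between targets and predictions."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)


# Hypothetical subglottal pressures in Pa (illustrative only).
measured = [800.0, 1000.0, 1200.0, 1400.0]
predicted = [900.0, 950.0, 1300.0, 1350.0]

mae = mean_absolute_error(measured, predicted)
rel = relative_mae(measured, predicted)
corr = pearson_correlation(measured, predicted)
print(f"MAE = {mae:.1f} Pa, relative MAE = {rel:.1%}, correlation = {corr:.1%}")
```

Reporting both an absolute and a relative error, as the abstract does, makes results comparable across parameters with different units such as pressure in Pa and frequency in Hz.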

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6281/10916882/c5eaefd37bf7/fphys-15-1282574-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6281/10916882/d59e6c7ae664/fphys-15-1282574-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6281/10916882/b6e54eae2744/fphys-15-1282574-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6281/10916882/4689fabad1e8/fphys-15-1282574-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6281/10916882/ceb2b80450c8/fphys-15-1282574-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6281/10916882/80220909f5f1/fphys-15-1282574-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6281/10916882/5a7a0f72049d/fphys-15-1282574-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6281/10916882/8634099da291/fphys-15-1282574-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6281/10916882/9cab471bb97b/fphys-15-1282574-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6281/10916882/fa2e92bd3bcd/fphys-15-1282574-g010.jpg

Similar articles

1
Neural network-based estimation of biomechanical vocal fold parameters.
Front Physiol. 2024 Feb 21;15:1282574. doi: 10.3389/fphys.2024.1282574. eCollection 2024.
2
Laryngeal Pressure Estimation With a Recurrent Neural Network.
IEEE J Transl Eng Health Med. 2018 Dec 27;7:2000111. doi: 10.1109/JTEHM.2018.2886021. eCollection 2019.
3
Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model.
Front Physiol. 2021 Sep 1;12:732244. doi: 10.3389/fphys.2021.732244. eCollection 2021.
4
Physical parameter estimation from porcine ex vivo vocal fold dynamics in an inverse problem framework.
Biomech Model Mechanobiol. 2018 Jun;17(3):777-792. doi: 10.1007/s10237-017-0992-5. Epub 2017 Dec 11.
5
Three-dimensional biomechanical properties of human vocal folds: parameter optimization of a numerical model to match in vitro dynamics.
J Acoust Soc Am. 2012 Feb;131(2):1378-90. doi: 10.1121/1.3676622.
6
Deep-Learning-Based Representation of Vocal Fold Dynamics in Adductor Spasmodic Dysphonia during Connected Speech in High-Speed Videoendoscopy.
J Voice. 2025 Mar;39(2):570.e1-570.e15. doi: 10.1016/j.jvoice.2022.08.022. Epub 2022 Sep 23.
7
Biomechanical simulation of vocal fold dynamics in adults based on laryngeal high-speed videoendoscopy.
PLoS One. 2017 Nov 9;12(11):e0187486. doi: 10.1371/journal.pone.0187486. eCollection 2017.
8
Computation of physiological human vocal fold parameters by mathematical optimization of a biomechanical model.
J Acoust Soc Am. 2011 Aug;130(2):948-64. doi: 10.1121/1.3605551.
9
Identification and analysis of Nonlinear behaviors of vocal fold biomechanics during phonation to assess efficacy of surgery for benign laryngeal Diseases.
Comput Biol Med. 2024 Feb;169:107946. doi: 10.1016/j.compbiomed.2024.107946. Epub 2024 Jan 1.
10
Aerodynamically driven phonation of individual vocal folds under general anesthesia in canines.
Laryngoscope. 2020 Aug;130(8):1980-1988. doi: 10.1002/lary.28333. Epub 2019 Oct 11.

Cited by

1
Estimation of Physiological Vocal Features from Neck Surface Acceleration Signals Using Probabilistic Bayesian Neural Networks.
IEEE Trans Audio Speech Lang Process (2025). 2025;33:1576-1589. doi: 10.1109/taslpro.2025.3552938. Epub 2025 Apr 18.
2
Machine learning based assessment of hoarseness severity: a multi-sensor approach centered on high-speed videoendoscopy.
Front Artif Intell. 2025 Jun 5;8:1601716. doi: 10.3389/frai.2025.1601716. eCollection 2025.
3
Floating Ball Voice Therapy: Preliminary Effects on Outcomes and Predicting Individual Patient Differences in Generalization.
J Speech Lang Hear Res. 2024 Oct 8;67(10):3521-3535. doi: 10.1044/2024_JSLHR-23-00727. Epub 2024 Sep 25.
