基于 Seq2Seq LSTM 的改进声门波模型用于模拟声带振动。

An Improved Glottal Flow Model Based on Seq2Seq LSTM for Simulation of Vocal Fold Vibration.

机构信息

College of Astronautics, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China.

College of General Aviation and Flight, Nanjing University of Aeronautics and Astronautics, Nanjing 213300, China.

出版信息

J Voice. 2024 Sep;38(5):983-992. doi: 10.1016/j.jvoice.2022.03.029. Epub 2022 May 7.

DOI:10.1016/j.jvoice.2022.03.029

Abstract

OBJECTIVES

An improved data-driven glottal flow model for fluid-structure interaction (FSI) simulation of the vocal fold vibration is proposed in this paper. This model aims to improve the prediction performance of the previously developed deep neural network (DNN) based empirical flow model (EFM) on accuracy and efficiency.

METHODS

A Seq2Seq long short-term memory (LSTM) network is employed in the present model to infer the flow rate and pressure distribution from the subglottal pressure and cross-section area distribution of the glottis. The training data is collected from the generalized glottal shape library generated in Zhang et al. RESULTS AND CONCLUSIONS: Compared to the EFM, the present model not only discards the time-consuming optimization process, but also drastically reduces the errors, therefore the prediction performance can be greatly improved. The present model is evaluated by coupling with a solid dynamics solver for FSI simulation, and the results demonstrate a great improvement on accuracy and efficiency.

摘要

目的

本文提出了一种改进的数据驱动流固耦合（FSI）模拟声门振动的声门波模型。该模型旨在提高先前基于深度神经网络（DNN）的经验流模型（EFM）在准确性和效率方面的预测性能。

方法

本模型采用序列到序列长短期记忆（LSTM）网络，从声门下压力和声带截面积分布推断流量和压力分布。训练数据来自 Zhang 等人生成的广义声门形状库。

结果与结论

与 EFM 相比，本模型不仅省去了耗时的优化过程，而且大幅降低了误差，因此可以大大提高预测性能。本模型通过与固体动力学求解器耦合进行 FSI 模拟进行评估，结果表明在准确性和效率方面有了很大的提高。

相似文献

1

An Improved Glottal Flow Model Based on Seq2Seq LSTM for Simulation of Vocal Fold Vibration.基于 Seq2Seq LSTM 的改进声门波模型用于模拟声带振动。

J Voice. 2024 Sep;38(5):983-992. doi: 10.1016/j.jvoice.2022.03.029. Epub 2022 May 7.

2

A Deep Learning-Based Generalized Empirical Flow Model of Glottal Flow During Normal Phonation.一种基于深度学习的正常发声期间声门波流的广义经验流模型。

J Biomech Eng. 2022 Sep 1;144(9). doi: 10.1115/1.4053862.

3

A computational study of the effect of false vocal folds on glottal flow and vocal fold vibration during phonation.假声带对发声过程中声门气流和声带振动影响的计算研究。

Ann Biomed Eng. 2009 Mar;37(3):625-42. doi: 10.1007/s10439-008-9630-9. Epub 2009 Jan 14.

4

A Reduced-Order Flow Model for Fluid-Structure Interaction Simulation of Vocal Fold Vibration.用于声门振动流固耦合模拟的降阶流模型。

J Biomech Eng. 2020 Feb 1;142(2):0210051-02100510. doi: 10.1115/1.4044033.

5

Analysis of flow-structure interaction in the larynx during phonation using an immersed-boundary method.使用浸入边界法分析发声过程中喉部的流固相互作用。

J Acoust Soc Am. 2009 Aug;126(2):816-24. doi: 10.1121/1.3158942.

6

Direct-numerical simulation of the glottal jet and vocal-fold dynamics in a three-dimensional laryngeal model.三维喉模型中声门射流和声带动力学的直接数值模拟。

J Acoust Soc Am. 2011 Jul;130(1):404-15. doi: 10.1121/1.3592216.

7

Regulation of glottal closure and airflow in a three-dimensional phonation model: implications for vocal intensity control.三维发声模型中声门闭合与气流的调节：对声音强度控制的影响

J Acoust Soc Am. 2015 Feb;137(2):898-910. doi: 10.1121/1.4906272.

8

Pressure, Flow, and Glottal Area Waveform Profile Changes During Phonation Using the Acapella Choice Device.使用 Acapella Choice 装置发声时压力、流量和声门区面积波型的变化。

J Voice. 2024 Sep;38(5):1248.e23-1248.e33. doi: 10.1016/j.jvoice.2022.02.010. Epub 2022 Mar 10.

9

Influence of flow separation location on phonation onset.气流分离位置对发声起始的影响。

J Acoust Soc Am. 2008 Sep;124(3):1689-94. doi: 10.1121/1.2957938.

10

Optimized transformation of the glottal motion into a mechanical model.优化声门运动到机械模型的转换。

Med Eng Phys. 2011 Mar;33(2):210-7. doi: 10.1016/j.medengphy.2010.09.019. Epub 2010 Nov 5.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验