用于估计声道直接运动学和微分运动学的统计方法。

Statistical Methods for Estimation of Direct and Differential Kinematics of the Vocal Tract.

作者信息

Lammert Adam, Goldstein Louis, Narayanan Shrikanth, Iskarous Khalil

机构信息

Signal Analysis & Interpretation Laboratory (SAIL), University of Southern California, 3710 McClintock Ave., Los Angeles, CA 90089, USA.

出版信息

Speech Commun. 2013 Jan;55(1):147-161. doi: 10.1016/j.specom.2012.08.001.

DOI:10.1016/j.specom.2012.08.001

PMID:24052685

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3774006/

Abstract

We present and evaluate two statistical methods for estimating kinematic relationships of the speech production system: Artificial Neural Networks and Locally-Weighted Regression. The work is motivated by the need to characterize this motor system, with particular focus on estimating differential aspects of kinematics. Kinematic analysis will facilitate progress in a variety of areas, including the nature of speech production goals, articulatory redundancy and, relatedly, acoustic-to-articulatory inversion. Statistical methods must be used to estimate these relationships from data since they are infeasible to express in closed form. Statistical models are optimized and evaluated - using a heldout data validation procedure - on two sets of synthetic speech data. The theoretical and practical advantages of both methods are also discussed. It is shown that both direct and differential kinematics can be estimated with high accuracy, even for complex, nonlinear relationships. Locally-Weighted Regression displays the best overall performance, which may be due to practical advantages in its training procedure. Moreover, accurate estimation can be achieved using only a modest amount of training data, as judged by convergence of performance. The algorithms are also applied to real-time MRI data, and the results are generally consistent with those obtained from synthetic data.

摘要

我们提出并评估了两种用于估计语音产生系统运动学关系的统计方法

人工神经网络和局部加权回归。这项工作的动机是需要对这个运动系统进行特征描述，特别关注估计运动学的差异方面。运动学分析将促进多个领域的进展，包括语音产生目标的性质、发音冗余以及相关的声学到发音的逆向转换。由于这些关系难以用封闭形式表达，因此必须使用统计方法从数据中估计它们。使用留出数据验证程序在两组合成语音数据上对统计模型进行优化和评估。还讨论了这两种方法的理论和实际优势。结果表明，即使对于复杂的非线性关系，直接运动学和微分运动学都可以高精度地估计。局部加权回归显示出最佳的整体性能，这可能归因于其训练过程中的实际优势。此外，从性能收敛情况判断，仅使用适量的训练数据就能实现准确估计。这些算法还应用于实时MRI数据，结果与从合成数据中获得的结果总体一致。

相似文献

Statistical Methods for Estimation of Direct and Differential Kinematics of the Vocal Tract.

Speech Commun. 2013 Jan;55(1):147-161. doi: 10.1016/j.specom.2012.08.001.

High-Resolution, Non-Invasive Imaging of Upper Vocal Tract Articulators Compatible with Human Brain Recordings.

PLoS One. 2016 Mar 28;11(3):e0151327. doi: 10.1371/journal.pone.0151327. eCollection 2016.

Impact of Vocal Effort on Respiratory and Articulatory Kinematics.

J Speech Lang Hear Res. 2022 Jan 12;65(1):5-21. doi: 10.1044/2021_JSLHR-21-00323. Epub 2021 Nov 29.

Human Sensorimotor Cortex Control of Directly Measured Vocal Tract Movements during Vowel Production.

J Neurosci. 2018 Mar 21;38(12):2955-2966. doi: 10.1523/JNEUROSCI.2382-17.2018. Epub 2018 Feb 8.

Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion.

J Acoust Soc Am. 2005 Jul;118(1):444-60. doi: 10.1121/1.1921448.

Retrieving Tract Variables From Acoustics: A Comparison of Different Machine Learning Strategies.

IEEE J Sel Top Signal Process. 2010 Sep 13;4(6):1027-1045. doi: 10.1109/JSTSP.2010.2076013.

Computer-Implemented Articulatory Models for Speech Production: A Review.

Front Robot AI. 2022 Mar 8;9:796739. doi: 10.3389/frobt.2022.796739. eCollection 2022.

Incorporation of phonetic constraints in acoustic-to-articulatory inversion.

J Acoust Soc Am. 2008 Apr;123(4):2310-23. doi: 10.1121/1.2885747.

Kinematic Analysis of Speech Sound Sequencing Errors Induced by Delayed Auditory Feedback.

J Speech Lang Hear Res. 2017 Jun 22;60(6S):1695-1711. doi: 10.1044/2017_JSLHR-S-16-0234.

Automatic Grading of Stroke Symptoms for Rapid Assessment Using Optimized Machine Learning and 4-Limb Kinematics: Clinical Validation Study.

J Med Internet Res. 2020 Sep 16;22(9):e20641. doi: 10.2196/20641.

引用本文的文献

Characterization of Photo-Crosslinked Methacrylated Type I Collagen as a Platform to Investigate the Lymphatic Endothelial Cell Response.

Lymphatics. 2024 Sep;2(3):177-194. doi: 10.3390/lymphatics2030015. Epub 2024 Sep 19.

Vertical larynx actions and intergestural timing stability in Hausa ejectives and implosives.

Phonetica. 2024 Oct 22;81(6):559-597. doi: 10.1515/phon-2023-0052. Print 2024 Dec 17.

Tongue Postures and Tongue Centers: A Study of Acoustic-Articulatory Correspondences Across Different Head Angles.

Front Psychol. 2022 Jan 17;12:768754. doi: 10.3389/fpsyg.2021.768754. eCollection 2021.

A modular architecture for articulatory synthesis from gestural specification.

J Acoust Soc Am. 2019 Dec;146(6):4458. doi: 10.1121/1.5139413.

The FACTS model of speech motor control: Fusing state estimation and task-based control.

PLoS Comput Biol. 2019 Sep 3;15(9):e1007321. doi: 10.1371/journal.pcbi.1007321. eCollection 2019 Sep.

Task-dependence of articulator synergies.

J Acoust Soc Am. 2019 Mar;145(3):1504. doi: 10.1121/1.5093538.

Advances in real-time magnetic resonance imaging of the vocal tract for speech science and technology research.

APSIPA Trans Signal Inf Process. 2016;5. doi: 10.1017/ATSIP.2016.5. Epub 2016 Mar 31.

Are articulatory settings mechanically advantageous for speech motor control?

PLoS One. 2014 Aug 18;9(8):e104168. doi: 10.1371/journal.pone.0104168. eCollection 2014.

本文引用的文献

Adaptive Mixtures of Local Experts.

Neural Comput. 1991 Spring;3(1):79-87. doi: 10.1162/neco.1991.3.1.79.

A self-organizing neural model of motor equivalent reaching and tool use by a multijoint arm.

J Cogn Neurosci. 1993 Fall;5(4):408-35. doi: 10.1162/jocn.1993.5.4.408.

Retrieving Tract Variables From Acoustics: A Comparison of Different Machine Learning Strategies.

IEEE J Sel Top Signal Process. 2010 Sep 13;4(6):1027-1045. doi: 10.1109/JSTSP.2010.2076013.

A procedure for estimating gestural scores from speech acoustics.

J Acoust Soc Am. 2012 Dec;132(6):3980-9. doi: 10.1121/1.4763545.

Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion.

J Acoust Soc Am. 2011 Oct;130(4):EL251-7. doi: 10.1121/1.3634122.

A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model.

J Acoust Soc Am. 2011 Apr;129(4):2144-62. doi: 10.1121/1.3514544.

A generalized smoothness criterion for acoustic-to-articulatory inversion.

J Acoust Soc Am. 2010 Oct;128(4):2162-72. doi: 10.1121/1.3455847.

Acoustic-articulatory mapping in vowels by locally weighted regression.

J Acoust Soc Am. 2009 Oct;126(4):2011-32. doi: 10.1121/1.3184581.

Region segmentation in the frequency domain applied to upper airway real-time magnetic resonance images.

IEEE Trans Med Imaging. 2009 Mar;28(3):323-38. doi: 10.1109/TMI.2008.928920.

Multilayer Potts perceptrons with Levenberg-Marquardt learning.

IEEE Trans Neural Netw. 2008 Dec;19(12):2032-43. doi: 10.1109/TNN.2008.2003271.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于估计声道直接运动学和微分运动学的统计方法。

Statistical Methods for Estimation of Direct and Differential Kinematics of the Vocal Tract.

作者信息

Lammert Adam, Goldstein Louis, Narayanan Shrikanth, Iskarous Khalil

机构信息

Signal Analysis & Interpretation Laboratory (SAIL), University of Southern California, 3710 McClintock Ave., Los Angeles, CA 90089, USA.

出版信息

Speech Commun. 2013 Jan;55(1):147-161. doi: 10.1016/j.specom.2012.08.001.

DOI:10.1016/j.specom.2012.08.001

PMID:24052685

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3774006/

Abstract

摘要

用于估计声道直接运动学和微分运动学的统计方法。

Statistical Methods for Estimation of Direct and Differential Kinematics of the Vocal Tract.

作者信息

机构信息

出版信息

我们提出并评估了两种用于估计语音产生系统运动学关系的统计方法

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于估计声道直接运动学和微分运动学的统计方法。

Statistical Methods for Estimation of Direct and Differential Kinematics of the Vocal Tract.

作者信息

机构信息

出版信息

我们提出并评估了两种用于估计语音产生系统运动学关系的统计方法