• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Spatio-temporal articulatory movement primitives during speech production: extraction, interpretation, and validation.言语产生过程中的时空发音运动基元:提取、解释和验证。
J Acoust Soc Am. 2013 Aug;134(2):1378-94. doi: 10.1121/1.4812765.
2
Co-registration of speech production datasets from electromagnetic articulography and real-time magnetic resonance imaging.基于电磁关节造影和实时磁共振成像的言语产生数据集的联合配准
J Acoust Soc Am. 2014 Feb;135(2):EL115-21. doi: 10.1121/1.4862880.
3
The coarticulation/invariance scale: mutual information as a measure of coarticulation resistance, motor synergy, and articulatory invariance.协同发音/不变性尺度:互信息作为衡量协同发音阻力、运动协同和发音不变性的指标。
J Acoust Soc Am. 2013 Aug;134(2):1271-82. doi: 10.1121/1.4812855.
4
An episodic memory-based solution for the acoustic-to-articulatory inversion problem.基于情景记忆的声学-发音反转问题解决方案。
J Acoust Soc Am. 2013 May;133(5):2921-30. doi: 10.1121/1.4798665.
5
Articulatory Range of Movement in Individuals With Dysarthria Secondary to Amyotrophic Lateral Sclerosis.肌萎缩侧索硬化症继发构音障碍患者的运动范围。
Am J Speech Lang Pathol. 2018 Aug 6;27(3):996-1009. doi: 10.1044/2018_AJSLP-17-0064.
6
Speech Sound Disorders in Children: An Articulatory Phonology Perspective.儿童语音障碍:发音音系学视角
Front Psychol. 2020 Jan 28;10:2998. doi: 10.3389/fpsyg.2019.02998. eCollection 2019.
7
Vocal tract normalization for midsagittal articulatory recovery with analysis-by-synthesis.基于合成分析的矢状面中部发音恢复的声道归一化
J Acoust Soc Am. 1999 Aug;106(2):1090-105. doi: 10.1121/1.427117.
8
The acoustic counterpart to coarticulation resistance and aggressiveness in locus equation metrics and vowel dispersion.在音位方程度量和元音离散度中,与协同发音阻力和攻击性相对应的声学特征。
J Acoust Soc Am. 2017 Apr;141(4):EL345. doi: 10.1121/1.4979301.
9
On smoothing articulatory trajectories obtained from Gaussian mixture model based acoustic-to-articulatory inversion.基于高斯混合模型的声学-发音反向的发音轨迹平滑处理。
J Acoust Soc Am. 2013 Aug;134(2):EL258-64. doi: 10.1121/1.4813590.
10
A procedure for estimating gestural scores from speech acoustics.一种从语音声学估算手势分数的方法。
J Acoust Soc Am. 2012 Dec;132(6):3980-9. doi: 10.1121/1.4763545.

引用本文的文献

1
Precise Extraction of Neural Motifs Reveals Multiscale, Parallel Encoding Schemes in Auditory Cortex.神经基序的精确提取揭示了听觉皮层中的多尺度并行编码方案。
bioRxiv. 2025 Aug 26:2025.08.26.672281. doi: 10.1101/2025.08.26.672281.
2
Computer-Implemented Articulatory Models for Speech Production: A Review.用于语音生成的计算机实现的发音模型:综述
Front Robot AI. 2022 Mar 8;9:796739. doi: 10.3389/frobt.2022.796739. eCollection 2022.
3
A deep joint sparse non-negative matrix factorization framework for identifying the common and subject-specific functional units of tongue motion during speech.一种深度联合稀疏非负矩阵分解框架,用于识别言语中舌运动的共同和特定于主题的功能单元。
Med Image Anal. 2021 Aug;72:102131. doi: 10.1016/j.media.2021.102131. Epub 2021 Jun 12.
4
Quantal biomechanical effects in speech postures of the lips.唇音言语姿势的量子生物力学效应。
J Neurophysiol. 2020 Sep 1;124(3):833-843. doi: 10.1152/jn.00676.2019. Epub 2020 Jul 29.
5
Compression of dynamic tactile information in the human hand.人类手部动态触觉信息的压缩
Sci Adv. 2020 Apr 15;6(16):eaaz1158. doi: 10.1126/sciadv.aaz1158. eCollection 2020 Apr.
6
Identifying the Common and Subject-specific Functional Units of Speech Movements via a Joint Sparse Non-negative Matrix Factorization Framework.通过联合稀疏非负矩阵分解框架识别言语运动的通用和特定主题功能单元。
Proc SPIE Int Soc Opt Eng. 2020 Feb;11313. Epub 2020 Mar 10.
7
The FACTS model of speech motor control: Fusing state estimation and task-based control.言语运动控制的 FACTS 模型:融合状态估计和基于任务的控制。
PLoS Comput Biol. 2019 Sep 3;15(9):e1007321. doi: 10.1371/journal.pcbi.1007321. eCollection 2019 Sep.
8
Differentiating post-cancer from healthy tongue muscle coordination patterns during speech using deep learning.利用深度学习区分癌症后和健康人舌肌在言语时的协调模式。
J Acoust Soc Am. 2019 May;145(5):EL423. doi: 10.1121/1.5103191.
9
Unsupervised discovery of temporal sequences in high-dimensional datasets, with applications to neuroscience.无监督发现高维数据集的时间序列,及其在神经科学中的应用。
Elife. 2019 Feb 5;8:e38471. doi: 10.7554/eLife.38471.
10
A Sparse Non-Negative Matrix Factorization Framework for Identifying Functional Units of Tongue Behavior From MRI.基于稀疏非负矩阵分解的 MRI 舌运动功能单元识别方法
IEEE Trans Med Imaging. 2019 Mar;38(3):730-740. doi: 10.1109/TMI.2018.2870939. Epub 2018 Sep 18.

本文引用的文献

1
A procedure for estimating gestural scores from speech acoustics.一种从语音声学估算手势分数的方法。
J Acoust Soc Am. 2012 Dec;132(6):3980-9. doi: 10.1121/1.4763545.
2
Recognizing articulatory gestures from speech for robust speech recognition.从语音中识别发音动作以实现鲁棒的语音识别。
J Acoust Soc Am. 2012 Mar;131(3):2270-87. doi: 10.1121/1.3682038.
3
Articulatory-acoustic kinematics: the production of American English /s/.发音-声学运动学:美国英语/s/的产生。
J Acoust Soc Am. 2011 Feb;129(2):944-54. doi: 10.1121/1.3514537.
4
A generalized smoothness criterion for acoustic-to-articulatory inversion.声学到发音反演的广义平滑性准则。
J Acoust Soc Am. 2010 Oct;128(4):2162-72. doi: 10.1121/1.3455847.
5
A neural basis for motor primitives in the spinal cord.脊髓运动原基的神经基础。
J Neurosci. 2010 Jan 27;30(4):1322-36. doi: 10.1523/JNEUROSCI.5894-08.2010.
6
Synergies: atoms of brain and behavior.协同作用:大脑与行为的基本要素。
Adv Exp Med Biol. 2009;629:83-91. doi: 10.1007/978-0-387-77064-2_5.
7
Sparse representation of sounds in the unanesthetized auditory cortex.未麻醉听觉皮层中声音的稀疏表示
PLoS Biol. 2008 Jan;6(1):e16. doi: 10.1371/journal.pbio.0060016.
8
Combining modules for movement.运动组合模块。
Brain Res Rev. 2008 Jan;57(1):125-33. doi: 10.1016/j.brainresrev.2007.08.004. Epub 2007 Sep 5.
9
Control of fast-reaching movements by muscle synergy combinations.通过肌肉协同组合控制快速伸展运动。
J Neurosci. 2006 Jul 26;26(30):7791-810. doi: 10.1523/JNEUROSCI.0830-06.2006.
10
Matrix factorization algorithms for the identification of muscle synergies: evaluation on simulated and experimental data sets.用于识别肌肉协同作用的矩阵分解算法:对模拟数据集和实验数据集的评估
J Neurophysiol. 2006 Apr;95(4):2199-212. doi: 10.1152/jn.00222.2005. Epub 2006 Jan 4.

言语产生过程中的时空发音运动基元:提取、解释和验证。

Spatio-temporal articulatory movement primitives during speech production: extraction, interpretation, and validation.

机构信息

Ming Hsieh Department of Electrical Engineering, University of Southern California, Los Angeles, California 90089, USA.

出版信息

J Acoust Soc Am. 2013 Aug;134(2):1378-94. doi: 10.1121/1.4812765.

DOI:10.1121/1.4812765
PMID:23927134
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3745549/
Abstract

This paper presents a computational approach to derive interpretable movement primitives from speech articulation data. It puts forth a convolutive Nonnegative Matrix Factorization algorithm with sparseness constraints (cNMFsc) to decompose a given data matrix into a set of spatiotemporal basis sequences and an activation matrix. The algorithm optimizes a cost function that trades off the mismatch between the proposed model and the input data against the number of primitives that are active at any given instant. The method is applied to both measured articulatory data obtained through electromagnetic articulography as well as synthetic data generated using an articulatory synthesizer. The paper then describes how to evaluate the algorithm performance quantitatively and further performs a qualitative assessment of the algorithm's ability to recover compositional structure from data. This is done using pseudo ground-truth primitives generated by the articulatory synthesizer based on an Articulatory Phonology frame-work [Browman and Goldstein (1995). "Dynamics and articulatory phonology," in Mind as motion: Explorations in the dynamics of cognition, edited by R. F. Port and T.van Gelder (MIT Press, Cambridge, MA), pp. 175-194]. The results suggest that the proposed algorithm extracts movement primitives from human speech production data that are linguistically interpretable. Such a framework might aid the understanding of longstanding issues in speech production such as motor control and coarticulation.

摘要

本文提出了一种从语音发音数据中推导出可解释运动基元的计算方法。它提出了一种具有稀疏约束的卷积非负矩阵分解算法(cNMFsc),将给定的数据矩阵分解为一组时空基序列和一个激活矩阵。该算法优化了一个代价函数,该函数在提出的模型与输入数据之间的不匹配与任何给定时刻活动的基元数量之间进行权衡。该方法应用于通过电磁发音图获得的测量发音数据以及使用发音合成器生成的合成数据。然后,本文描述了如何对算法性能进行定量评估,并进一步对算法从数据中恢复成分结构的能力进行定性评估。这是通过使用基于发音语音学框架的发音合成器生成的伪基元来完成的[Browman 和 Goldstein(1995)。“动态与发音语音学”,Mind as motion:Explorations in the dynamics of cognition,edited by R. F. Port and T.van Gelder(MIT Press,Cambridge,MA),pp. 175-194]。结果表明,所提出的算法从人类言语产生数据中提取出具有语言可解释性的运动基元。这样的框架可能有助于理解言语产生中的长期问题,例如运动控制和协同发音。