• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于与主体无关的声学-发音反转的发音特征的自动语音识别。

Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion.

机构信息

Signal Analysis and Interpretation Laboratory, Department of Electrical Engineering, University of Southern California, Los Angeles, California 90089, USA.

出版信息

J Acoust Soc Am. 2011 Oct;130(4):EL251-7. doi: 10.1121/1.3634122.

DOI:10.1121/1.3634122
PMID:21974500
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3189967/
Abstract

An automatic speech recognition approach is presented which uses articulatory features estimated by a subject-independent acoustic-to-articulatory inversion. The inversion allows estimation of articulatory features from any talker's speech acoustics using only an exemplary subject's articulatory-to-acoustic map. Results are reported on a broad class phonetic classification experiment on speech from English talkers using data from three distinct English talkers as exemplars for inversion. Results indicate that the inclusion of the articulatory information improves classification accuracy but the improvement is more significant when the speaking style of the exemplar and the talker are matched compared to when they are mismatched.

摘要

本文提出了一种自动语音识别方法,该方法使用由与说话人无关的声学到发音的反转估计的发音特征。该反转允许仅使用示例主体的发音到声学图从任何说话者的语音声学中估计发音特征。使用来自三个不同英语说话者的示例数据,在英语说话者的语音的广泛类语音分类实验中报告了结果。结果表明,包含发音信息可以提高分类准确性,但当示例和说话者的说话风格匹配时,与不匹配时相比,改进更为显著。

相似文献

1
Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion.基于与主体无关的声学-发音反转的发音特征的自动语音识别。
J Acoust Soc Am. 2011 Oct;130(4):EL251-7. doi: 10.1121/1.3634122.
2
A generalized smoothness criterion for acoustic-to-articulatory inversion.声学到发音反演的广义平滑性准则。
J Acoust Soc Am. 2010 Oct;128(4):2162-72. doi: 10.1121/1.3455847.
3
Speech production knowledge in automatic speech recognition.自动语音识别中的语音生成知识。
J Acoust Soc Am. 2007 Feb;121(2):723-42. doi: 10.1121/1.2404622.
4
Tongue- and Jaw-Specific Articulatory Underpinnings of Reduced and Enhanced Acoustic Vowel Contrast in Talkers With Parkinson's Disease.舌部和颌部特定发音器官对帕金森病患者言语中元音对比减弱和增强的影响。
J Speech Lang Hear Res. 2019 Jul 15;62(7):2118-2132. doi: 10.1044/2019_JSLHR-S-MSC18-18-0192.
5
Articulatory-acoustic kinematics: the production of American English /s/.发音-声学运动学:美国英语/s/的产生。
J Acoust Soc Am. 2011 Feb;129(2):944-54. doi: 10.1121/1.3514537.
6
Improved speech inversion using general regression neural network.使用通用回归神经网络改进语音反转
J Acoust Soc Am. 2015 Sep;138(3):EL229-35. doi: 10.1121/1.4929626.
7
Incorporation of phonetic constraints in acoustic-to-articulatory inversion.在声学到发音逆向转换中纳入语音约束。
J Acoust Soc Am. 2008 Apr;123(4):2310-23. doi: 10.1121/1.2885747.
8
Modeling the effect of palate shape on the articulatory-acoustics mapping.建立腭形对发音声学映射影响的模型。
J Acoust Soc Am. 2018 Jul;144(1):EL71. doi: 10.1121/1.5048043.
9
Articulatory Underpinnings of Reduced Acoustic-Phonetic Contrasts in Individuals With Amyotrophic Lateral Sclerosis.肌萎缩侧索硬化症患者的发音基础与声学语音对比减弱。
Am J Speech Lang Pathol. 2022 Sep 7;31(5):2022-2044. doi: 10.1044/2022_AJSLP-22-00046. Epub 2022 Aug 16.
10
A modeling investigation of articulatory variability and acoustic stability during American English /r/ production.美式英语/r/发音过程中发音器官变异性和声学稳定性的建模研究。
J Acoust Soc Am. 2005 May;117(5):3196-212. doi: 10.1121/1.1893271.

引用本文的文献

1
Articulation constrained learning with application to speech emotion recognition.应用于语音情感识别的关节约束学习
EURASIP J Audio Speech Music Process. 2019;2019(1). doi: 10.1186/s13636-019-0157-9. Epub 2019 Aug 20.
2
Recognizing Whispered Speech Produced by an Individual with Surgically Reconstructed Larynx Using Articulatory Movement Data.利用发音运动数据识别接受喉部手术重建的个体所发出的低语语音。
Workshop Speech Lang Process Assist Technol. 2016 Sep;2016:80-86. doi: 10.21437/SLPAT.2016-14.
3
Speaker verification based on the fusion of speech acoustics and inverted articulatory signals.基于语音声学与反向发音信号融合的说话人验证
Comput Speech Lang. 2016 Mar;36:196-211. doi: 10.1016/j.csl.2015.05.003. Epub 2015 May 22.
4
Advances in real-time magnetic resonance imaging of the vocal tract for speech science and technology research.用于语音科学与技术研究的声道实时磁共振成像进展。
APSIPA Trans Signal Inf Process. 2016;5. doi: 10.1017/ATSIP.2016.5. Epub 2016 Mar 31.
5
Statistical Methods for Estimation of Direct and Differential Kinematics of the Vocal Tract.用于估计声道直接运动学和微分运动学的统计方法。
Speech Commun. 2013 Jan;55(1):147-161. doi: 10.1016/j.specom.2012.08.001.
6
Modeling speech imitation and ecological learning of auditory-motor maps.建模听觉-运动图谱的言语模仿和生态学习。
Front Psychol. 2013 Jun 27;4:364. doi: 10.3389/fpsyg.2013.00364. Print 2013.

本文引用的文献

1
A generalized smoothness criterion for acoustic-to-articulatory inversion.声学到发音反演的广义平滑性准则。
J Acoust Soc Am. 2010 Oct;128(4):2162-72. doi: 10.1121/1.3455847.
2
Accurate recovery of articulator positions from acoustics: new conclusions based on human data.从声学中准确恢复咬合架位置:基于人体数据的新结论。
J Acoust Soc Am. 1996 Sep;100(3):1819-34. doi: 10.1121/1.416001.