• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

最大似然连续语音识别方法。

A maximum likelihood approach to continuous speech recognition.

机构信息

MEMBER, IEEE, IBM T. J. Watson Research Center, Yorktown Heights, NY 10598.

出版信息

IEEE Trans Pattern Anal Mach Intell. 1983 Feb;5(2):179-90. doi: 10.1109/tpami.1983.4767370.

DOI:10.1109/tpami.1983.4767370
PMID:21869099
Abstract

Speech recognition is formulated as a problem of maximum likelihood decoding. This formulation requires statistical models of the speech production process. In this paper, we describe a number of statistical models for use in speech recognition. We give special attention to determining the parameters for such models from sparse data. We also describe two decoding methods, one appropriate for constrained artificial languages and one appropriate for more realistic decoding tasks. To illustrate the usefulness of the methods described, we review a number of decoding results that have been obtained with them.

摘要

语音识别被表述为最大似然解码的问题。这种表述需要语音产生过程的统计模型。在本文中,我们描述了一些用于语音识别的统计模型。我们特别关注如何从稀疏数据中确定这些模型的参数。我们还描述了两种解码方法,一种适用于约束性人工语言,另一种适用于更现实的解码任务。为了说明所描述方法的有用性,我们回顾了用这些方法获得的一些解码结果。

相似文献

1
A maximum likelihood approach to continuous speech recognition.最大似然连续语音识别方法。
IEEE Trans Pattern Anal Mach Intell. 1983 Feb;5(2):179-90. doi: 10.1109/tpami.1983.4767370.
2
Neural speech recognition: continuous phoneme decoding using spatiotemporal representations of human cortical activity.神经语音识别:利用人类皮层活动的时空表征进行连续音素解码。
J Neural Eng. 2016 Oct;13(5):056004. doi: 10.1088/1741-2560/13/5/056004. Epub 2016 Aug 3.
3
Visual speech processing: word-decoding and word-discrimination related to sentence-based speechreading and hearing-impairment.视觉言语处理:与基于句子的唇读和听力障碍相关的单词解码和单词辨别
Scand J Psychol. 1991;32(1):9-17. doi: 10.1111/j.1467-9450.1991.tb00847.x.
4
Masked Speech Recognition and Reading Ability in School-Age Children: Is There a Relationship?学龄儿童的掩蔽语音识别与阅读能力:它们之间有关系吗?
J Speech Lang Hear Res. 2018 Mar 15;61(3):776-788. doi: 10.1044/2017_JSLHR-H-17-0279.
5
Real-time Controlling Dynamics Sensing in Air Traffic System.空中交通系统中的实时控制动态感应。
Sensors (Basel). 2019 Feb 7;19(3):679. doi: 10.3390/s19030679.
6
Statistical modeling of speech Poincaré sections in combination of frequency analysis to improve speech recognition performance.联合频率分析的语音庞加莱截面的统计建模以提高语音识别性能。
Chaos. 2010 Sep;20(3):033106. doi: 10.1063/1.3463722.
7
Model adaptation method for recognition of speech with missing frames.用于识别缺失帧语音的模型自适应方法。
J Acoust Soc Am. 2014 Mar;135(3):EL166-71. doi: 10.1121/1.4865264.
8
Noise-robust speech recognition through auditory feature detection and spike sequence decoding.通过听觉特征检测和尖峰序列解码实现抗噪语音识别。
Neural Comput. 2014 Mar;26(3):523-56. doi: 10.1162/NECO_a_00557. Epub 2013 Dec 9.
9
Improved model adaptation approach for recognition of reduced-frame-rate continuous speech.改进的模型自适应方法,用于识别降帧率连续语音。
PLoS One. 2018 Nov 7;13(11):e0206916. doi: 10.1371/journal.pone.0206916. eCollection 2018.
10
EMG-based speech recognition using hidden markov models with global control variables.基于肌电图的语音识别,使用带有全局控制变量的隐马尔可夫模型。
IEEE Trans Biomed Eng. 2008 Mar;55(3):930-40. doi: 10.1109/TBME.2008.915658.

引用本文的文献

1
The UCI Phonotactic Calculator: An online tool for computing phonotactic metrics.加州大学欧文分校音位结构计算器:一个用于计算音位结构指标的在线工具。
Behav Res Methods. 2025 Jul 3;57(8):218. doi: 10.3758/s13428-025-02725-z.
2
The utility of generative artificial intelligence Chatbot (ChatGPT) in generating teaching and learning material for anesthesiology residents.生成式人工智能聊天机器人(ChatGPT)在为麻醉学住院医师生成教学材料方面的效用。
Front Artif Intell. 2025 May 21;8:1582096. doi: 10.3389/frai.2025.1582096. eCollection 2025.
3
Spatial-Temporal Transformer Networks for Traffic Flow Forecasting Using a Pre-Trained Language Model.
基于预训练语言模型的时空Transformer网络用于交通流预测
Sensors (Basel). 2024 Aug 25;24(17):5502. doi: 10.3390/s24175502.
4
Towards AI-Driven Healthcare: Systematic Optimization, Linguistic Analysis, and Clinicians' Evaluation of Large Language Models for Smoking Cessation Interventions.迈向人工智能驱动的医疗保健:大型语言模型在戒烟干预中的系统优化、语言分析及临床医生评估
Proc SIGCHI Conf Hum Factor Comput Syst. 2024 May;2024. doi: 10.1145/3613904.3641965. Epub 2024 May 11.
5
A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient.一种基于并行初始卷积神经网络和梅尔频率谱系数的新型无声语音识别方法。
Front Neurorobot. 2022 Sep 2;16:971446. doi: 10.3389/fnbot.2022.971446. eCollection 2022.
6
Sentimental Analysis of Twitter Users from Turkish Content with Natural Language Processing.基于自然语言处理的土耳其语内容中 Twitter 用户的情感分析。
Comput Intell Neurosci. 2022 Apr 13;2022:2455160. doi: 10.1155/2022/2455160. eCollection 2022.
7
Machine Learning in Relation to Emergency Medicine Clinical and Operational Scenarios: An Overview.机器学习与急诊医学临床和操作场景的关系:概述。
West J Emerg Med. 2019 Mar;20(2):219-227. doi: 10.5811/westjem.2019.1.41244. Epub 2019 Feb 14.
8
Deep Learning in Medical Imaging: General Overview.医学成像中的深度学习:概述
Korean J Radiol. 2017 Jul-Aug;18(4):570-584. doi: 10.3348/kjr.2017.18.4.570. Epub 2017 May 19.
9
Analysis, classification, and coding of multielectrode spike trains with hidden Markov models.
Biol Cybern. 1994;71(4):359-73. doi: 10.1007/BF00239623.