Speech recognition by an artificial neural network using findings on the afferent auditory system.

Author information

Kurogi S

Affiliation

Division of Control Engineering, Kyushu Institute of Technology, Kitakyushu, Japan.

Publication information

Biol Cybern. 1991;64(3):243-9. doi: 10.1007/BF00201985.

DOI: 10.1007/BF00201985
PMID: 2004135
Abstract

An artificial neural network which uses anatomical and physiological findings on the afferent pathway from the ear to the cortex is presented and the roles of the constituent functions in recognition of continuous speech are examined. The network deals with successive spectra of speech sounds by a cascade of several neural layers: lateral excitation layer (LEL), lateral inhibition layer (LIL), and a pile of feature detection layers (FDL's). These layers are shown to be effective for recognizing spoken words. Namely, first, LEL reduces the distortion of sound spectrum caused by the pitch of speech sounds. Next, LIL emphasizes the major energy peaks of sound spectrum, the formants. Last, FDL's detect syllables and words in successive formants, where two functions, time-delay and strong adaptation, play important roles: time-delay makes it possible to retain the pattern of formant changes for a period to detect spoken words successively; strong adaptation contributes to removing the time-warp of formant changes. Digital computer simulations show that the network detects isolated syllables, isolated words, and connected words in continuous speech, while reproducing the fundamental responses found in the auditory system such as ON, OFF, ON-OFF, and SUSTAINED patterns.
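
The roles the abstract assigns to the first two layers, plus adaptation, can be illustrated with a toy example. The abstract gives no equations, so everything below is an assumption made for illustration: the Gaussian neighbourhood weights, the subtract-and-rectify inhibition, the leaky adaptation rule, the 64-channel synthetic spectrum, and all function and parameter names. It is a minimal sketch of the general idea, not the model from the paper.

```python
import math

def gaussian_kernel(sigma, radius):
    """Normalized Gaussian weights for channel offsets -radius..radius."""
    weights = [math.exp(-(k * k) / (2.0 * sigma * sigma))
               for k in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def spread(values, kernel):
    """Weighted sum over neighbouring channels; edge channels are repeated."""
    radius = len(kernel) // 2
    last = len(values) - 1
    return [sum(w * values[min(max(i + k - radius, 0), last)]
                for k, w in enumerate(kernel))
            for i in range(len(values))]

def lateral_excitation(spectrum, sigma=2.0):
    """Assumed LEL stand-in: each channel excites its near neighbours,
    which smooths out the ripple produced by pitch harmonics."""
    return spread(spectrum, gaussian_kernel(sigma, int(3 * sigma)))

def lateral_inhibition(spectrum, sigma=6.0):
    """Assumed LIL stand-in: subtract a broad neighbourhood average and
    rectify, so only channels above their surround stay active."""
    surround = spread(spectrum, gaussian_kernel(sigma, int(3 * sigma)))
    return [max(x - s, 0.0) for x, s in zip(spectrum, surround)]

def adapting_unit(inputs, rate=0.5):
    """Assumed adaptation rule: the output is the input minus a state that
    tracks the input, so a sustained input gives a decaying response."""
    state, outputs = 0.0, []
    for x in inputs:
        outputs.append(max(x - state, 0.0))
        state += rate * (x - state)
    return outputs

def argmax(values):
    """Index of the largest element (first one on ties)."""
    return max(range(len(values)), key=values.__getitem__)

# Synthetic spectrum: two formant-like bumps (channels 22 and 46)
# multiplied by a harmonic ripple with a period of 4 channels.
CHANNELS = 64

def envelope(i):
    return (math.exp(-((i - 22) ** 2) / 32.0)
            + 0.8 * math.exp(-((i - 46) ** 2) / 32.0))

raw = [envelope(i) * 0.5 * (1.0 + math.cos(2.0 * math.pi * i / 4.0))
       for i in range(CHANNELS)]
smoothed = lateral_excitation(raw)
sharpened = lateral_inhibition(smoothed)
on_response = adapting_unit([0, 0, 1, 1, 1, 1, 0, 0])

print("raw peak channel:", argmax(raw))
print("peak channel after excitation:", argmax(smoothed))
print("active channels after inhibition:",
      [i for i, v in enumerate(sharpened) if v > 0.0])
print("response to a step input:", on_response)
```

In this toy spectrum the strongest raw channel sits on a harmonic next to the first bump rather than at its centre, and the smoothing step moves the peak back to channel 22. The inhibition step then silences the valley between the two bumps, and the adapting unit responds to a step input with a transient that decays while the input is held, loosely resembling an ON pattern.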

Similar articles

1
Speech recognition by an artificial neural network using findings on the afferent auditory system.
Biol Cybern. 1991;64(3):243-9. doi: 10.1007/BF00201985.
2
A temporal analysis of auditory-nerve fiber responses to spoken stop consonant-vowel syllables.
J Acoust Soc Am. 1986 Jun;79(6):1896-914. doi: 10.1121/1.393197.
3
Tonotopic features of speech-evoked activity in primate auditory cortex.
Brain Res. 1990 Jun 11;519(1-2):158-68. doi: 10.1016/0006-8993(90)90074-l.
4
Neural sensitivity to statistical regularities as a fundamental biological process that underlies auditory learning: the role of musical practice.
Hear Res. 2014 Feb;308:122-8. doi: 10.1016/j.heares.2013.08.018. Epub 2013 Sep 12.
5
Auditory responsive cortex in the squirrel monkey: neural responses to amplitude-modulated sounds.
Exp Brain Res. 1996 Mar;108(2):273-84. doi: 10.1007/BF00228100.
6
Attention Is Required for Knowledge-Based Sequential Grouping: Insights from the Integration of Syllables into Words.
J Neurosci. 2018 Jan 31;38(5):1178-1188. doi: 10.1523/JNEUROSCI.2606-17.2017. Epub 2017 Dec 18.
7
How do auditory cortex neurons represent communication sounds?
Hear Res. 2013 Nov;305:102-12. doi: 10.1016/j.heares.2013.03.011. Epub 2013 Apr 17.
8
Enhanced sound perception by widespread-onset neuronal responses in auditory cortex.
Neural Comput. 2007 Dec;19(12):3310-34. doi: 10.1162/neco.2007.19.12.3310.
9
Predictive Brain Mechanisms in Sound-to-Meaning Mapping during Speech Processing.
J Neurosci. 2016 Oct 19;36(42):10813-10822. doi: 10.1523/JNEUROSCI.0583-16.2016.
10
Harmonic template neurons in primate auditory cortex underlying complex sound processing.
Proc Natl Acad Sci U S A. 2017 Jan 31;114(5):E840-E848. doi: 10.1073/pnas.1607519114. Epub 2017 Jan 17.

Cited by

1
A global view on how local muscular fatigue affects human performance.
Proc Natl Acad Sci U S A. 2020 Aug 18;117(33):19866-19872. doi: 10.1073/pnas.2007579117. Epub 2020 Aug 4.
2
A neural network application to classification of health status of HIV/AIDS patients.
J Med Syst. 1997 Apr;21(2):87-97. doi: 10.1023/a:1022890223449.
3
Noninvasive diagnosis of coronary artery disease using a neural network algorithm.

References

1
A model of neural network for spatiotemporal pattern recognition.
Biol Cybern. 1987;57(1-2):103-14. doi: 10.1007/BF00318720.
2
Pitch-synchronous response of cat cochlear nerve fibers to speech sounds.
Jpn J Physiol. 1975;25(5):633-44. doi: 10.2170/jjphysiol.25.633.
Biol Cybern. 1992;67(4):361-7. doi: 10.1007/BF02414891.