• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自由能优化的早期发声学习和对应匹配的脑启发模型。

Brain-inspired model for early vocal learning and correspondence matching using free-energy optimization.

机构信息

Laboratoire ETIS, CY Cergy Paris University, ENSEA, CNRS, UMR8051, Cergy, France.

出版信息

PLoS Comput Biol. 2021 Feb 18;17(2):e1008566. doi: 10.1371/journal.pcbi.1008566. eCollection 2021 Feb.

DOI:10.1371/journal.pcbi.1008566
PMID:33600482
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7891699/
Abstract

We propose a developmental model inspired by the cortico-basal system (CX-BG) for vocal learning in babies and for solving the correspondence mismatch problem they face when they hear unfamiliar voices, with different tones and pitches. This model is based on the neural architecture INFERNO standing for Iterative Free-Energy Optimization of Recurrent Neural Networks. Free-energy minimization is used for rapidly exploring, selecting and learning the optimal choices of actions to perform (eg sound production) in order to reproduce and control as accurately as possible the spike trains representing desired perceptions (eg sound categories). We detail in this paper the CX-BG system responsible for linking causally the sound and motor primitives at the order of a few milliseconds. Two experiments performed with a small and a large audio database show the capabilities of exploration, generalization and robustness to noise of our neural architecture in retrieving audio primitives during vocal learning and during acoustic matching with unheared voices (different genders and tones).

摘要

我们提出了一个受皮质基底系统(CX-BG)启发的发展模型,用于婴儿的发声学习以及解决他们在听到陌生声音(不同音调和音高)时面临的对应不匹配问题。该模型基于神经架构 INFERNO,代表用于递归神经网络的迭代自由能优化。自由能最小化用于快速探索、选择和学习执行最佳动作选择(例如发声),以便尽可能准确地再现和控制表示期望感知(例如声音类别)的尖峰序列。本文详细介绍了 CX-BG 系统,该系统负责在几毫秒的时间内因果地连接声音和运动原语。使用小型和大型音频数据库进行的两项实验表明,我们的神经架构在发声学习期间以及与未听过的声音(不同性别和音高)进行声学匹配期间检索音频原语时具有探索、泛化和对噪声鲁棒性的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4422/7891699/feda05393fd2/pcbi.1008566.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4422/7891699/82622f5ec07c/pcbi.1008566.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4422/7891699/feda05393fd2/pcbi.1008566.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4422/7891699/82622f5ec07c/pcbi.1008566.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4422/7891699/feda05393fd2/pcbi.1008566.g010.jpg

相似文献

1
Brain-inspired model for early vocal learning and correspondence matching using free-energy optimization.基于自由能优化的早期发声学习和对应匹配的脑启发模型。
PLoS Comput Biol. 2021 Feb 18;17(2):e1008566. doi: 10.1371/journal.pcbi.1008566. eCollection 2021 Feb.
2
Gated spiking neural network using Iterative Free-Energy Optimization and rank-order coding for structure learning in memory sequences (INFERNO GATE).基于迭代自由能优化和排序编码的门控尖峰神经网络用于记忆序列中的结构学习 (INFERNO GATE)。
Neural Netw. 2020 Jan;121:242-258. doi: 10.1016/j.neunet.2019.09.023. Epub 2019 Sep 25.
3
Auditory learning: a developmental method.听觉学习:一种发展方法。
IEEE Trans Neural Netw. 2005 May;16(3):601-16. doi: 10.1109/TNN.2005.845217.
4
Iterative free-energy optimization for recurrent neural networks (INFERNO).循环神经网络的迭代自由能优化(INFERNO)。
PLoS One. 2017 Mar 10;12(3):e0173684. doi: 10.1371/journal.pone.0173684. eCollection 2017.
5
A brain-like neural network for periodicity analysis.用于周期性分析的类脑神经网络。
IEEE Trans Syst Man Cybern B Cybern. 2005 Feb;35(1):12-22. doi: 10.1109/tsmcb.2004.837751.
6
Auditory experience refines cortico-basal ganglia inputs to motor cortex via remapping of single axons during vocal learning in zebra finches.听觉体验通过在斑马雀的发声学习过程中单个轴突的重新映射,对运动皮层的皮质-基底神经节输入进行了修正。
J Neurophysiol. 2012 Feb;107(4):1142-56. doi: 10.1152/jn.00614.2011. Epub 2011 Dec 7.
7
Auditory perceptual learning and changes in the conceptualization of auditory cortex.听觉感知学习与听觉皮层概念化的变化
Hear Res. 2018 Sep;366:3-16. doi: 10.1016/j.heares.2018.03.011. Epub 2018 Mar 12.
8
Selective consistency of recurrent neural networks induced by plasticity as a mechanism of unsupervised perceptual learning.可塑性诱导的递归神经网络的选择性一致性作为无监督感知学习的机制。
PLoS Comput Biol. 2024 Sep 3;20(9):e1012378. doi: 10.1371/journal.pcbi.1012378. eCollection 2024 Sep.
9
SSTE: Syllable-Specific Temporal Encoding to FORCE-learn audio sequences with an associative memory approach.SSTE:基于联想记忆方法的音节特定时间编码,用于强制学习音频序列。
Neural Netw. 2024 Sep;177:106368. doi: 10.1016/j.neunet.2024.106368. Epub 2024 May 7.
10
Exploring the anatomical encoding of voice with a mathematical model of the vocal system.用语音系统的数学模型探索语音的解剖学编码。
Neuroimage. 2016 Nov 1;141:31-39. doi: 10.1016/j.neuroimage.2016.07.033. Epub 2016 Jul 17.

引用本文的文献

1
Digital computing through randomness and order in neural networks.神经网络中的随机性和有序性进行数字计算。
Proc Natl Acad Sci U S A. 2022 Aug 16;119(33):e2115335119. doi: 10.1073/pnas.2115335119. Epub 2022 Aug 10.

本文引用的文献

1
Gated spiking neural network using Iterative Free-Energy Optimization and rank-order coding for structure learning in memory sequences (INFERNO GATE).基于迭代自由能优化和排序编码的门控尖峰神经网络用于记忆序列中的结构学习 (INFERNO GATE)。
Neural Netw. 2020 Jan;121:242-258. doi: 10.1016/j.neunet.2019.09.023. Epub 2019 Sep 25.
2
Emergence of an Action Repository as Part of a Biologically Inspired Model of Speech Processing: The Role of Somatosensory Information in Learning Phonetic-Phonological Sound Features.作为语音处理生物启发模型一部分的动作库的出现:体感信息在学习语音-音系学声音特征中的作用。
Front Psychol. 2019 Jul 10;10:1462. doi: 10.3389/fpsyg.2019.01462. eCollection 2019.
3
Fusing autonomy and sociality via embodied emergence and development of behaviour and cognition from fetal period.
通过胎儿期行为和认知的具身涌现和发展来融合自主性和社会性。
Philos Trans R Soc Lond B Biol Sci. 2019 Apr 29;374(1771):20180031. doi: 10.1098/rstb.2018.0031.
4
Computer simulations of coupled idiosyncrasies in speech perception and speech production with COSMO, a perceptuo-motor Bayesian model of speech communication.使用 COSMO(一种基于感知运动的言语交际贝叶斯模型)对言语感知和言语产生中的个体差异进行耦合的计算机模拟。
PLoS One. 2019 Jan 11;14(1):e0210302. doi: 10.1371/journal.pone.0210302. eCollection 2019.
5
Cognitive science in the era of artificial intelligence: A roadmap for reverse-engineering the infant language-learner.人工智能时代的认知科学:逆向工程婴儿语言学习者的路线图。
Cognition. 2018 Apr;173:43-59. doi: 10.1016/j.cognition.2017.11.008. Epub 2018 Jan 8.
6
The complementary roles of auditory and motor information evaluated in a Bayesian perceptuo-motor model of speech perception.在语音感知的贝叶斯感知运动模型中评估听觉和运动信息的互补作用。
Psychol Rev. 2017 Oct;124(5):572-602. doi: 10.1037/rev0000069. Epub 2017 May 4.
7
Iterative free-energy optimization for recurrent neural networks (INFERNO).循环神经网络的迭代自由能优化(INFERNO)。
PLoS One. 2017 Mar 10;12(3):e0173684. doi: 10.1371/journal.pone.0173684. eCollection 2017.
8
Spatio-Temporal Tolerance of Visuo-Tactile Illusions in Artificial Skin by Recurrent Neural Network with Spike-Timing-Dependent Plasticity.基于具有尖峰时间依赖可塑性的递归神经网络的人工皮肤视触错觉的时空容忍度。
Sci Rep. 2017 Jan 20;7:41056. doi: 10.1038/srep41056.
9
Predictive coding as a model of cognition.预测编码作为一种认知模型。
Cogn Process. 2016 Aug;17(3):279-305. doi: 10.1007/s10339-016-0765-6. Epub 2016 Apr 27.
10
Learning to Produce Syllabic Speech Sounds via Reward-Modulated Neural Plasticity.通过奖励调制神经可塑性学习发出音节语音
PLoS One. 2016 Jan 25;11(1):e0145096. doi: 10.1371/journal.pone.0145096. eCollection 2016.