Molano-Mazón Manuel, Shao Yuxiu, Duque Daniel, Yang Guangyu Robert, Ostojic Srdjan, de la Rocha Jaime
IDIBAPS, Rosselló 149, Barcelona 08036, Spain.
Laboratoire de Neurosciences Cognitives, INSERM U960, École Normale Supérieure - PSL Research University, 75005 Paris, France.
Curr Biol. 2023 Feb 27;33(4):622-638.e7. doi: 10.1016/j.cub.2022.12.044. Epub 2023 Jan 18.
The strategies found by animals facing a new task are determined both by individual experience and by structural priors evolved to leverage the statistics of natural environments. Rats quickly learn to capitalize on the trial-sequence correlations of two-alternative forced choice (2AFC) tasks after correct trials but consistently deviate from optimal behavior after error trials. To understand this outcome-dependent gating, we first show that recurrent neural networks (RNNs) trained on the same 2AFC task outperform rats, as they readily learn to use across-trial information after both correct and error trials. We hypothesize that, although RNNs can optimize their behavior in the 2AFC task without any a priori restrictions, rats' strategy is constrained by a structural prior adapted to a natural environment in which rewarded and non-rewarded actions provide largely asymmetric information. When RNNs are pre-trained on a more ecological task with more than two possible choices, the networks develop a strategy by which they gate off the across-trial evidence after errors, mimicking rats' behavior. Population analyses show that the pre-trained networks form an accurate representation of the sequence statistics independently of the outcome of the previous trial. After error trials, gating is implemented by a change in the network dynamics that temporarily decouples the categorization of the stimulus from the across-trial accumulated evidence. Our results suggest that the rats' suboptimal behavior reflects the influence of a structural prior that reacts to errors by isolating the network decision dynamics from the context, ultimately constraining performance in a 2AFC laboratory task.