• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

破解卷积神经网络中单词识别的神经密码。

Cracking the neural code for word recognition in convolutional neural networks.

机构信息

Cognitive Neuroimaging Unit, CEA, INSERM U 992, Université Paris-Saclay, NeuroSpin center, Gif/Yvette, France.

Collège de France, Université Paris Sciences Lettres (PSL), Paris, France.

出版信息

PLoS Comput Biol. 2024 Sep 6;20(9):e1012430. doi: 10.1371/journal.pcbi.1012430. eCollection 2024 Sep.

DOI:10.1371/journal.pcbi.1012430
PMID:39241019
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11410253/
Abstract

Learning to read places a strong challenge on the visual system. Years of expertise lead to a remarkable capacity to separate similar letters and encode their relative positions, thus distinguishing words such as FORM and FROM, invariantly over a large range of positions, sizes and fonts. How neural circuits achieve invariant word recognition remains unknown. Here, we address this issue by recycling deep neural network models initially trained for image recognition. We retrain them to recognize written words and then analyze how reading-specialized units emerge and operate across the successive layers. With literacy, a small subset of units becomes specialized for word recognition in the learned script, similar to the visual word form area (VWFA) in the human brain. We show that these units are sensitive to specific letter identities and their ordinal position from the left or the right of a word. The transition from retinotopic to ordinal position coding is achieved by a hierarchy of "space bigram" unit that detect the position of a letter relative to a blank space and that pool across low- and high-frequency-sensitive units from early layers of the network. The proposed scheme provides a plausible neural code for written words in the VWFA, and leads to predictions for reading behavior, error patterns, and the neurophysiology of reading.

摘要

学习阅读对视觉系统提出了巨大的挑战。多年的专业知识使我们能够出色地区分相似的字母并编码它们的相对位置,从而能够不变地在大范围的位置、大小和字体中区分出单词,例如 FORM 和 FROM。但是,神经回路如何实现不变的单词识别仍然未知。在这里,我们通过重新利用最初为图像识别而训练的深度神经网络模型来解决这个问题。我们重新训练它们以识别书面单词,然后分析阅读专业单元如何在连续的层中出现和运作。在读写能力方面,一小部分单元会专门识别所学文字中的单词,类似于人类大脑中的视觉单词形式区域(VWFA)。我们表明,这些单元对特定的字母身份及其在单词左侧或右侧的序号位置敏感。从视网膜坐标到序号位置的编码是通过“空间双元组”单元的层次结构实现的,这些单元检测字母相对于空白的位置,并在网络的早期层中汇总来自低和高频敏感单元的信息。所提出的方案为 VWFA 中的书面单词提供了一种合理的神经编码,并为阅读行为、错误模式和阅读的神经生理学提供了预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/2256d913d5e1/pcbi.1012430.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/b0af4317f8da/pcbi.1012430.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/ade2f859db02/pcbi.1012430.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/cd79b9c3cc1c/pcbi.1012430.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/5068d6e071e5/pcbi.1012430.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/e931f14a5883/pcbi.1012430.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/79715dbf4b14/pcbi.1012430.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/2256d913d5e1/pcbi.1012430.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/b0af4317f8da/pcbi.1012430.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/ade2f859db02/pcbi.1012430.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/cd79b9c3cc1c/pcbi.1012430.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/5068d6e071e5/pcbi.1012430.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/e931f14a5883/pcbi.1012430.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/79715dbf4b14/pcbi.1012430.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5123/11410253/2256d913d5e1/pcbi.1012430.g007.jpg

相似文献

1
Cracking the neural code for word recognition in convolutional neural networks.破解卷积神经网络中单词识别的神经密码。
PLoS Comput Biol. 2024 Sep 6;20(9):e1012430. doi: 10.1371/journal.pcbi.1012430. eCollection 2024 Sep.
2
Emergence of a compositional neural code for written words: Recycling of a convolutional neural network for reading.书面文字组合神经编码的出现:用于阅读的卷积神经网络的再利用。
Proc Natl Acad Sci U S A. 2021 Nov 16;118(46). doi: 10.1073/pnas.2104779118.
3
A compositional neural code in high-level visual cortex can explain jumbled word reading.高级视觉皮层中的组合神经代码可以解释乱序单词阅读。
Elife. 2020 May 5;9:e54846. doi: 10.7554/eLife.54846.
4
The lazy visual word form area: computational insights into location-sensitivity.视觉词汇惰性区:位置敏感性的计算洞察
PLoS Comput Biol. 2013;9(10):e1003250. doi: 10.1371/journal.pcbi.1003250. Epub 2013 Oct 3.
5
Development of the Visual Word Form Area Requires Visual Experience: Evidence from Blind Braille Readers.视觉词形区的发育需要视觉体验:来自盲人盲文阅读者的证据。
J Neurosci. 2017 Nov 22;37(47):11495-11504. doi: 10.1523/JNEUROSCI.0997-17.2017. Epub 2017 Oct 23.
6
Deep generative learning of location-invariant visual word recognition.位置不变视觉词识别的深度生成学习。
Front Psychol. 2013 Sep 19;4:635. doi: 10.3389/fpsyg.2013.00635. eCollection 2013.
7
Visual word recognition in the left and right hemispheres: anatomical and functional correlates of peripheral alexias.左右半球的视觉单词识别:周边失读症的解剖学和功能相关性
Cereb Cortex. 2003 Dec;13(12):1313-33. doi: 10.1093/cercor/bhg079.
8
Language-specific tuning of visual cortex? Functional properties of the Visual Word Form Area.视觉皮层的语言特异性调谐?视觉词形区的功能特性。
Brain. 2002 May;125(Pt 5):1054-69. doi: 10.1093/brain/awf094.
9
The VWFA Is the Home of Orthographic Learning When Houses Are Used as Letters.当房子被用作字母时,VWFA 是正字法学习的所在地。
eNeuro. 2019 Feb 15;6(1). doi: 10.1523/ENEURO.0425-17.2019. eCollection 2019 Jan-Feb.
10
Visual Experience Shapes Orthographic Representations in the Visual Word Form Area.视觉体验塑造视觉词形区的正字法表征。
Psychol Sci. 2016 Sep;27(9):1240-8. doi: 10.1177/0956797616657319. Epub 2016 Jul 19.

引用本文的文献

1
Modality-Specific and Amodal Language Processing by Single Neurons.单个神经元的模态特异性和非模态语言处理
bioRxiv. 2024 Nov 22:2024.11.16.623907. doi: 10.1101/2024.11.16.623907.

本文引用的文献

1
Characterizing different types of developmental dyslexias in French: The Malabi screener.法国不同类型发育性阅读障碍的特征:Malabi 筛查器。
Cogn Neuropsychol. 2023 Oct-Dec;40(7-8):319-350. doi: 10.1080/02643294.2024.2327665. Epub 2024 Jun 3.
2
Does the visual word form area split in bilingual readers? A millimeter-scale 7-T fMRI study.双语读者的视觉词形区是否分离?一项毫米级 7-T fMRI 研究。
Sci Adv. 2023 Apr 5;9(14):eadf6140. doi: 10.1126/sciadv.adf6140.
3
High-density single-unit human cortical recordings using the Neuropixels probe.
高密度单神经元人类皮质记录使用神经像素探针。
Neuron. 2022 Aug 3;110(15):2409-2421.e3. doi: 10.1016/j.neuron.2022.05.007. Epub 2022 Jun 8.
4
A Spatiotemporal Map of Reading Aloud.阅读 aloud 的时空图谱。
J Neurosci. 2022 Jul 6;42(27):5438-5450. doi: 10.1523/JNEUROSCI.2324-21.2022. Epub 2022 May 31.
5
Letter processing in upright bigrams predicts reading fluency variations in children.正立双字母词加工预测儿童阅读流畅性变化。
J Exp Psychol Gen. 2022 Sep;151(9):2237-2249. doi: 10.1037/xge0001175. Epub 2022 Feb 10.
6
Large-scale neural recordings with single neuron resolution using Neuropixels probes in human cortex.使用神经像素探针在人类皮层中进行具有单神经元分辨率的大规模神经记录。
Nat Neurosci. 2022 Feb;25(2):252-263. doi: 10.1038/s41593-021-00997-0. Epub 2022 Jan 31.
7
Emergence of a compositional neural code for written words: Recycling of a convolutional neural network for reading.书面文字组合神经编码的出现:用于阅读的卷积神经网络的再利用。
Proc Natl Acad Sci U S A. 2021 Nov 16;118(46). doi: 10.1073/pnas.2104779118.
8
Spatiotemporal dynamics of orthographic and lexical processing in the ventral visual pathway.腹侧视觉通路上的字形和词汇处理的时空动态。
Nat Hum Behav. 2021 Mar;5(3):389-398. doi: 10.1038/s41562-020-00982-w. Epub 2020 Nov 30.
9
The inferior temporal cortex is a potential cortical precursor of orthographic processing in untrained monkeys.下颞叶皮层是未经训练的猴子在字形处理中的潜在皮质前体。
Nat Commun. 2020 Aug 4;11(1):3886. doi: 10.1038/s41467-020-17714-3.
10
A M/EEG-fMRI Fusion Primer: Resolving Human Brain Responses in Space and Time.M/EEG-fMRI 融合入门:解析人类大脑在空间和时间上的反应。
Neuron. 2020 Sep 9;107(5):772-781. doi: 10.1016/j.neuron.2020.07.001. Epub 2020 Jul 27.