• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

卷积架构是从头开始与皮质对齐的。

Convolutional architectures are cortex-aligned de novo.

作者信息

Kazemian Atlas, Elmoznino Eric, Bonner Michael F

机构信息

Department of Cognitive Science, Johns Hopkins University, Baltimore, MD, USA.

University of Montreal and MILA - Quebec AI Institute.

出版信息

bioRxiv. 2025 Jul 29:2024.05.10.593623. doi: 10.1101/2024.05.10.593623.

DOI:10.1101/2024.05.10.593623
PMID:40766564
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12324244/
Abstract

What underlies the emergence of cortex-aligned representations in deep neural network models of vision? Earlier work suggested that shared architectural constraints were a major factor, but the success of widely varied architectures after pre-training raises critical questions about the importance of architectural constraints. Here we show that in wide networks with minimal training, architectural inductive biases have a prominent role. We examined networks with varied architectures but no pre-training and quantified their ability to predict image representations in the visual cortices of monkeys and humans. We found that cortex-aligned representations emerge in convolutional architectures that combine two key manipulations of dimensionality: compression in the spatial domain, through pooling, and expansion in the feature domain by increasing the number of channels. We further show that the inductive biases of convolutional architectures are critical for obtaining performance gains from feature expansion-dimensionality manipulations were relatively ineffective in other architectures and in convolutional models with targeted lesions. Our findings suggest that the architectural constraints of convolutional networks are sufficiently close to the constraints of biological vision to allow many aspects of cortical visual representation to emerge even before synaptic connections have been tuned through experience.

摘要

在视觉深度神经网络模型中,与皮层对齐的表征是如何出现的?早期的研究表明,共享的架构约束是一个主要因素,但预训练后广泛多样的架构取得成功,引发了关于架构约束重要性的关键问题。在这里,我们表明,在经过最少训练的宽网络中,架构归纳偏差起着突出的作用。我们研究了各种架构但未经过预训练的网络,并量化了它们预测猴子和人类视觉皮层中图像表征的能力。我们发现,与皮层对齐的表征出现在结合了两种关键维度操作的卷积架构中:通过池化在空间域进行压缩,以及通过增加通道数量在特征域进行扩展。我们进一步表明,卷积架构的归纳偏差对于从特征扩展中获得性能提升至关重要——维度操作在其他架构以及具有靶向损伤的卷积模型中相对无效。我们的研究结果表明,卷积网络的架构约束与生物视觉的约束足够接近,以至于即使在通过经验调整突触连接之前,皮层视觉表征的许多方面也能出现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/ab2438b33904/nihpp-2024.05.10.593623v3-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/2088f7e9ee6e/nihpp-2024.05.10.593623v3-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/dd1ad8d3ba5d/nihpp-2024.05.10.593623v3-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/c17d54e004dc/nihpp-2024.05.10.593623v3-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/84a991eee91d/nihpp-2024.05.10.593623v3-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/22de72270d61/nihpp-2024.05.10.593623v3-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/5cea8b85ccfe/nihpp-2024.05.10.593623v3-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/52400eb45f1c/nihpp-2024.05.10.593623v3-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/94ab43c89f7c/nihpp-2024.05.10.593623v3-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/89a21dfe0a6f/nihpp-2024.05.10.593623v3-f0015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/3848ee06dcd7/nihpp-2024.05.10.593623v3-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/3376157acc8d/nihpp-2024.05.10.593623v3-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/b63125c39c6b/nihpp-2024.05.10.593623v3-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/f0d4cf416c49/nihpp-2024.05.10.593623v3-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/d31ee70bbb74/nihpp-2024.05.10.593623v3-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/ab2438b33904/nihpp-2024.05.10.593623v3-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/2088f7e9ee6e/nihpp-2024.05.10.593623v3-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/dd1ad8d3ba5d/nihpp-2024.05.10.593623v3-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/c17d54e004dc/nihpp-2024.05.10.593623v3-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/84a991eee91d/nihpp-2024.05.10.593623v3-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/22de72270d61/nihpp-2024.05.10.593623v3-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/5cea8b85ccfe/nihpp-2024.05.10.593623v3-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/52400eb45f1c/nihpp-2024.05.10.593623v3-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/94ab43c89f7c/nihpp-2024.05.10.593623v3-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/89a21dfe0a6f/nihpp-2024.05.10.593623v3-f0015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/3848ee06dcd7/nihpp-2024.05.10.593623v3-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/3376157acc8d/nihpp-2024.05.10.593623v3-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/b63125c39c6b/nihpp-2024.05.10.593623v3-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/f0d4cf416c49/nihpp-2024.05.10.593623v3-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/d31ee70bbb74/nihpp-2024.05.10.593623v3-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c0/12324244/ab2438b33904/nihpp-2024.05.10.593623v3-f0006.jpg

相似文献

1
Convolutional architectures are cortex-aligned de novo.卷积架构是从头开始与皮质对齐的。
bioRxiv. 2025 Jul 29:2024.05.10.593623. doi: 10.1101/2024.05.10.593623.
2
Short-Term Memory Impairment短期记忆障碍
3
Can We Enhance Shared Decision-making for Periacetabular Osteotomy Surgery? A Qualitative Study of Patient Experiences.我们能否加强髋臼周围截骨术的共同决策?一项关于患者体验的定性研究。
Clin Orthop Relat Res. 2025 Jan 1;483(1):120-136. doi: 10.1097/CORR.0000000000003198. Epub 2024 Jul 23.
4
"Living Independently Means Everything to Me": The Voice of Australian Autistic Adults.“独立生活对我来说意味着一切”:澳大利亚成年自闭症患者的心声。
Autism Adulthood. 2024 Sep 16;6(3):312-320. doi: 10.1089/aut.2022.0102. eCollection 2024 Sep.
5
Pharmacological interventions for those who have sexually offended or are at risk of offending.针对有性犯罪行为或有性犯罪风险者的药物干预措施。
Cochrane Database Syst Rev. 2015 Feb 18;2015(2):CD007989. doi: 10.1002/14651858.CD007989.pub2.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
7
"In a State of Flow": A Qualitative Examination of Autistic Adults' Phenomenological Experiences of Task Immersion.“心流状态”:对自闭症成年人任务沉浸现象学体验的质性研究
Autism Adulthood. 2024 Sep 16;6(3):362-373. doi: 10.1089/aut.2023.0032. eCollection 2024 Sep.
8
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.成年自闭症患者的就业生活经历:系统检索与综述
Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.
9
Effectiveness and cost-effectiveness of computer and other electronic aids for smoking cessation: a systematic review and network meta-analysis.计算机和其他电子戒烟辅助手段的有效性和成本效益:系统评价和网络荟萃分析。
Health Technol Assess. 2012;16(38):1-205, iii-v. doi: 10.3310/hta16380.
10
Factors that influence parents' and informal caregivers' views and practices regarding routine childhood vaccination: a qualitative evidence synthesis.影响父母和非正式照顾者对常规儿童疫苗接种看法和做法的因素:定性证据综合分析。
Cochrane Database Syst Rev. 2021 Oct 27;10(10):CD013265. doi: 10.1002/14651858.CD013265.pub2.

本文引用的文献

1
Universal dimensions of visual representation.视觉表征的通用维度。
Sci Adv. 2025 Jul 4;11(27):eadw7697. doi: 10.1126/sciadv.adw7697. Epub 2025 Jul 2.
2
High-performing neural network models of visual cortex benefit from high latent dimensionality.高表现的视觉皮层神经网络模型受益于高潜在维度。
PLoS Comput Biol. 2024 Jan 10;20(1):e1011792. doi: 10.1371/journal.pcbi.1011792. eCollection 2024 Jan.
3
THINGS-data, a multimodal collection of large-scale datasets for investigating object representations in human brain and behavior.
事物数据集(THINGS-data),一个多模态大型数据集集合,用于研究人类大脑和行为中的目标表示。
Elife. 2023 Feb 27;12:e82580. doi: 10.7554/eLife.82580.
4
Data-driven emergence of convolutional structure in neural networks.基于数据的神经网络卷积结构的涌现。
Proc Natl Acad Sci U S A. 2022 Oct 4;119(40):e2201854119. doi: 10.1073/pnas.2201854119. Epub 2022 Sep 26.
5
A self-supervised domain-general learning framework for human ventral stream representation.一种用于人类腹侧流表示的自监督领域泛化学习框架。
Nat Commun. 2022 Jan 25;13(1):491. doi: 10.1038/s41467-022-28091-4.
6
A massive 7T fMRI dataset to bridge cognitive neuroscience and artificial intelligence.一个用于连接认知神经科学与人工智能的大规模7T功能磁共振成像数据集。
Nat Neurosci. 2022 Jan;25(1):116-126. doi: 10.1038/s41593-021-00962-x. Epub 2021 Dec 16.
7
Face detection in untrained deep neural networks.未训练的深度神经网络中的人脸检测。
Nat Commun. 2021 Dec 16;12(1):7328. doi: 10.1038/s41467-021-27606-9.
8
Diverse Deep Neural Networks All Predict Human Inferior Temporal Cortex Well, After Training and Fitting.各种深度神经网络在经过训练和适配后都能很好地预测人类的下颞叶皮质。
J Cogn Neurosci. 2021 Sep 1;33(10):2044-2064. doi: 10.1162/jocn_a_01755.
9
Object representations in the human brain reflect the co-occurrence statistics of vision and language.人类大脑中的物体表示反映了视觉和语言的共现统计数据。
Nat Commun. 2021 Jul 2;12(1):4081. doi: 10.1038/s41467-021-24368-2.
10
If deep learning is the answer, what is the question?如果深度学习是答案,那么问题是什么?
Nat Rev Neurosci. 2021 Jan;22(1):55-67. doi: 10.1038/s41583-020-00395-8. Epub 2020 Nov 16.