Human and artificial visual systems share a computational principle for transforming binocular disparity into depth representation.

Authors

Wundari Bayu Gautama, Fujita Ichiro, Ban Hiroshi

Affiliations

Graduate School of Frontier Biosciences, Osaka University, Suita, Japan.

Center for Information and Neural Networks (CiNet), Advanced ICT Research Institute, National Institute of Information and Communications Technology, Suita, Japan.

Publication information

Commun Biol. 2025 Jul 11;8(1):1042. doi: 10.1038/s42003-025-08474-1.

DOI: 10.1038/s42003-025-08474-1
PMID: 40646303

Abstract

Our visual brain transforms small differences between images in the two eyes (binocular disparity) into coherent depth. Initially, neurons in the primary visual cortex (V1) compute the degrees of overlap between the left and right images to encode disparity. Such cross-correlation-like neurons respond to both binocularly matched and mismatched features. This ambiguous representation is refined along the visual pathway through a cross-matching computation involving additional nonlinear processing to filter out mismatches. How these representations are organized in the human visual cortex remains unclear. Using functional magnetic resonance imaging (fMRI), we show that areas V1-V3 exhibit stronger cross-correlation components, while V3A/B, V7, hV4, and hMT+ are inclined towards cross-matching. A deep neural network (DNN) trained for stereo vision undergoes a similar transformation across its layers, progressing through distinct phases that exploit dissimilar features to achieve coherent depth. This brain-DNN alignment demonstrates that human and artificial visual systems share a computational principle for robust 3D vision.
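
The distinction between the two computations named in the abstract can be illustrated with a toy example. The sketch below is not the authors' model or analysis pipeline; it assumes a one-dimensional random-dot stereogram with a circular shift, and it stands in for the "additional nonlinear processing" with a simple half-wave rectification. All function names (`make_stereogram`, `cross_correlation`, `cross_matching`) are hypothetical and introduced only for illustration.

```python
import random


def make_stereogram(n, disparity, anticorrelated=False, seed=0):
    """Build a 1D random-dot stereogram of +1/-1 dots.

    The right-eye row is the left-eye row shifted circularly by `disparity`.
    If `anticorrelated` is True, the right-eye contrast is inverted, so every
    dot pair at the true disparity is a binocular mismatch.
    """
    rng = random.Random(seed)
    left = [rng.choice((-1, 1)) for _ in range(n)]
    sign = -1 if anticorrelated else 1
    right = [sign * left[(i - disparity) % n] for i in range(n)]
    return left, right


def cross_correlation(left, right, d):
    """Mean product of left dots and right dots offset by candidate disparity d.

    The output is signed: matched dots drive it positive, mismatched
    (contrast-inverted) dots drive it negative, so it responds to both.
    """
    n = len(left)
    return sum(left[i] * right[(i + d) % n] for i in range(n)) / n


def cross_matching(left, right, d):
    """Toy cross-matching unit: half-wave rectified correlation.

    Assumption: rectification is used here as the simplest nonlinearity
    that discards the response to mismatched features.
    """
    return max(0.0, cross_correlation(left, right, d))


if __name__ == "__main__":
    true_d = 3
    for label, anti in (("correlated", False), ("anticorrelated", True)):
        left, right = make_stereogram(400, true_d, anticorrelated=anti)
        cc = cross_correlation(left, right, true_d)
        cm = cross_matching(left, right, true_d)
        print(f"{label}: cross-correlation={cc:+.2f}, cross-matching={cm:.2f}")
```

In this toy setup the correlation unit gives a full-strength response of opposite sign to the contrast-inverted stimulus, whereas the rectified unit stays silent for it, which is one simple way to picture a representation that filters out mismatches.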
