• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无监督学习预测人类对光泽的感知和错觉。

Unsupervised learning predicts human perception and misperception of gloss.

机构信息

Department of Experimental Psychology, Justus Liebig University Giessen, Giessen, Germany.

School of Psychology, University of Sydney, Sydney, Australia.

出版信息

Nat Hum Behav. 2021 Oct;5(10):1402-1417. doi: 10.1038/s41562-021-01097-6. Epub 2021 May 6.

DOI:10.1038/s41562-021-01097-6
PMID:33958744
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8526360/
Abstract

Reflectance, lighting and geometry combine in complex ways to create images. How do we disentangle these to perceive individual properties, such as surface glossiness? We suggest that brains disentangle properties by learning to model statistical structure in proximal images. To test this hypothesis, we trained unsupervised generative neural networks on renderings of glossy surfaces and compared their representations with human gloss judgements. The networks spontaneously cluster images according to distal properties such as reflectance and illumination, despite receiving no explicit information about these properties. Intriguingly, the resulting representations also predict the specific patterns of 'successes' and 'errors' in human perception. Linearly decoding specular reflectance from the model's internal code predicts human gloss perception better than ground truth, supervised networks or control models, and it predicts, on an image-by-image basis, illusions of gloss perception caused by interactions between material, shape and lighting. Unsupervised learning may underlie many perceptual dimensions in vision and beyond.

摘要

反射率、光照和几何形状以复杂的方式结合在一起,形成了图像。我们如何将这些因素分开,以感知单个属性,例如表面光泽度?我们认为,大脑通过学习对近端图像中的统计结构进行建模来分离属性。为了验证这一假设,我们在有光泽表面的渲染图像上训练了无监督生成式神经网络,并将它们的表示与人类的光泽判断进行了比较。尽管这些网络没有接收到关于这些属性的明确信息,但它们会根据反射率和照明等远端属性自发地对图像进行聚类。有趣的是,由此产生的表示还可以预测人类感知中的具体“成功”和“错误”模式。从模型的内部代码中线性解码镜面反射率,比真实情况、监督网络或控制模型更能预测人类的光泽感知,并且可以基于每幅图像的基础,预测出由于材料、形状和照明之间的相互作用而产生的光泽感知错觉。无监督学习可能是视觉和其他领域中许多感知维度的基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/033850b0b307/41562_2021_1097_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/020852d229b1/41562_2021_1097_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/516c75e172d5/41562_2021_1097_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/17eecd910bbf/41562_2021_1097_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/5c5777231514/41562_2021_1097_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/4dcce6691589/41562_2021_1097_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/0ebd543ef53d/41562_2021_1097_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/bf15734b2c04/41562_2021_1097_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/033850b0b307/41562_2021_1097_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/020852d229b1/41562_2021_1097_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/516c75e172d5/41562_2021_1097_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/17eecd910bbf/41562_2021_1097_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/5c5777231514/41562_2021_1097_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/4dcce6691589/41562_2021_1097_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/0ebd543ef53d/41562_2021_1097_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/bf15734b2c04/41562_2021_1097_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4476/8526360/033850b0b307/41562_2021_1097_Fig8_HTML.jpg

相似文献

1
Unsupervised learning predicts human perception and misperception of gloss.无监督学习预测人类对光泽的感知和错觉。
Nat Hum Behav. 2021 Oct;5(10):1402-1417. doi: 10.1038/s41562-021-01097-6. Epub 2021 May 6.
2
The perception and misperception of specular surface reflectance.镜面反射率的感知与误解。
Curr Biol. 2012 Oct 23;22(20):1909-13. doi: 10.1016/j.cub.2012.08.009. Epub 2012 Sep 6.
3
Naturally glossy: Gloss perception, illumination statistics, and tone mapping.自然光泽:光泽感知、光照统计与色调映射。
J Vis. 2018 Dec 3;18(13):4. doi: 10.1167/18.13.4.
4
The perception of hazy gloss.朦胧光泽感
J Vis. 2017 May 1;17(5):19. doi: 10.1167/17.5.19.
5
Luminance edge is a cue for glossiness perception based on low-luminance specular components.亮度边缘是基于低亮度镜面反射成分的光泽度感知线索。
J Vis. 2019 Oct 1;19(12):5. doi: 10.1167/19.12.5.
6
Highlights, disparity, and perceived gloss with convex and concave surfaces.高光、差异以及凸面和凹面的感知光泽。
J Vis. 2013 Jan 4;13(1):9. doi: 10.1167/13.1.9.
7
The dark side of gloss.光彩背后的阴暗面。
Nat Neurosci. 2012 Nov;15(11):1590-5. doi: 10.1038/nn.3221. Epub 2012 Sep 23.
8
Generative constraints on image cues for perceived gloss.对感知光泽的图像线索的生成性约束。
J Vis. 2013 Dec 2;13(14):2. doi: 10.1167/13.14.2.
9
Lightness perception for matte and glossy complex shapes.哑光和光泽复杂形状的明度感知。
Vision Res. 2017 Feb;131:82-95. doi: 10.1016/j.visres.2016.12.004. Epub 2017 Jan 13.
10
The perception of gloss depends on highlight congruence with surface shading.光泽感取决于高光与表面阴影的一致性。
J Vis. 2011 Aug 12;11(9):4. doi: 10.1167/11.9.4.

引用本文的文献

1
Human-like monocular depth biases in deep neural networks.深度神经网络中类似人类的单眼深度偏差。
PLoS Comput Biol. 2025 Aug 19;21(8):e1013020. doi: 10.1371/journal.pcbi.1013020. eCollection 2025 Aug.
2
Computational models reveal that intuitive physics underlies visual processing of soft objects.计算模型表明,直观物理学是软物体视觉处理的基础。
Nat Commun. 2025 Jul 9;16(1):6303. doi: 10.1038/s41467-025-61458-x.
3
Energy optimization induces predictive-coding properties in a multi-compartment spiking neural network model.

本文引用的文献

1
Five points to check when comparing visual perception in humans and machines.比较人与机器的视觉感知时需要检查的五个要点。
J Vis. 2021 Mar 1;21(3):16. doi: 10.1167/jov.21.3.16.
2
Controversial stimuli: Pitting neural networks against each other as models of human cognition.有争议的刺激:将神经网络作为人类认知模型进行相互竞争。
Proc Natl Acad Sci U S A. 2020 Nov 24;117(47):29330-29337. doi: 10.1073/pnas.1912334117.
3
Children's use of local and global visual features for material perception.儿童利用局部和全局视觉特征进行材料感知。
能量优化在多房室脉冲神经网络模型中诱导预测编码特性。
PLoS Comput Biol. 2025 Jun 10;21(6):e1013112. doi: 10.1371/journal.pcbi.1013112. eCollection 2025 Jun.
4
Human shape perception spontaneously discovers the biological origin of novel, but natural, stimuli.人类形状感知能自发地发现新颖但自然的刺激的生物学起源。
J R Soc Interface. 2025 May;22(226):20240931. doi: 10.1098/rsif.2024.0931. Epub 2025 May 21.
5
Poisson Variational Autoencoder.泊松变分自编码器
ArXiv. 2024 Dec 9:arXiv:2405.14473v2.
6
Parallel development of object recognition in newborn chicks and deep neural networks.新生雏鸡与深度神经网络中物体识别的并行发展
PLoS Comput Biol. 2024 Dec 2;20(12):e1012600. doi: 10.1371/journal.pcbi.1012600. eCollection 2024 Dec.
7
Probing the link between vision and language in material perception using psychophysics and unsupervised learning.使用心理物理学和无监督学习探究物质感知中视觉和语言之间的联系。
PLoS Comput Biol. 2024 Oct 3;20(10):e1012481. doi: 10.1371/journal.pcbi.1012481. eCollection 2024 Oct.
8
High Dynamic Range Image Reconstruction from Saturated Images of Metallic Objects.基于金属物体饱和图像的高动态范围图像重建
J Imaging. 2024 Apr 15;10(4):92. doi: 10.3390/jimaging10040092.
9
Hierarchical VAEs provide a normative account of motion processing in the primate brain.分层变分自编码器为灵长类大脑中的运动处理提供了一种规范性解释。
bioRxiv. 2023 Nov 5:2023.09.27.559646. doi: 10.1101/2023.09.27.559646.
10
Color and gloss constancy under diverse lighting environments.在不同光照环境下的颜色和光泽恒常性。
J Vis. 2023 Jul 3;23(7):8. doi: 10.1167/jov.23.7.8.
J Vis. 2020 Feb 10;20(2):10. doi: 10.1167/jov.20.2.10.
4
Mid-level vision.中观视觉。
Curr Biol. 2020 Feb 3;30(3):R105-R109. doi: 10.1016/j.cub.2019.11.088.
5
Learning to see stuff.学着观察事物。
Curr Opin Behav Sci. 2019 Dec;30:100-108. doi: 10.1016/j.cobeha.2019.07.004.
6
A deep learning framework for neuroscience.深度学习在神经科学中的应用框架。
Nat Neurosci. 2019 Nov;22(11):1761-1770. doi: 10.1038/s41593-019-0520-2. Epub 2019 Oct 28.
7
Letter perception emerges from unsupervised deep learning and recycling of natural image features.字母感知源于无监督深度学习和自然图像特征的循环利用。
Nat Hum Behav. 2017 Sep;1(9):657-664. doi: 10.1038/s41562-017-0186-2. Epub 2017 Aug 21.
8
Naturally glossy: Gloss perception, illumination statistics, and tone mapping.自然光泽:光泽感知、光照统计与色调映射。
J Vis. 2018 Dec 3;18(13):4. doi: 10.1167/18.13.4.
9
Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.大规模、高分辨率的人类、猴子和最先进的深度人工神经网络核心视觉对象识别行为比较。
J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.
10
Material and shape perception based on two types of intensity gradient information.基于两种类型的强度梯度信息的材料和形状感知。
PLoS Comput Biol. 2018 Apr 27;14(4):e1006061. doi: 10.1371/journal.pcbi.1006061. eCollection 2018 Apr.