Distinguishing mirror from glass: A "big data" approach to material perception.

Affiliations

Department of Computer Science and Engineering, Toyohashi University of Technology, Toyohashi, Aichi, Japan.

Department of Experimental Psychology, Justus Liebig University Giessen, Giessen, Germany.

Publication information

J Vis. 2022 Mar 2;22(4):4. doi: 10.1167/jov.22.4.4.

DOI:10.1167/jov.22.4.4

PMID:35266961

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC8934559/

Abstract

Distinguishing mirror from glass is a challenging visual inference, because both materials derive their appearance from their surroundings, yet we rarely experience difficulties in telling them apart. Very few studies have investigated how the visual system distinguishes reflections from refractions and to date, there is no image-computable model that emulates human judgments. Here we sought to develop a deep neural network that reproduces the patterns of visual judgments human observers make. To do this, we trained thousands of convolutional neural networks on more than 750,000 simulated mirror and glass objects, and compared their performance with human judgments, as well as alternative classifiers based on "hand-engineered" image features. For randomly chosen images, all classifiers and humans performed with high accuracy, and therefore correlated highly with one another. However, to assess how similar models are to humans, it is not sufficient to compare accuracy or correlation on random images. A good model should also predict the characteristic errors that humans make. We, therefore, painstakingly assembled a diagnostic image set for which humans make systematic errors, allowing us to isolate signatures of human-like performance. A large-scale, systematic search through feedforward neural architectures revealed that relatively shallow (three-layer) networks predicted human judgments better than any other models we tested. This is the first image-computable model that emulates human errors and succeeds in distinguishing mirror from glass, and hints that mid-level visual processing might be particularly important for the task.
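The point that accuracy alone does not establish human-likeness can be illustrated with a minimal sketch. It assumes each image has a true label, a human response rate (the proportion of observers answering "mirror"), and a model output P(mirror). All numbers, the names `model_a` and `model_b`, and the 0.5 decision threshold are hypothetical and chosen only for illustration; they are not data or methods from the paper.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def accuracy(p_mirror, labels):
    """Fraction of images where thresholding P(mirror) at 0.5 matches the label (1 = mirror)."""
    hits = sum((p > 0.5) == bool(t) for p, t in zip(p_mirror, labels))
    return hits / len(labels)

# Hypothetical "diagnostic" images: humans systematically misjudge several of them.
labels  = [1, 1, 0, 0, 1, 0]                  # ground truth, 1 = mirror, 0 = glass
human   = [0.9, 0.2, 0.1, 0.8, 0.3, 0.7]      # proportion of "mirror" responses
model_a = [0.95, 0.9, 0.05, 0.1, 0.9, 0.1]    # very accurate, but not human-like
model_b = [0.85, 0.3, 0.2, 0.7, 0.4, 0.6]     # reproduces the human error pattern

for name, model in (("model_a", model_a), ("model_b", model_b)):
    print(name,
          "accuracy:", round(accuracy(model, labels), 2),
          "correlation with humans:", round(pearson(model, human), 2))
```

In this toy setup the more accurate model agrees less with the human responses, which is the kind of dissociation a diagnostic image set is meant to expose.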

Similar articles

Gloss perception: Searching for a deep neural network that behaves like humans.

J Vis. 2021 Nov 1;21(12):14. doi: 10.1167/jov.21.12.14.

Identifying specular highlights: Insights from deep learning.

J Vis. 2022 Jun 1;22(7):6. doi: 10.1167/jov.22.7.6.

Visual perception of liquids: Insights from deep neural networks.

PLoS Comput Biol. 2020 Aug 19;16(8):e1008018. doi: 10.1371/journal.pcbi.1008018. eCollection 2020 Aug.

Deep Convolutional Neural Networks Outperform Feature-Based But Not Categorical Models in Explaining Object Similarity Judgments.

Front Psychol. 2017 Oct 9;8:1726. doi: 10.3389/fpsyg.2017.01726. eCollection 2017.

Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.

J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.

The Ventral Visual Pathway Represents Animal Appearance over Animacy, Unlike Human Behavior and Deep Neural Networks.

J Neurosci. 2019 Aug 14;39(33):6513-6525. doi: 10.1523/JNEUROSCI.1714-18.2019. Epub 2019 Jun 13.

Human Eyes-Inspired Recurrent Neural Networks Are More Robust Against Adversarial Noises.

Neural Comput. 2024 Aug 19;36(9):1713-1743. doi: 10.1162/neco_a_01688.

Social Trait Information in Deep Convolutional Neural Networks Trained for Face Identification.

Cogn Sci. 2019 Jun;43(6):e12729. doi: 10.1111/cogs.12729.

Transfer of Learning from Vision to Touch: A Hybrid Deep Convolutional Neural Network for Visuo-Tactile 3D Object Recognition.

Sensors (Basel). 2020 Dec 27;21(1):113. doi: 10.3390/s21010113.

Articles citing this paper

The evolution of Big Data in neuroscience and neurology.

J Big Data. 2023;10(1):116. doi: 10.1186/s40537-023-00751-2. Epub 2023 Jul 10.

Color and gloss constancy under diverse lighting environments.

J Vis. 2023 Jul 3;23(7):8. doi: 10.1167/jov.23.7.8.

Modeling surface color discrimination under different lighting environments using image chromatic statistics and convolutional neural networks.

J Opt Soc Am A Opt Image Sci Vis. 2023 Feb 15;40(3):A149-A159. doi: 10.1364/JOSAA.479986.

Unsupervised learning reveals interpretable latent representations for translucency perception.

PLoS Comput Biol. 2023 Feb 8;19(2):e1010878. doi: 10.1371/journal.pcbi.1010878. eCollection 2023 Feb.

Visual discrimination of optical material properties: A large-scale study.

J Vis. 2022 Feb 1;22(2):17. doi: 10.1167/jov.22.2.17.

Material constancy in perception and working memory.

J Vis. 2020 Oct 1;20(10):10. doi: 10.1167/jov.20.10.10.

Visual perception of liquids: Insights from deep neural networks.

PLoS Comput Biol. 2020 Aug 19;16(8):e1008018. doi: 10.1371/journal.pcbi.1008018. eCollection 2020 Aug.

Learning to see stuff.

Curr Opin Behav Sci. 2019 Dec;30:100-108. doi: 10.1016/j.cobeha.2019.07.004.

References cited in this paper

Low level visual features support robust material perception in the judgement of metallicity.

Sci Rep. 2021 Aug 12;11(1):16396. doi: 10.1038/s41598-021-95416-6.

Unsupervised learning predicts human perception and misperception of gloss.

Nat Hum Behav. 2021 Oct;5(10):1402-1417. doi: 10.1038/s41562-021-01097-6. Epub 2021 May 6.

Controversial stimuli: Pitting neural networks against each other as models of human cognition.

Proc Natl Acad Sci U S A. 2020 Nov 24;117(47):29330-29337. doi: 10.1073/pnas.1912334117.

Visual perception of liquids: Insights from deep neural networks.

PLoS Comput Biol. 2020 Aug 19;16(8):e1008018. doi: 10.1371/journal.pcbi.1008018. eCollection 2020 Aug.

Image-Computable Ideal Observers for Tasks with Natural Stimuli.

Annu Rev Vis Sci. 2020 Sep 15;6:491-517. doi: 10.1146/annurev-vision-030320-041134. Epub 2020 Jun 24.

Learning to see stuff.

Curr Opin Behav Sci. 2019 Dec;30:100-108. doi: 10.1016/j.cobeha.2019.07.004.

The Rotating Glass Illusion: Material Appearance Is Bound to Perceived Shape and Motion.

Iperception. 2018 Dec 26;9(6):2041669518816716. doi: 10.1177/2041669518816716. eCollection 2018 Nov-Dec.

Naturally glossy: Gloss perception, illumination statistics, and tone mapping.

J Vis. 2018 Dec 3;18(13):4. doi: 10.1167/18.13.4.

Deep learning-Using machine learning to study biological vision.

J Vis. 2018 Dec 3;18(13):2. doi: 10.1167/18.13.2.

Neural Mechanisms of Material Perception: Quest on Shitsukan.

Neuroscience. 2018 Nov 10;392:329-347. doi: 10.1016/j.neuroscience.2018.09.001. Epub 2018 Sep 11.
