• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PsyPhy:一个视觉识别的心理物理学驱动评估框架。

PsyPhy: A Psychophysics Driven Evaluation Framework for Visual Recognition.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2019 Sep;41(9):2280-2286. doi: 10.1109/TPAMI.2018.2849989. Epub 2018 Jun 25.

DOI:10.1109/TPAMI.2018.2849989
PMID:29994469
Abstract

By providing substantial amounts of data and standardized evaluation protocols, datasets in computer vision have helped fuel advances across all areas of visual recognition. But even in light of breakthrough results on recent benchmarks, it is still fair to ask if our recognition algorithms are doing as well as we think they are. The vision sciences at large make use of a very different evaluation regime known as Visual Psychophysics to study visual perception. Psychophysics is the quantitative examination of the relationships between controlled stimuli and the behavioral responses they elicit in experimental test subjects. Instead of using summary statistics to gauge performance, psychophysics directs us to construct item-response curves made up of individual stimulus responses to find perceptual thresholds, thus allowing one to identify the exact point at which a subject can no longer reliably recognize the stimulus class. In this article, we introduce a comprehensive evaluation framework for visual recognition models that is underpinned by this methodology. Over millions of procedurally rendered 3D scenes and 2D images, we compare the performance of well-known convolutional neural networks. Our results bring into question recent claims of human-like performance, and provide a path forward for correcting newly surfaced algorithmic deficiencies.

摘要

通过提供大量的数据和标准化的评估协议,计算机视觉中的数据集帮助推动了视觉识别各个领域的进展。但即使考虑到最近基准测试中的突破结果,我们仍然有理由怀疑我们的识别算法是否像我们认为的那样出色。整个视觉科学领域都在使用一种称为视觉心理物理学的完全不同的评估机制来研究视觉感知。心理物理学是对受控刺激与实验测试对象所引起的行为反应之间关系的定量研究。心理物理学不是使用汇总统计数据来衡量性能,而是指导我们构建由单个刺激反应组成的项目响应曲线,以找到感知阈值,从而可以确定受试者无法再可靠识别刺激类别的确切点。在本文中,我们引入了一个基于该方法的视觉识别模型的综合评估框架。在数百万个程序渲染的 3D 场景和 2D 图像上,我们比较了著名卷积神经网络的性能。我们的结果对最近提出的类似人类的性能声称提出了质疑,并为纠正新出现的算法缺陷提供了一条途径。

相似文献

1
PsyPhy: A Psychophysics Driven Evaluation Framework for Visual Recognition.PsyPhy:一个视觉识别的心理物理学驱动评估框架。
IEEE Trans Pattern Anal Mach Intell. 2019 Sep;41(9):2280-2286. doi: 10.1109/TPAMI.2018.2849989. Epub 2018 Jun 25.
2
Visual Object Recognition: Do We (Finally) Know More Now Than We Did?视觉物体识别:我们(终于)比以前知道得更多了吗?
Annu Rev Vis Sci. 2016 Oct 14;2:377-396. doi: 10.1146/annurev-vision-111815-114621. Epub 2016 Aug 3.
3
Image Visual Realism: From Human Perception to Machine Computation.图像视觉逼真度:从人类感知到机器计算。
IEEE Trans Pattern Anal Mach Intell. 2018 Sep;40(9):2180-2193. doi: 10.1109/TPAMI.2017.2747150. Epub 2017 Aug 30.
4
On the neural correlates of visual perception.论视觉感知的神经关联
Cereb Cortex. 1999 Jan-Feb;9(1):4-19. doi: 10.1093/cercor/9.1.4.
5
Hyperacuity test to evaluate vision through dense cataracts; research preliminary to a clinical study. I. Studies conducted at the University of California at Berkeley before travel to India.通过致密白内障评估视力的超敏锐度测试;一项临床研究的前期研究。一、前往印度之前在加利福尼亚大学伯克利分校进行的研究。
Optom Vis Sci. 1995 Sep;72(9):619-29. doi: 10.1097/00006324-199509000-00005.
6
Discrimination of spectrally blended natural images: optimisation of the human visual system for encoding natural images.光谱混合自然图像的辨别:人类视觉系统对自然图像编码的优化。
Perception. 2000;29(9):1087-100. doi: 10.1068/p3015.
7
Visual Percepts in the Cases of Binocular and Monocular Viewing Stabilized Test Objects, Ganzfeld Stimuli, and Prolonged Afterimages.双眼和单眼观察稳定测试物体、全视野刺激和长时间后像情况下的视觉感知。
Perception. 2015;44(8-9):934-51. doi: 10.1177/0301006615594957. Epub 2015 Aug 14.
8
Representation of object similarity in human vision: psychophysics and a computational model.人类视觉中物体相似性的表征:心理物理学与计算模型。
Vision Res. 1998 Aug;38(15-16):2229-57. doi: 10.1016/s0042-6989(97)00186-7.
9
A quantitative theory of immediate visual recognition.即时视觉识别的定量理论。
Prog Brain Res. 2007;165:33-56. doi: 10.1016/S0079-6123(06)65004-8.
10
Methodological Considerations to Strengthen Studies of Peripheral Vision.强化周边视觉研究的方法学考量。
Trends Cogn Sci. 2016 Sep;20(9):642-643. doi: 10.1016/j.tics.2016.06.005. Epub 2016 Jul 4.

引用本文的文献

1
Squeeze-EnGAN: Memory Efficient and Unsupervised Low-Light Image Enhancement for Intelligent Vehicles.挤压生成对抗网络(Squeeze-EnGAN):用于智能车辆的内存高效且无监督的低光图像增强
Sensors (Basel). 2025 Mar 14;25(6):1825. doi: 10.3390/s25061825.
2
Low-light image enhancement via adaptive frequency decomposition network.基于自适应频率分解网络的低光照图像增强
Sci Rep. 2023 Aug 29;13(1):14107. doi: 10.1038/s41598-023-40899-8.
3
Human-Like Modulation Sensitivity Emerging through Optimization to Natural Sound Recognition.通过优化自然声音识别实现类人调制敏感性。
J Neurosci. 2023 May 24;43(21):3876-3894. doi: 10.1523/JNEUROSCI.2002-22.2023. Epub 2023 Apr 25.
4
A Nighttime Vehicle Detection Method with Attentive GAN for Accurate Classification and Regression.一种基于注意力生成对抗网络的夜间车辆检测方法,用于精确分类与回归。
Entropy (Basel). 2021 Nov 11;23(11):1490. doi: 10.3390/e23111490.
5
Overview and Empirical Analysis of ISP Parameter Tuning for Visual Perception in Autonomous Driving.自动驾驶视觉感知中ISP参数调整的概述与实证分析
J Imaging. 2019 Sep 24;5(10):78. doi: 10.3390/jimaging5100078.
6
A Hierarchy of Functional States in Working Memory.工作记忆中的功能状态层次结构。
J Neurosci. 2021 May 19;41(20):4461-4475. doi: 10.1523/JNEUROSCI.3104-20.2021. Epub 2021 Apr 22.
7
Artificial cognition: How experimental psychology can help generate explainable artificial intelligence.人工认知:实验心理学如何帮助生成可解释的人工智能。
Psychon Bull Rev. 2021 Apr;28(2):454-475. doi: 10.3758/s13423-020-01825-5. Epub 2020 Nov 6.
8
Performance vs. competence in human-machine comparisons.人机比较中的表现与能力。
Proc Natl Acad Sci U S A. 2020 Oct 27;117(43):26562-26571. doi: 10.1073/pnas.1905334117. Epub 2020 Oct 13.
9
Capsule networks as recurrent models of grouping and segmentation.胶囊网络作为分组和分割的递归模型。
PLoS Comput Biol. 2020 Jul 21;16(7):e1008017. doi: 10.1371/journal.pcbi.1008017. eCollection 2020 Jul.
10
Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.大规模、高分辨率的人类、猴子和最先进的深度人工神经网络核心视觉对象识别行为比较。
J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.