In Praise of Artifice Reloaded: Caution With Natural Image Databases in Modeling Vision.

Authors

Martinez-Garcia Marina, Bertalmío Marcelo, Malo Jesús

Affiliations

Image Processing Lab, Universitat de València, Valencia, Spain.

CSIC, Instituto de Neurociencias, Alicante, Spain.

Publication information

Front Neurosci. 2019 Feb 18;13:8. doi: 10.3389/fnins.2019.00008. eCollection 2019.

DOI:10.3389/fnins.2019.00008

PMID:30894796

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6414813/

Abstract

Subjective image quality databases are a major source of raw data on how the visual system works in naturalistic environments. These databases describe the sensitivity of many observers to a wide range of distortions of different nature and intensity seen on top of a variety of natural images. Data of this kind seems to open a number of possibilities for the vision scientist to check the models in realistic scenarios. However, while these natural databases are great benchmarks for models developed in some other way (e.g., by using the well-controlled artificial stimuli of traditional psychophysics), they should be carefully used when trying to fit vision models. Given the high dimensionality of the image space, it is very likely that some basic phenomena are under-represented in the database. Therefore, a model fitted on these large-scale natural databases will not reproduce these under-represented basic phenomena that could otherwise be easily illustrated with well selected artificial stimuli. In this work we study a specific example of the above statement. A standard cortical model using wavelets and divisive normalization tuned to reproduce subjective opinion on a large image quality dataset fails to reproduce basic cross-masking. Here we outline a solution for this problem by using artificial stimuli and by proposing a modification that makes the model easier to tune. Then, we show that the modified model is still competitive in the large-scale database. Our simulations with these artificial stimuli show that when using steerable wavelets, the conventional unit norm Gaussian kernels in divisive normalization should be multiplied by high-pass filters to reproduce basic trends in masking. Basic visual phenomena may be misrepresented in large natural image datasets but this can be solved with model-interpretable stimuli. This is an additional argument in line with Rust and Movshon (2005).
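
The kernel modification mentioned in the abstract can be illustrated with a toy example. The sketch below is not the authors' model: it assumes a generic divisive normalization of the form r_i = sign(e_i)|e_i|^γ / (b^γ + Σ_j H_ij |e_j|^γ) applied to five hypothetical frequency channels, reads "unit norm" as rows summing to one, and uses a made-up linear weighting as a stand-in for the high-pass filter. All names and parameter values (`gaussian_kernel`, `b`, `gamma`, `weights`) are illustrative assumptions.

```python
import numpy as np

def gaussian_kernel(freqs, sigma_octaves=1.0):
    """Gaussian interaction kernel over log2 frequency, rows summing to one."""
    log_f = np.log2(freqs)
    dist = log_f[:, None] - log_f[None, :]          # channel distance in octaves
    H = np.exp(-0.5 * (dist / sigma_octaves) ** 2)
    return H / H.sum(axis=1, keepdims=True)

def divisive_normalization(e, H, b=0.1, gamma=2.0):
    """Each response is divided by a weighted pool of neighbouring energies."""
    energy = np.abs(e) ** gamma
    return np.sign(e) * energy / (b ** gamma + H @ energy)

# Hypothetical channel centre frequencies (cycles/degree), for illustration only.
freqs = np.array([1.0, 2.0, 4.0, 8.0, 16.0])

H_gauss = gaussian_kernel(freqs)

# Assumed high-pass weighting: neighbours at higher frequencies count more.
weights = freqs / freqs.max()
H_mod = H_gauss * weights[None, :]                  # reweight each neighbour j

# A 4 cpd test alone, and the same test plus a 16 cpd mask.
e_alone = np.array([0.0, 0.0, 0.5, 0.0, 0.0])
e_masked = np.array([0.0, 0.0, 0.5, 0.0, 0.5])

for name, H in [("Gaussian", H_gauss), ("Gaussian x high-pass", H_mod)]:
    r_alone = divisive_normalization(e_alone, H)[2]
    r_masked = divisive_normalization(e_masked, H)[2]
    print(f"{name}: test alone {r_alone:.3f}, with mask {r_masked:.3f}")
```

In both cases the mask lowers the response to the test (cross-masking); the weighting only changes how strongly each neighbouring channel contributes to the normalization pool, which is the kind of degree of freedom the abstract refers to.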

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b09/6414813/f1b3ffb8736a/fnins-13-00008-g0001.jpg

Similar articles

Divisive normalization image quality metric revisited.

J Opt Soc Am A Opt Image Sci Vis. 2010 Apr 1;27(4):852-64. doi: 10.1364/JOSAA.27.000852.

An image-computable psychophysical spatial vision model.

J Vis. 2017 Oct 1;17(12):12. doi: 10.1167/17.12.12.

In praise of artifice.

Nat Neurosci. 2005 Dec;8(12):1647-50. doi: 10.1038/nn1606.

Image quality assessment based on multiscale geometric analysis.

IEEE Trans Image Process. 2009 Jul;18(7):1409-23. doi: 10.1109/TIP.2009.2018014. Epub 2009 May 12.

Local masking in natural images: a database and analysis.

J Vis. 2014 Jul 29;14(8):22. doi: 10.1167/14.8.22.

Massive Online Crowdsourced Study of Subjective and Objective Picture Quality.

IEEE Trans Image Process. 2016 Jan;25(1):372-87. doi: 10.1109/TIP.2015.2500021. Epub 2015 Nov 11.

The statistics of how natural images drive the responses of neurons.

J Vis. 2019 Nov 1;19(13):4. doi: 10.1167/19.13.4.

Universal blind image quality assessment metrics via natural scene statistics and multiple kernel learning.

IEEE Trans Neural Netw Learn Syst. 2013 Dec;24(12):2013-26. doi: 10.1109/TNNLS.2013.2271356.

Exploring Human Cognition Using Large Image Databases.

Top Cogn Sci. 2016 Jul;8(3):569-88. doi: 10.1111/tops.12209.
