• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ViSpa(视觉空间):一种基于计算机视觉的个体图像和概念原型表示系统,具有大规模评估。

ViSpa (Vision Spaces): A computer-vision-based representation system for individual images and concept prototypes, with large-scale evaluation.

机构信息

Department of Psychology, Humboldt-Universitat zu Berlin.

Department of Psychology, University of Milano-Bicocca.

出版信息

Psychol Rev. 2023 Jul;130(4):896-934. doi: 10.1037/rev0000392. Epub 2022 Oct 6.

DOI:10.1037/rev0000392
PMID:36201829
Abstract

Quantitative, data-driven models for mental representations have long enjoyed popularity and success in psychology (e.g., distributional semantic models in the language domain), but have largely been missing for the visual domain. To overcome this, we present ViSpa (Vision Spaces), high-dimensional vector spaces that include vision-based representation for naturalistic images as well as concept prototypes. These vectors are derived directly from visual stimuli through a deep convolutional neural network trained to classify images and allow us to compute vision-based similarity scores between any pair of images and/or concept prototypes. We successfully evaluate these similarities against human behavioral data in a series of large-scale studies, including off-line judgments-visual similarity judgments for the referents of word pairs (Study 1) and for image pairs (Study 2), and typicality judgments for images given a label (Study 3)-as well as online processing times and error rates in a discrimination (Study 4) and priming task (Study 5) with naturalistic image material. similarities predict behavioral data across all tasks, which renders a theoretically appealing model for vision-based representations and a valuable research tool for data analysis and the construction of experimental material: allows for precise control over experimental material consisting of images and/or words denoting imageable concepts and introduces a specifically vision-based similarity for word pairs. To make available to a wide audience, this article (a) includes (video) tutorials on how to use in R and (b) presents a user-friendly web interface at http://vispa.fritzguenther.de. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

摘要

长期以来,用于心理表象的定量、数据驱动模型在心理学中一直很受欢迎并取得了成功(例如,语言领域的分布语义模型),但在视觉领域却基本上没有。为了克服这个问题,我们提出了 ViSpa(视觉空间),这是一种高维向量空间,其中包括基于视觉的自然图像表示以及概念原型。这些向量是通过一个经过训练来对图像进行分类的深度卷积神经网络从视觉刺激中直接导出的,这使我们能够计算任意一对图像和/或概念原型之间的基于视觉的相似度得分。我们在一系列大规模研究中成功地评估了这些相似性与人类行为数据之间的关系,包括离线判断-词对(研究 1)和图像对(研究 2)的视觉相似性判断,以及给定标签的图像的典型性判断(研究 3)-以及自然图像材料的辨别(研究 4)和启动任务(研究 5)中的在线处理时间和错误率。相似性可以预测所有任务中的行为数据,这为基于视觉的表示提供了一个理论上吸引人的模型,也是数据分析和实验材料构建的有价值的研究工具:它可以精确控制由表示可想象概念的图像和/或单词组成的实验材料,并为词对引入了特定的基于视觉的相似度。为了让更广泛的受众能够使用,本文(a)包括了如何在 R 中使用的(视频)教程,以及(b)在 http://vispa.fritzguenther.de 上提供了一个用户友好的网络界面。(PsycInfo 数据库记录(c)2023 APA,保留所有权利)。

相似文献

1
ViSpa (Vision Spaces): A computer-vision-based representation system for individual images and concept prototypes, with large-scale evaluation.ViSpa(视觉空间):一种基于计算机视觉的个体图像和概念原型表示系统,具有大规模评估。
Psychol Rev. 2023 Jul;130(4):896-934. doi: 10.1037/rev0000392. Epub 2022 Oct 6.
2
Predicting patterns of similarity among abstract semantic relations.预测抽象语义关系之间相似性的模式。
J Exp Psychol Learn Mem Cogn. 2022 Jan;48(1):108-121. doi: 10.1037/xlm0001010. Epub 2021 Jul 1.
3
Probing the link between vision and language in material perception using psychophysics and unsupervised learning.使用心理物理学和无监督学习探究物质感知中视觉和语言之间的联系。
PLoS Comput Biol. 2024 Oct 3;20(10):e1012481. doi: 10.1371/journal.pcbi.1012481. eCollection 2024 Oct.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Images of the unseen: extrapolating visual representations for abstract and concrete words in a data-driven computational model.不可见事物的图像:在数据驱动的计算模型中推断抽象词和具体词的视觉表征
Psychol Res. 2022 Nov;86(8):2512-2532. doi: 10.1007/s00426-020-01429-7.
6
Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.大规模、高分辨率的人类、猴子和最先进的深度人工神经网络核心视觉对象识别行为比较。
J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.
7
Visual features as stepping stones toward semantics: Explaining object similarity in IT and perception with non-negative least squares.作为通向语义学垫脚石的视觉特征:用非负最小二乘法解释信息技术与感知中的物体相似性。
Neuropsychologia. 2016 Mar;83:201-226. doi: 10.1016/j.neuropsychologia.2015.10.023. Epub 2015 Oct 19.
8
Constructing Semantic Models From Words, Images, and Emojis.从字词、图像和表情符号构建语义模型。
Cogn Sci. 2020 Apr;44(4):e12830. doi: 10.1111/cogs.12830.
9
Similarity Judgment Within and Across Categories: A Comprehensive Model Comparison.范畴内和范畴间相似性判断:综合模型比较
Cogn Sci. 2021 Aug;45(8):e13030. doi: 10.1111/cogs.13030.
10
Learning semantic and visual similarity for endomicroscopy video retrieval.学习内窥镜视频检索的语义和视觉相似性。
IEEE Trans Med Imaging. 2012 Jun;31(6):1276-88. doi: 10.1109/TMI.2012.2188301. Epub 2012 Feb 16.

引用本文的文献

1
Cracking arbitrariness: A data-driven study of auditory iconicity in spoken English.破解任意性:一项关于英语口语中听觉象似性的数据驱动研究。
Psychon Bull Rev. 2025 Jun;32(3):1425-1442. doi: 10.3758/s13423-024-02630-0. Epub 2025 Jan 8.
2
Visual search and real-image similarity: An empirical assessment through the lens of deep learning.视觉搜索与真实图像相似度:基于深度学习视角的实证评估
Psychon Bull Rev. 2025 Apr;32(2):822-838. doi: 10.3758/s13423-024-02583-4. Epub 2024 Sep 26.
3
Visual experience modulates the sensitivity to the distributional history of words in natural language.
视觉体验会调节对自然语言中词汇分布历史的敏感度。
Psychon Bull Rev. 2025 Feb;32(1):472-481. doi: 10.3758/s13423-024-02557-6. Epub 2024 Aug 22.
4
Decomposing geographical judgments into spatial, temporal and linguistic components.将地理判断分解为空间、时间和语言成分。
Psychol Res. 2024 Jul;88(5):1590-1601. doi: 10.1007/s00426-024-01980-7. Epub 2024 Jun 5.
5
Taboo language across the globe: A multi-lab study.全球禁忌语:一项多实验室研究。
Behav Res Methods. 2024 Apr;56(4):3794-3813. doi: 10.3758/s13428-024-02376-6. Epub 2024 May 9.
6
From vector spaces to DRM lists: False Memory Generator, a software for automated generation of lists of stimuli inducing false memories.从向量空间到 DRM 列表:虚假记忆生成器,一款用于自动化生成诱导虚假记忆的刺激列表的软件。
Behav Res Methods. 2024 Apr;56(4):3779-3793. doi: 10.3758/s13428-024-02425-0. Epub 2024 May 6.
7
Visual Intuitions in the Absence of Visual Experience: The Role of Direct Experience in Concreteness and Imageability Judgements.缺乏视觉体验时的视觉直觉:直接体验在具体性和可想象性判断中的作用
J Cogn. 2024 Jan 9;7(1):3. doi: 10.5334/joc.328. eCollection 2024.
8
A Cross-Modal and Cross-lingual Study of Iconicity in Language: Insights From Deep Learning.语言象似性的跨模态和跨语言研究:深度学习的启示。
Cogn Sci. 2022 Jun;46(6):e13147. doi: 10.1111/cogs.13147.