人工智能、视觉意象，以及对人类智力测验所带来的挑战的案例研究。

AI, visual imagery, and a case study on the challenges posed by human intelligence tests.

机构信息

Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN 37235-1679

出版信息

Proc Natl Acad Sci U S A. 2020 Nov 24;117(47):29390-29397. doi: 10.1073/pnas.1912335117.

DOI:10.1073/pnas.1912335117

PMID:33229557

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7703577/

Abstract

Observations abound about the power of visual imagery in human intelligence, from how Nobel prize-winning physicists make their discoveries to how children understand bedtime stories. These observations raise an important question for cognitive science, which is, what are the computations taking place in someone's mind when they use visual imagery? Answering this question is not easy and will require much continued research across the multiple disciplines of cognitive science. Here, we focus on a related and more circumscribed question from the perspective of artificial intelligence (AI): If you have an intelligent agent that uses visual imagery-based knowledge representations and reasoning operations, then what kinds of problem solving might be possible, and how would such problem solving work? We highlight recent progress in AI toward answering these questions in the domain of visuospatial reasoning, looking at a case study of how imagery-based artificial agents can solve visuospatial intelligence tests. In particular, we first examine several variations of imagery-based knowledge representations and problem-solving strategies that are sufficient for solving problems from the Raven's Progressive Matrices intelligence test. We then look at how artificial agents, instead of being designed manually by AI researchers, might learn portions of their own knowledge and reasoning procedures from experience, including learning visuospatial domain knowledge, learning and generalizing problem-solving strategies, and learning the actual definition of the task in the first place.

摘要

从诺贝尔奖得主物理学家如何做出发现到孩子们如何理解睡前故事，人们对视觉意象在人类智力中的作用有很多观察。这些观察为认知科学提出了一个重要问题，即当人们使用视觉意象时，他们的大脑中正在进行什么计算？回答这个问题并不容易，需要认知科学的多个学科继续进行大量研究。在这里，我们从人工智能 (AI) 的角度关注一个相关但范围更窄的问题：如果您有一个使用基于视觉意象的知识表示和推理操作的智能代理，那么可能会解决什么样的问题，以及这种问题解决方式将如何工作？我们强调了人工智能在视觉空间推理领域回答这些问题的最新进展，研究了基于意象的人工智能代理如何解决视觉空间智能测试的案例。具体来说，我们首先研究了几种基于意象的知识表示和问题解决策略的变体，这些变体足以解决瑞文渐进矩阵智力测验中的问题。然后，我们研究了人工智能代理如何从经验中学习自己的部分知识和推理过程，包括学习视觉空间领域知识、学习和推广问题解决策略，以及首先学习任务的实际定义。

相似文献

AI, visual imagery, and a case study on the challenges posed by human intelligence tests.人工智能、视觉意象，以及对人类智力测验所带来的挑战的案例研究。

Proc Natl Acad Sci U S A. 2020 Nov 24;117(47):29390-29397. doi: 10.1073/pnas.1912335117.

Visual mental imagery: A view from artificial intelligence.视觉心理意象：人工智能视角。

Cortex. 2018 Aug;105:155-172. doi: 10.1016/j.cortex.2018.01.022. Epub 2018 Feb 27.

Augmenting cognitive architectures to support diagrammatic imagination.增强认知架构以支持图表想象。

Top Cogn Sci. 2011 Oct;3(4):760-77. doi: 10.1111/j.1756-8765.2011.01156.x. Epub 2011 Aug 4.

Modeling visual problem solving as analogical reasoning.将视觉问题解决建模为类比推理。

Psychol Rev. 2017 Jan;124(1):60-90. doi: 10.1037/rev0000039.

Autistic fluid intelligence: Increased reliance on visual functional connectivity with diminished modulation of coupling by task difficulty.自闭症的流体智力：对视觉功能连接的依赖增加，且任务难度对耦合的调节作用减弱。

Neuroimage Clin. 2015 Sep 18;9:467-78. doi: 10.1016/j.nicl.2015.09.007. eCollection 2015.

Unsupervised Abstract Reasoning for Raven's Problem Matrices.无监督抽象推理在瑞文标准推理测验中的应用。

IEEE Trans Image Process. 2021;30:8332-8341. doi: 10.1109/TIP.2021.3114987. Epub 2021 Oct 5.

Mechanical reasoning by mental simulation.通过心理模拟进行机械推理。

Trends Cogn Sci. 2004 Jun;8(6):280-5. doi: 10.1016/j.tics.2004.04.001.

Engineering neural systems for high-level problem solving.工程化神经系统以解决高级别问题。

Neural Netw. 2016 Jul;79:37-52. doi: 10.1016/j.neunet.2016.03.006. Epub 2016 Mar 31.

Intraindividual strategy shifts in Raven's matrices, and their dependence on working memory capacity and need for cognition.个体在瑞文标准推理测验中的策略转变，及其对工作记忆容量和认知需求的依赖性。

J Exp Psychol Gen. 2020 Mar;149(3):564-579. doi: 10.1037/xge0000660. Epub 2019 Jul 18.

A neural model of rule generation in inductive reasoning.归纳推理中规则生成的神经模型。

Top Cogn Sci. 2011 Jan;3(1):140-53. doi: 10.1111/j.1756-8765.2010.01127.x.

引用本文的文献

Let's do it: Response times in Mental Paper Folding and its execution.我们来做这个：心理折纸任务中的反应时间及其执行情况。

Q J Exp Psychol (Hove). 2025 Apr;78(4):731-743. doi: 10.1177/17470218241249727. Epub 2024 May 12.

Modeling Sequential Dependencies in Progressive Matrices: An Auto-Regressive Item Response Theory (AR-IRT) Approach.渐进矩阵中序列依赖关系的建模：一种自回归项目反应理论（AR-IRT）方法。

J Intell. 2024 Jan 15;12(1):7. doi: 10.3390/jintelligence12010007.

Responses to Raven matrices: Governed by visual complexity and centrality.对 Raven 矩阵的反应：受视觉复杂性和中心性的影响。

Perception. 2023 Sep;52(9):645-661. doi: 10.1177/03010066231178149. Epub 2023 Jun 2.

Multimodal Art Pose Recognition and Interaction With Human Intelligence Enhancement.多模态艺术姿态识别与人类智能增强交互

Front Psychol. 2021 Nov 8;12:769509. doi: 10.3389/fpsyg.2021.769509. eCollection 2021.

The brain produces mind by modeling.大脑通过建模产生思维。

Proc Natl Acad Sci U S A. 2020 Nov 24;117(47):29299-29301. doi: 10.1073/pnas.1912340117.

本文引用的文献

Beyond imitation: Zero-shot task transfer on robots by learning concepts as cognitive programs.超越模仿：通过将概念作为认知程序来学习，实现机器人的零样本任务迁移。

Sci Robot. 2019 Jan 16;4(26). doi: 10.1126/scirobotics.aav3150.

Visual mental imagery: A view from artificial intelligence.视觉心理意象：人工智能视角。

Cortex. 2018 Aug;105:155-172. doi: 10.1016/j.cortex.2018.01.022. Epub 2018 Feb 27.

Modeling visual problem solving as analogical reasoning.将视觉问题解决建模为类比推理。

Psychol Rev. 2017 Jan;124(1):60-90. doi: 10.1037/rev0000039.

Human-level concept learning through probabilistic program induction.通过概率编程归纳实现人类水平的概念学习。

Science. 2015 Dec 11;350(6266):1332-8. doi: 10.1126/science.aab3050.

Home Reading Environment and Brain Activation in Preschool Children Listening to Stories.学龄前儿童听故事时的家庭阅读环境与大脑激活

Pediatrics. 2015 Sep;136(3):466-78. doi: 10.1542/peds.2015-0359. Epub 2015 Aug 10.

The heterogeneity of mental representation: Ending the imagery debate.心理表征的异质性：终结意象之争。

Proc Natl Acad Sci U S A. 2015 Aug 18;112(33):10089-92. doi: 10.1073/pnas.1504933112. Epub 2015 Jul 14.

A neural model of rule generation in inductive reasoning.归纳推理中规则生成的神经模型。

Top Cogn Sci. 2011 Jan;3(1):140-53. doi: 10.1111/j.1756-8765.2010.01127.x.

Learning to relate images.学习关联图像。

IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1829-46. doi: 10.1109/TPAMI.2013.53.

A century of Gestalt psychology in visual perception: I. Perceptual grouping and figure-ground organization.一百年来视觉感知的格式塔心理学：一、知觉群集和图形-背景组织。

Psychol Bull. 2012 Nov;138(6):1172-217. doi: 10.1037/a0029333. Epub 2012 Jul 30.

Learning to represent spatial transformations with factored higher-order Boltzmann machines.用因子高阶玻尔兹曼机来学习表示空间变换。

Neural Comput. 2010 Jun;22(6):1473-92. doi: 10.1162/neco.2010.01-09-953.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

人工智能、视觉意象，以及对人类智力测验所带来的挑战的案例研究。

AI, visual imagery, and a case study on the challenges posed by human intelligence tests.

机构信息

Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN 37235-1679

出版信息

Proc Natl Acad Sci U S A. 2020 Nov 24;117(47):29390-29397. doi: 10.1073/pnas.1912335117.

DOI:10.1073/pnas.1912335117

PMID:33229557

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7703577/

Abstract

摘要

人工智能、视觉意象，以及对人类智力测验所带来的挑战的案例研究。

AI, visual imagery, and a case study on the challenges posed by human intelligence tests.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

人工智能、视觉意象，以及对人类智力测验所带来的挑战的案例研究。

AI, visual imagery, and a case study on the challenges posed by human intelligence tests.

机构信息

出版信息