• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

计算机视觉系统的视觉图灵测试。

Visual Turing test for computer vision systems.

作者信息

Geman Donald, Geman Stuart, Hallonquist Neil, Younes Laurent

机构信息

Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, MD 21287; and.

Division of Applied Mathematics, Brown University, Providence, RI 02912

出版信息

Proc Natl Acad Sci U S A. 2015 Mar 24;112(12):3618-23. doi: 10.1073/pnas.1422953112. Epub 2015 Mar 9.

DOI:10.1073/pnas.1422953112
PMID:25755262
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4378453/
Abstract

Today, computer vision systems are tested by their accuracy in detecting and localizing instances of objects. As an alternative, and motivated by the ability of humans to provide far richer descriptions and even tell a story about an image, we construct a "visual Turing test": an operator-assisted device that produces a stochastic sequence of binary questions from a given test image. The query engine proposes a question; the operator either provides the correct answer or rejects the question as ambiguous; the engine proposes the next question ("just-in-time truthing"). The test is then administered to the computer-vision system, one question at a time. After the system's answer is recorded, the system is provided the correct answer and the next question. Parsing is trivial and deterministic; the system being tested requires no natural language processing. The query engine employs statistical constraints, learned from a training set, to produce questions with essentially unpredictable answers-the answer to a question, given the history of questions and their correct answers, is nearly equally likely to be positive or negative. In this sense, the test is only about vision. The system is designed to produce streams of questions that follow natural story lines, from the instantiation of a unique object, through an exploration of its properties, and on to its relationships with other uniquely instantiated objects.

摘要

如今,计算机视觉系统通过检测和定位物体实例的准确性来进行测试。作为一种替代方法,并且受人类能够提供更丰富描述甚至讲述图像故事的能力的启发,我们构建了一个“视觉图灵测试”:一种由操作员辅助的设备,它从给定的测试图像中生成一系列随机的二元问题。查询引擎提出一个问题;操作员要么提供正确答案,要么以问题模糊为由拒绝该问题;引擎提出下一个问题(“即时验证”)。然后一次向计算机视觉系统提出一个问题来进行测试。记录系统的答案后,向系统提供正确答案和下一个问题。解析过程简单且具有确定性;被测试的系统不需要自然语言处理。查询引擎利用从训练集中学习到的统计约束来生成答案基本不可预测的问题——给定问题历史及其正确答案,一个问题的答案几乎同样可能是肯定或否定的。从这个意义上说,该测试仅关乎视觉。该系统旨在生成遵循自然故事情节的问题流,从单个独特物体的实例化开始,通过探索其属性,再到它与其他独特实例化物体的关系。

相似文献

1
Visual Turing test for computer vision systems.计算机视觉系统的视觉图灵测试。
Proc Natl Acad Sci U S A. 2015 Mar 24;112(12):3618-23. doi: 10.1073/pnas.1422953112. Epub 2015 Mar 9.
2
A method and software for segmentation of anatomic object ensembles by deformable m-reps.一种通过可变形m-表示法对解剖对象集合进行分割的方法和软件。
Med Phys. 2005 May;32(5):1335-45. doi: 10.1118/1.1869872.
3
Vision processing for realtime 3-D data acquisition based on coded structured light.基于编码结构光的实时三维数据采集视觉处理
IEEE Trans Image Process. 2008 Feb;17(2):167-76. doi: 10.1109/TIP.2007.914755.
4
Automatic digital ECG signal extraction and normal QRS recognition from real scene ECG images.自动从真实场景 ECG 图像中提取数字 ECG 信号和识别正常 QRS 波。
Comput Methods Programs Biomed. 2020 Apr;187:105254. doi: 10.1016/j.cmpb.2019.105254. Epub 2019 Nov 30.
5
Automatic image orientation detection via confidence-based integration of low-level and semantic cues.通过基于置信度的低级线索与语义线索整合实现自动图像方向检测
IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):715-26. doi: 10.1109/TPAMI.2005.96.
6
3-D object recognition using 2-D views.使用二维视图进行三维物体识别。
IEEE Trans Image Process. 2008 Nov;17(11):2236-55. doi: 10.1109/TIP.2008.2003404.
7
Robust object recognition with cortex-like mechanisms.具有类皮质机制的稳健目标识别
IEEE Trans Pattern Anal Mach Intell. 2007 Mar;29(3):411-26. doi: 10.1109/TPAMI.2007.56.
8
Evolutionary optimization of a hierarchical object recognition model.层次化目标识别模型的进化优化
IEEE Trans Syst Man Cybern B Cybern. 2005 Jun;35(3):426-37. doi: 10.1109/tsmcb.2005.846649.
9
Visually defining and querying consistent multi-granular clinical temporal abstractions.直观定义和查询一致的多粒度临床时间抽象。
Artif Intell Med. 2012 Feb;54(2):75-101. doi: 10.1016/j.artmed.2011.10.004. Epub 2011 Dec 15.
10
What the back of the object looks like: 3D reconstruction from line drawings without hidden lines.物体背面的样子:无隐藏线的线条图三维重建。
IEEE Trans Pattern Anal Mach Intell. 2008 Mar;30(3):507-17. doi: 10.1109/TPAMI.2007.1185.

引用本文的文献

1
Phenotype augmentation using generative AI for isocitrate dehydrogenase mutation prediction in glioma.使用生成式人工智能进行胶质瘤异柠檬酸脱氢酶突变预测的表型增强。
Sci Rep. 2025 Aug 7;15(1):28913. doi: 10.1038/s41598-025-14477-z.
2
Synthetic generation of cardiac tissue motion from surface electrocardiograms.从体表心电图合成心脏组织运动
Nat Cardiovasc Res. 2025 Apr;4(4):445-457. doi: 10.1038/s44161-025-00629-x. Epub 2025 Apr 14.
3
Synthetic bone marrow images augment real samples in developing acute myeloid leukemia microscopy classification models.合成骨髓图像在开发急性髓系白血病显微镜分类模型中增强了真实样本。
NPJ Digit Med. 2025 Mar 22;8(1):173. doi: 10.1038/s41746-025-01563-9.
4
Image-Based Generative Artificial Intelligence in Radiology: Comprehensive Updates.基于图像的放射学生成式人工智能:全面更新。
Korean J Radiol. 2024 Nov;25(11):959-981. doi: 10.3348/kjr.2024.0392.
5
Generative adversarial network-based reconstruction of healthy anatomy for anomaly detection in brain CT scans.基于生成对抗网络的脑CT扫描异常检测健康解剖结构重建
J Med Imaging (Bellingham). 2024 Jul;11(4):044508. doi: 10.1117/1.JMI.11.4.044508. Epub 2024 Aug 9.
6
Generative artificial intelligence to produce high-fidelity blastocyst-stage embryo images.生成式人工智能生成高保真囊胚期胚胎图像。
Hum Reprod. 2024 Jun 3;39(6):1197-1207. doi: 10.1093/humrep/deae064.
7
Image factory: A method for synthesizing novel CT images with anatomical guidance.图像生成工厂:一种在解剖学引导下合成新型CT图像的方法。
Med Phys. 2024 May;51(5):3464-3479. doi: 10.1002/mp.16864. Epub 2023 Dec 3.
8
Diversity Learning Based on Multi-Latent Space for Medical Image Visual Question Generation.基于多潜在空间的医学图像视觉问题生成的多样性学习。
Sensors (Basel). 2023 Jan 17;23(3):1057. doi: 10.3390/s23031057.
9
Concerns in the use of adversarial learning for image synthesis in cardiovascular intervention.心血管介入中对抗学习用于图像合成的相关问题。
Eur Heart J Digit Health. 2021 Jul 15;2(4):556. doi: 10.1093/ehjdh/ztab064. eCollection 2021 Dec.
10
Enhancing cancer differentiation with synthetic MRI examinations via generative models: a systematic review.通过生成模型利用合成磁共振成像检查增强癌症分化:一项系统综述
Insights Imaging. 2022 Dec 12;13(1):188. doi: 10.1186/s13244-022-01315-3.

本文引用的文献

1
Region-Based Convolutional Networks for Accurate Object Detection and Segmentation.基于区域的卷积神经网络用于精确的目标检测和分割。
IEEE Trans Pattern Anal Mach Intell. 2016 Jan;38(1):142-58. doi: 10.1109/TPAMI.2015.2437384.
2
Object detection with discriminatively trained part-based models.基于判别式训练的部件模型的目标检测。
IEEE Trans Pattern Anal Mach Intell. 2010 Sep;32(9):1627-45. doi: 10.1109/TPAMI.2009.167.
3
A fast learning algorithm for deep belief nets.一种用于深度信念网络的快速学习算法。
Neural Comput. 2006 Jul;18(7):1527-54. doi: 10.1162/neco.2006.18.7.1527.