猴子 IT 中的可分离神经代码可实现完美的 CAPTCHA 解码。

A separable neural code in monkey IT enables perfect CAPTCHA decoding.

机构信息

Centre for Neuroscience, Indian Institute of Science, Bangalore, India.

出版信息

J Neurophysiol. 2022 Apr 1;127(4):869-884. doi: 10.1152/jn.00160.2021. Epub 2022 Feb 23.

DOI:10.1152/jn.00160.2021

PMID:35196158

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8957334/

Abstract

Reading distorted letters is easy for us but so challenging for the machine vision that it is used on websites as CAPTCHA (Completely Automated Public Turing Test to tell Computers and Humans Apart). How does our brain solve this problem? One solution is to have neurons selective for letter combinations but invariant to distortions. Another is for neurons to encode letter distortions and longer strings to enable separable decoding. Here, we provide evidence for the latter possibility using neural recordings in the monkey inferior temporal (IT) cortex. Neural responses to distorted strings were explained better as a product (but not sum) of shape and distortion tuning, whereas by contrast, responses to letter combinations were explained better as a sum (but not product) of letters. These two rules were sufficient for perfect CAPTCHA decoding and were also emergent in neural networks trained for word recognition. Thus, a separable neural code enables efficient letter recognition. Many websites ask us to recognize distorted letters to deny access to malicious computer programs. Why is this task easy for our brains but hard for the computers? Here, we show that, in the monkey inferior temporal cortex, an area critical for recognition, single neurons encode distorted letter strings according to highly systematic rules that enable perfect distorted letter decoding. Remarkably, the same rules were present in neural networks trained for text recognition.

摘要

阅读扭曲的字母对我们来说很容易，但机器视觉却很难做到，因此它被用于网站上的验证码（完全自动化的公共图灵测试，以区分计算机和人类）。我们的大脑是如何解决这个问题的呢？一种解决方案是让神经元对字母组合具有选择性，但对扭曲不变。另一种方法是让神经元对字母扭曲和更长的字符串进行编码，以实现可分离的解码。在这里，我们使用猴子下颞叶（IT）皮层的神经记录提供了后者可能性的证据。扭曲字符串的神经反应可以更好地解释为形状和扭曲调谐的乘积（而不是和），相比之下，字母组合的反应可以更好地解释为字母的和（而不是积）。这两个规则足以实现完美的验证码解码，并且在为单词识别而训练的神经网络中也出现了这种情况。因此，可分离的神经代码能够实现高效的字母识别。许多网站要求我们识别扭曲的字母，以拒绝恶意计算机程序的访问。为什么这个任务对我们的大脑来说很容易，但对计算机来说却很难？在这里，我们表明，在猴子的下颞叶皮层中，一个对识别至关重要的区域，单个神经元根据高度系统的规则对扭曲的字母串进行编码，从而实现完美的扭曲字母解码。值得注意的是，相同的规则也存在于为文本识别而训练的神经网络中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80ae/8957334/aaf9b35dc23c/jn-00160-2021r01.jpg

相似文献

A separable neural code in monkey IT enables perfect CAPTCHA decoding.猴子 IT 中的可分离神经代码可实现完美的 CAPTCHA 解码。

J Neurophysiol. 2022 Apr 1;127(4):869-884. doi: 10.1152/jn.00160.2021. Epub 2022 Feb 23.

Multiplicative mixing of object identity and image attributes in single inferior temporal neurons.单个下颞叶神经元中物体身份和图像属性的乘法混合。

Proc Natl Acad Sci U S A. 2018 Apr 3;115(14):E3276-E3285. doi: 10.1073/pnas.1714287115. Epub 2018 Mar 20.

CAPTCHA Image Generation: Two-Step Style-Transfer Learning in Deep Neural Networks.验证码图像生成：深度神经网络中的两步风格迁移学习。

Sensors (Basel). 2020 Mar 9;20(5):1495. doi: 10.3390/s20051495.

FR-CAPTCHA: CAPTCHA based on recognizing human faces.FR-CAPTCHA：基于人脸识别的验证码。

PLoS One. 2014 Apr 15;9(4):e91708. doi: 10.1371/journal.pone.0091708. eCollection 2014.

Detecting distortions of peripherally presented letter stimuli under crowded conditions.在拥挤条件下检测周边呈现字母刺激的变形。

Atten Percept Psychophys. 2017 Apr;79(3):850-862. doi: 10.3758/s13414-016-1245-x.

Cracking the neural code for word recognition in convolutional neural networks.破解卷积神经网络中单词识别的神经密码。

PLoS Comput Biol. 2024 Sep 6;20(9):e1012430. doi: 10.1371/journal.pcbi.1012430. eCollection 2024 Sep.

Letter processing in the visual system: different activation patterns for single letters and strings.视觉系统中的字母处理：单个字母和字符串的不同激活模式。

Cogn Affect Behav Neurosci. 2005 Dec;5(4):452-66. doi: 10.3758/cabn.5.4.452.

Training-induced neural plasticity in visual-word decoding and the role of syllables.训练诱导的视觉单词解码中的神经可塑性及音节的作用。

Neuropsychologia. 2014 Aug;61:299-314. doi: 10.1016/j.neuropsychologia.2014.06.017. Epub 2014 Jun 21.

A novel CAPTCHA solver framework using deep skipping Convolutional Neural Networks.一种使用深度跳跃卷积神经网络的新型验证码求解器框架。

PeerJ Comput Sci. 2022 Apr 6;8:e879. doi: 10.7717/peerj-cs.879. eCollection 2022.

New Cognitive Deep-Learning CAPTCHA.新型认知深度伪造验证码。

Sensors (Basel). 2023 Feb 20;23(4):2338. doi: 10.3390/s23042338.

引用本文的文献

Shape and word parts combine linearly in the Bouba-Kiki effect.在布巴-基基效应中，形状和单词部分线性组合。

Atten Percept Psychophys. 2025 Sep 2. doi: 10.3758/s13414-025-03151-1.

Convolutional networks can model the functional modulation of the MEG responses associated with feed-forward processes during visual word recognition.卷积网络可以对与视觉单词识别过程中的前馈过程相关的脑磁图反应的功能调制进行建模。

Elife. 2025 May 13;13:RP96217. doi: 10.7554/eLife.96217.

Visual homogeneity computations in the brain enable solving property-based visual tasks.大脑中的视觉同质性计算有助于解决基于属性的视觉任务。

Elife. 2025 Feb 18;13:RP93033. doi: 10.7554/eLife.93033.

Using compositionality to understand parts in whole objects.利用组合性理解整体对象中的部分。

Eur J Neurosci. 2022 Aug;56(4):4378-4392. doi: 10.1111/ejn.15746. Epub 2022 Jul 20.

本文引用的文献

Reconciling print-size and display-size constraints on reading.协调阅读时的打印尺寸和显示尺寸限制。

Proc Natl Acad Sci U S A. 2020 Dec 1;117(48):30276-30284. doi: 10.1073/pnas.2007514117. Epub 2020 Nov 9.

The inferior temporal cortex is a potential cortical precursor of orthographic processing in untrained monkeys.下颞叶皮层是未经训练的猴子在字形处理中的潜在皮质前体。

Nat Commun. 2020 Aug 4;11(1):3886. doi: 10.1038/s41467-020-17714-3.

A compositional neural code in high-level visual cortex can explain jumbled word reading.高级视觉皮层中的组合神经代码可以解释乱序单词阅读。

Elife. 2020 May 5;9:e54846. doi: 10.7554/eLife.54846.

Reading Increases the Compositionality of Visual Word Representations.阅读增加视觉词汇表示的组合性。

Psychol Sci. 2019 Dec;30(12):1707-1723. doi: 10.1177/0956797619881134. Epub 2019 Nov 7.

Extensive childhood experience with Pokémon suggests eccentricity drives organization of visual cortex.广泛的儿童时期玩《宝可梦》的经历表明，古怪的癖好驱动着视觉皮层的组织。

Nat Hum Behav. 2019 Jun;3(6):611-624. doi: 10.1038/s41562-019-0592-8. Epub 2019 May 6.

Multiplicative mixing of object identity and image attributes in single inferior temporal neurons.单个下颞叶神经元中物体身份和图像属性的乘法混合。

Proc Natl Acad Sci U S A. 2018 Apr 3;115(14):E3276-E3285. doi: 10.1073/pnas.1714287115. Epub 2018 Mar 20.

Symmetric Objects Become Special in Perception Because of Generic Computations in Neurons.对称物体在感知中变得特殊，是因为神经元中的通用计算。

Psychol Sci. 2018 Jan;29(1):95-109. doi: 10.1177/0956797617729808. Epub 2017 Dec 8.

A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs.一种具有高效数据利用能力的生成式视觉模型，可破解基于文本的验证码。

Science. 2017 Dec 8;358(6368). doi: 10.1126/science.aag2612. Epub 2017 Oct 26.

Deep Neural Networks: A New Framework for Modeling Biological Vision and Brain Information Processing.深度神经网络：一种用于模拟生物视觉和大脑信息处理的新框架。

Annu Rev Vis Sci. 2015 Nov 24;1:417-446. doi: 10.1146/annurev-vision-082114-035447.

Object attributes combine additively in visual search.在视觉搜索中，物体属性以相加的方式组合。

J Vis. 2016;16(5):8. doi: 10.1167/16.5.8.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

猴子 IT 中的可分离神经代码可实现完美的 CAPTCHA 解码。

A separable neural code in monkey IT enables perfect CAPTCHA decoding.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献