Data61, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Eveleigh, NSW 2015, Australia;
Data61, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Eveleigh, NSW 2015, Australia.
Proc Natl Acad Sci U S A. 2020 Nov 17;117(46):29221-29228. doi: 10.1073/pnas.2016921117. Epub 2020 Nov 4.
Adversarial examples are carefully crafted input patterns that are surprisingly poorly classified by artificial and/or natural neural networks. Here we examine adversarial vulnerabilities in the processes responsible for learning and choice in humans. Building upon recent recurrent neural network models of choice processes, we propose a general framework for generating adversarial opponents that can shape the choices of individuals in particular decision-making tasks toward the behavioral patterns desired by the adversary. We show the efficacy of the framework through three experiments involving action selection, response inhibition, and social decision-making. We further investigate the strategy used by the adversary in order to gain insights into the vulnerabilities of human choice. The framework may find applications across behavioral sciences in helping detect and avoid flawed choice.
对抗样本是精心设计的输入模式,这些模式令人惊讶地被人工和/或自然神经网络错误分类。在这里,我们研究了人类学习和决策过程中的对抗漏洞。基于最近的选择过程的递归神经网络模型,我们提出了一个通用框架,用于生成对抗对手,这些对手可以将个人在特定决策任务中的选择朝着对手期望的行为模式塑造。我们通过三个涉及动作选择、反应抑制和社会决策的实验展示了该框架的有效性。我们进一步研究了对手使用的策略,以便深入了解人类选择的脆弱性。该框架可以在帮助检测和避免有缺陷的选择方面应用于行为科学的各个领域。