Minimum Adversarial Examples.

Authors

Du Zhenyu, Liu Fangzheng, Yan Xuehu

Affiliation

College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China.

Publication

Entropy (Basel). 2022 Mar 12;24(3):396. doi: 10.3390/e24030396.

Abstract

Deep neural networks used in information security face a severe threat from adversarial examples (AEs). Existing AE generation methods use one of two optimization models: (1) take a successful attack as the objective function and a limit on the perturbation as the constraint; or (2) take the minimum adversarial perturbation as the objective and a successful attack as the constraint. Both involve two fundamental problems of AEs: what is the minimum boundary for constructing AEs, and whether that boundary is reachable, i.e., whether an AE that successfully attacks the model exists exactly at that boundary. Previous optimization models give no complete answer to these problems. In this paper, for the first problem, we propose a definition of the minimum AE and give a theoretical lower bound on its amplitude. For the second problem, we prove that generating the minimum AE is an NP-complete problem and, given its computational intractability, establish a new, third optimization model. This model is general and can adapt to any constraint. To verify it, we devise two concrete methods for generating controllable AEs under the most widely used distance measures for adversarial perturbations: the Lp constraint and the SSIM (structural similarity) constraint. The model bounds the amplitude of the AEs, reduces the search cost over the solution space, and thereby further improves efficiency. In theory, the AEs generated by the new model lie closer to the true minimum adversarial boundary, which overcomes the blindness of the amplitude settings used by existing methods and further improves the attack success rate. In addition, the model can generate accurate AEs with controllable amplitude under different constraints, making it suitable for different application scenarios. Finally, extensive experiments demonstrate better attack ability than other baseline attacks under the same constraints: on all datasets tested, the attack success rate of our method improves by approximately 10% over the baselines.
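To make optimization model (1) from the abstract concrete, here is a minimal, hedged sketch: projected gradient ascent on the attack loss subject to an L-infinity bound on the perturbation (an FGSM/PGD-style illustration of the constrained formulation, not the paper's own algorithm). The toy logistic classifier is a hypothetical stand-in for a deep network.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss(w, b, x, y):
    # Binary cross-entropy for a single example.
    p = sigmoid(w @ x + b)
    return -(y * np.log(p + 1e-12) + (1 - y) * np.log(1 - p + 1e-12))

def pgd_linf(w, b, x, y, eps=0.1, steps=10, alpha=0.03):
    """Maximize the loss (objective) subject to ||delta||_inf <= eps (constraint)."""
    delta = np.zeros_like(x)
    for _ in range(steps):
        p = sigmoid(w @ (x + delta) + b)
        grad = (p - y) * w                       # d(loss)/dx for logistic regression
        delta = np.clip(delta + alpha * np.sign(grad), -eps, eps)  # project back
    return x + delta

rng = np.random.default_rng(0)
w = rng.normal(size=5)
b = 0.0
x = rng.normal(size=5)
y = 1.0

x_adv = pgd_linf(w, b, x, y)
print(np.max(np.abs(x_adv - x)) <= 0.1 + 1e-9)   # perturbation stays inside the ball
print(loss(w, b, x_adv, y) >= loss(w, b, x, y))  # attack loss did not decrease
```

Model (2) would instead minimize a norm of `delta` under a misclassification constraint; the paper's third model differs from both by bounding the amplitude while remaining adaptable to constraints such as Lp or SSIM.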

Figure A1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/702b/8947511/425906a4edf4/entropy-24-00396-g0A1.jpg
