Wei Xingxing, Zhao Shiji, Li Bo
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):8870-8882. doi: 10.1109/TPAMI.2024.3411035. Epub 2024 Nov 6.
Adversarial attacks have been proven to be potential threats to Deep Neural Networks (DNNs), and many methods are proposed to defend against adversarial attacks. However, while enhancing the robustness, the accuracy for clean examples will decline to a certain extent, implying a trade-off existed between the accuracy and adversarial robustness. In this paper, to meet the trade-off problem, we theoretically explore the underlying reason for the difference of the filters' weight distribution between standard-trained and robust-trained models and then argue that this is an intrinsic property for static neural networks, thus they are difficult to fundamentally improve the accuracy and adversarial robustness at the same time. Based on this analysis, we propose a sample-wise dynamic network architecture named Adversarial Weight-Varied Network (AW-Net), which focuses on dealing with clean and adversarial examples with a "divide and rule" weight strategy. The AW-Net adaptively adjusts the network's weights based on regulation signals generated by an adversarial router, which is directly influenced by the input sample. Benefiting from the dynamic network architecture, clean and adversarial examples can be processed with different network weights, which provides the potential to enhance both accuracy and adversarial robustness. A series of experiments demonstrate that our AW-Net is architecture-friendly to handle both clean and adversarial examples and can achieve better trade-off performance than state-of-the-art robust models.
对抗攻击已被证明是深度神经网络(DNN)的潜在威胁,并且人们提出了许多方法来抵御对抗攻击。然而,在增强鲁棒性的同时,干净样本的准确率会在一定程度上下降,这意味着在准确率和对抗鲁棒性之间存在权衡。在本文中,为了解决这种权衡问题,我们从理论上探究了标准训练模型和鲁棒训练模型之间滤波器权重分布差异的根本原因,然后认为这是静态神经网络的固有属性,因此它们难以同时从根本上提高准确率和对抗鲁棒性。基于此分析,我们提出了一种逐样本动态网络架构,称为对抗权重变化网络(AW-Net),它专注于采用“分而治之”的权重策略来处理干净样本和对抗样本。AW-Net基于对抗路由器生成的调节信号自适应地调整网络权重,该对抗路由器直接受输入样本影响。受益于动态网络架构,干净样本和对抗样本可以用不同的网络权重进行处理,这为提高准确率和对抗鲁棒性提供了潜力。一系列实验表明,我们的AW-Net在处理干净样本和对抗样本方面对架构友好,并且可以比现有最先进的鲁棒模型实现更好的权衡性能。