通过度量学习实现多模态数据的对抗鲁棒性。

Towards Adversarial Robustness for Multi-Mode Data through Metric Learning.

机构信息

Research Center for Information Technology Innovation, Academia Sinica, Taipei 11529, Taiwan.

Social Networks Human-Centered Computing, Taiwan International Graduate Program, Academia Sinica, Taipei 11529, Taiwan.

出版信息

Sensors (Basel). 2023 Jul 5;23(13):6173. doi: 10.3390/s23136173.

DOI:10.3390/s23136173

PMID:37448021

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10346491/

Abstract

Adversarial attacks have become one of the most serious security issues in widely used deep neural networks. Even though real-world datasets usually have large intra-variations or multiple modes, most adversarial defense methods, such as adversarial training, which is currently one of the most effective defense methods, mainly focus on the single-mode setting and thus fail to capture the full data representation to defend against adversarial attacks. To confront this challenge, we propose a novel multi-prototype metric learning regularization for adversarial training which can effectively enhance the defense capability of adversarial training by preventing the latent representation of the adversarial example changing a lot from its clean one. With extensive experiments on CIFAR10, CIFAR100, MNIST, and Tiny ImageNet, the evaluation results show the proposed method improves the performance of different state-of-the-art adversarial training methods without additional computational cost. Furthermore, besides Tiny ImageNet, in the multi-prototype CIFAR10 and CIFAR100 where we reorganize the whole datasets of CIFAR10 and CIFAR100 into two and ten classes, respectively, the proposed method outperforms the state-of-the-art approach by 2.22% and 1.65%, respectively. Furthermore, the proposed multi-prototype method also outperforms its single-prototype version and other commonly used deep metric learning approaches as regularization for adversarial training and thus further demonstrates its effectiveness.

摘要

对抗攻击已成为广泛使用的深度神经网络中最严重的安全问题之一。尽管现实世界的数据集通常具有较大的内部变化或多种模式，但大多数对抗防御方法，如对抗训练，这是目前最有效的防御方法之一，主要集中在单模式设置，因此无法充分捕捉数据表示以抵御对抗攻击。为了应对这一挑战，我们提出了一种新颖的多原型度量学习正则化方法，用于对抗训练，可以通过防止对抗样本的潜在表示与其干净样本的变化很大，有效提高对抗训练的防御能力。在 CIFAR10、CIFAR100、MNIST 和 Tiny ImageNet 上进行了广泛的实验，评估结果表明，该方法在不增加计算成本的情况下，提高了不同最先进的对抗训练方法的性能。此外，除了 Tiny ImageNet 之外，在我们分别将 CIFAR10 和 CIFAR100 的整个数据集重新组织为两个和十个类别的多原型 CIFAR10 和 CIFAR100 中，该方法的性能优于最先进的方法分别为 2.22%和 1.65%。此外，所提出的多原型方法也优于其单原型版本和其他常用的深度度量学习方法作为对抗训练的正则化方法，从而进一步证明了其有效性。