Adversarial Training With Anti-Adversaries.

Author Information

Zhou Xiaoling, Wu Ou, Yang Nan

Publication Information

IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):10210-10227. doi: 10.1109/TPAMI.2024.3432973. Epub 2024 Nov 6.

Abstract

Adversarial training is effective in improving the robustness of deep neural networks. However, existing approaches still exhibit significant drawbacks in terms of model robustness, generalization, and fairness. In this study, we validate the importance of different perturbation directions (i.e., adversarial and anti-adversarial) and bounds from both theoretical and practical perspectives. The influence of adversarial training on deep learning models in terms of fairness, robustness, and generalization is theoretically investigated under a more general perturbation scope in which different samples can have different perturbation directions and varied perturbation bounds. Our theoretical explorations suggest that, compared with standard adversarial training, combining adversaries and anti-adversaries with varied bounds during training can achieve better fairness among classes and a better tradeoff among robustness, accuracy, and fairness in some typical learning scenarios. Inspired by these theoretical findings, we present a more general learning objective that combines adversaries and anti-adversaries with varied bounds on each training sample. To solve this objective, two adversarial training frameworks based on meta-learning and reinforcement learning are proposed, in which the perturbation direction and bound for each sample are determined by its training characteristics. Furthermore, the role of the combination strategy with varied bounds is explained from a regularization perspective. Extensive experiments under different learning scenarios verify our theoretical findings and the effectiveness of the proposed methodology.
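The core idea of per-sample signed perturbations can be illustrated with a minimal FGSM-style sketch. This is an assumption-laden toy illustration, not the paper's method: the paper determines each sample's direction and bound via meta-learning or reinforcement learning, whereas here both are fixed by hand, and the function name `perturb` is hypothetical.

```python
import numpy as np

def perturb(x, grad, direction, bound):
    """Apply a signed, per-sample L-inf perturbation (FGSM-style sketch).

    direction = +1: adversarial (ascend the loss);
    direction = -1: anti-adversarial (descend the loss).
    bound is the per-sample perturbation budget epsilon_i.
    """
    return x + direction * bound * np.sign(grad)

# Toy example: a linear loss L(x) = w . x, so grad_x L = w.
w = np.array([0.5, -2.0, 1.0])
x = np.zeros(3)

x_adv  = perturb(x, w, direction=+1, bound=0.10)  # moves along sign(w)
x_anti = perturb(x, w, direction=-1, bound=0.05)  # moves against sign(w)
```

In a full training loop, `grad` would be the gradient of the loss with respect to the input, and `direction` and `bound` would vary per sample as learned quantities rather than constants.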
