Wang Siyu, Cao Yuanjiang, Chen Xiaocong, Yao Lina, Wang Xianzhi, Sheng Quan Z
School of Computer Science and Engineering, University of New South Wales, Sydney, NSW, Australia.
School of Computer Science, University of Technology Sydney, Sydney, NSW, Australia.
Front Big Data. 2022 May 3;5:822783. doi: 10.3389/fdata.2022.822783. eCollection 2022.
Adversarial attacks, e.g., adversarial perturbations of the input and adversarial samples, pose significant challenges to machine learning and deep learning techniques, including interactive recommendation systems. The latent embedding space of those techniques makes adversarial attacks challenging to detect at an early stage. Recent advances in causality show that counterfactuals can also be regarded as a way to generate adversarial samples drawn from a distribution different from that of the training samples. We propose to explore adversarial examples and attack-agnostic detection on reinforcement learning (RL)-based interactive recommendation systems. We first craft different types of adversarial examples by adding perturbations to the input and intervening on the causal factors. Then, we augment recommendation systems with a deep learning-based classifier trained on the crafted data to detect potential attacks. Finally, we study the attack strength and frequency of adversarial examples and evaluate our model on standard datasets with multiple crafting methods. Our extensive experiments show that most adversarial attacks are effective, and that both attack strength and attack frequency impact the attack performance. The strategically-timed attack achieves comparable attack performance with only 1/3 to 1/2 of the attack frequency. Moreover, our white-box detector trained with one crafting method generalizes to several other crafting methods.
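The abstract does not specify which crafting methods were used; as a minimal illustration of "adding perturbations to the input," the sketch below applies an FGSM-style step (perturbing each feature by a small amount in the sign of the loss gradient) to a toy linear scorer. All names (`fgsm_perturb`, the weights `w`, the budget `eps`) are hypothetical and not taken from the paper.

```python
def fgsm_perturb(x, grad, eps=0.1):
    """FGSM-style step: shift each feature by eps in the sign of its gradient.
    The perturbation is bounded by eps in the infinity norm."""
    sign = lambda g: (g > 0) - (g < 0)
    return [xi + eps * sign(gi) for xi, gi in zip(x, grad)]

# Toy linear scorer score(x) = w·x with squared-error loss L = (w·x - y)^2.
w = [0.5, -1.0, 2.0]
x = [1.0, 0.0, 0.5]
y = 0.0
score = sum(wi * xi for wi, xi in zip(w, x))
# dL/dx = 2 * (w·x - y) * w, computed analytically for the linear model.
grad = [2.0 * (score - y) * wi for wi in w]
x_adv = fgsm_perturb(x, grad, eps=0.05)
score_adv = sum(wi * xi for wi, xi in zip(w, x_adv))
# The perturbed input increases the loss while staying within the eps budget.
```

The paper's second crafting route, intervening on causal factors to produce counterfactual samples, operates on the data-generating process rather than the input features and is not captured by this gradient-based sketch.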