RNAS-CL：通过跨层知识蒸馏进行鲁棒的神经架构搜索

RNAS-CL: Robust Neural Architecture Search by Cross-Layer Knowledge Distillation.

作者信息

Nath Utkarsh, Wang Yancheng, Turaga Pavan, Yang Yingzhen

机构信息

School of Computing and Augmented Intelligence, Arizona State University, 699 S Mill Ave, Tempe, AZ 85281, USA.

School of Electrical, Computer and Energy Engineering, Arizona State University, 950 S Forest Mall, Tempe, AZ 85281, USA.

出版信息

Int J Comput Vis. 2024 Jun 24;132(12):5698-5717. doi: 10.1007/s11263-024-02133-4.

DOI:10.1007/s11263-024-02133-4

PMID:40313374

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12045325/

Abstract

Deep Neural Networks are often vulnerable to adversarial attacks. Neural Architecture Search (NAS), one of the tools for developing novel deep neural architectures, demonstrates superior performance in prediction accuracy in various machine learning applications. However, the performance of a neural architecture discovered by NAS against adversarial attacks has not been sufficiently studied, especially under the regime of knowledge distillation. Given the presence of a robust teacher, we investigate if NAS would produce a robust neural architecture by inheriting robustness from the teacher. In this paper, we propose Robust Neural Architecture Search by Cross-Layer knowledge distillation (RNAS-CL), a novel NAS algorithm that improves the robustness of NAS by learning from a robust teacher through cross-layer knowledge distillation. Unlike previous knowledge distillation methods that encourage close student-teacher output only in the last layer, RNAS-CL automatically searches for the best teacher layer to supervise each student layer. Experimental results demonstrate the effectiveness of RNAS-CL and show that RNAS-CL produces compact and adversarially robust neural architectures. Our results point to new approaches for finding compact and robust neural architecture for many applications. The code of RNAS-CL is available at https://github.com/Statistical-Deep-Learning/RNAS-CL.

摘要

深度神经网络通常容易受到对抗攻击。神经架构搜索（NAS）作为开发新型深度神经架构的工具之一，在各种机器学习应用中的预测准确性方面表现出卓越性能。然而，由NAS发现的神经架构在对抗攻击方面的性能尚未得到充分研究，尤其是在知识蒸馏的情况下。鉴于存在一个强大的教师模型，我们研究NAS是否会通过从教师模型继承鲁棒性来产生一个鲁棒的神经架构。在本文中，我们提出了通过跨层知识蒸馏的鲁棒神经架构搜索（RNAS-CL），这是一种新颖的NAS算法，通过跨层知识蒸馏从一个鲁棒的教师模型学习来提高NAS的鲁棒性。与以往仅在最后一层鼓励学生-教师输出接近的知识蒸馏方法不同，RNAS-CL自动搜索最佳教师层来监督每个学生层。实验结果证明了RNAS-CL的有效性，并表明RNAS-CL产生了紧凑且对抗鲁棒的神经架构。我们的结果为许多应用寻找紧凑且鲁棒的神经架构指出了新的方法。RNAS-CL的代码可在https://github.com/Statistical-Deep-Learning/RNAS-CL获取。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

RNAS-CL：通过跨层知识蒸馏进行鲁棒的神经架构搜索

RNAS-CL: Robust Neural Architecture Search by Cross-Layer Knowledge Distillation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

RNAS-CL：通过跨层知识蒸馏进行鲁棒的神经架构搜索

RNAS-CL: Robust Neural Architecture Search by Cross-Layer Knowledge Distillation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献