Suppr超能文献

基于注意力机制的渐进式部分连接的神经架构搜索

Neural architecture search via progressive partial connection with attention mechanism.

作者信息

Jin Cong, Huang Jinjie, Chen Yuanjian

机构信息

School of Computer Science and Technology, Harbin University of Science and Technology, Harbin, 150080, China.

School of Automation, Harbin University of Science and Technology, Harbin, 150080, China.

出版信息

Sci Rep. 2024 Mar 18;14(1):6462. doi: 10.1038/s41598-024-57236-2.

Abstract

Differentiable architecture search requires a larger computational consumption during architecture search, and there exists the depth gap problem under deeper network architecture. In this paper, we propose an attention-based progressive partially connected neural architecture search method (PPCAtt-NAS) to address these two issues. First, we introduce a progressive search strategy in the architecture search phase, build up the sophistication of the architecture gradually and perform path-level pruning in stages to bridge the depth gap. Second, we adopt a partial search scheme that performs channel-level partial sampling of the network architecture to further reduce the computational complexity of the architecture search. In addition, an attention mechanism is devised to improve the architecture search capability by enhancing the relevance between the feature channels. Finally, we conduct extensive comparison experiments with state-of-the-art methods on several public datasets, and our method is able to present higher architecture performance.

摘要

可微架构搜索在架构搜索过程中需要更大的计算量,并且在更深的网络架构下存在深度差距问题。在本文中,我们提出了一种基于注意力的渐进式部分连接神经架构搜索方法(PPCAtt-NAS)来解决这两个问题。首先,我们在架构搜索阶段引入了一种渐进式搜索策略,逐步构建架构的复杂度,并分阶段进行路径级剪枝以弥合深度差距。其次,我们采用一种部分搜索方案,对网络架构进行通道级部分采样,以进一步降低架构搜索的计算复杂度。此外,设计了一种注意力机制,通过增强特征通道之间的相关性来提高架构搜索能力。最后,我们在几个公共数据集上与现有方法进行了广泛的比较实验,我们的方法能够呈现出更高的架构性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a52d/10948806/98584dd85715/41598_2024_57236_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验