Suppr超能文献

深度卷积堆叠在水下声目标识别中的应用

Deep convolution stack for waveform in underwater acoustic target recognition.

机构信息

Big Data Research Center, University of Electronic Science and Technology of China, Chengdu, 611731, China.

The Research Base of Digital Culture and Media, Sichuan Provincial Key Research Base of Social Science, Chengdu, 611731, China.

出版信息

Sci Rep. 2021 May 5;11(1):9614. doi: 10.1038/s41598-021-88799-z.

Abstract

In underwater acoustic target recognition, deep learning methods have been proved to be effective on recognizing original signal waveform. Previous methods often utilize large convolutional kernels to extract features at the beginning of neural networks. It leads to a lack of depth and structural imbalance of networks. The power of nonlinear transformation brought by deep network has not been fully utilized. Deep convolution stack is a kind of network frame with flexible and balanced structure and it has not been explored well in underwater acoustic target recognition, even though such frame has been proven to be effective in other deep learning fields. In this paper, a multiscale residual unit (MSRU) is proposed to construct deep convolution stack network. Based on MSRU, a multiscale residual deep neural network (MSRDN) is presented to classify underwater acoustic target. Dataset acquired in a real-world scenario is used to verify the proposed unit and model. By adding MSRU into Generative Adversarial Networks, the validity of MSRU is proved. Finally, MSRDN achieves the best recognition accuracy of 83.15%, improved by 6.99% from the structure related networks which take the original signal waveform as input and 4.48% from the networks which take the time-frequency representation as input.

摘要

在水下声目标识别中,深度学习方法已被证明在识别原始信号波形方面非常有效。以前的方法通常在神经网络的开始使用大卷积核来提取特征。这导致网络的深度和结构不平衡,深度网络带来的非线性变换能力没有得到充分利用。深度卷积堆叠是一种具有灵活和平衡结构的网络框架,尽管在其他深度学习领域已经证明了其有效性,但在水下声目标识别中尚未得到充分探索。本文提出了一种多尺度残差单元 (MSRU) 来构建深度卷积堆叠网络。基于 MSRU,提出了一种多尺度残差深度神经网络 (MSRDN) 来对水下声目标进行分类。使用真实场景中获取的数据集来验证所提出的单元和模型。通过在生成对抗网络中添加 MSRU,证明了 MSRU 的有效性。最后,MSRDN 实现了最佳识别准确率 83.15%,与以原始信号波形作为输入的相关结构网络相比提高了 6.99%,与以时频表示作为输入的网络相比提高了 4.48%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a8c/8099869/b23b0ee64987/41598_2021_88799_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验