Suppr超能文献

深度卷积堆叠在水下声目标识别中的应用

Deep convolution stack for waveform in underwater acoustic target recognition.

机构信息

Big Data Research Center, University of Electronic Science and Technology of China, Chengdu, 611731, China.

The Research Base of Digital Culture and Media, Sichuan Provincial Key Research Base of Social Science, Chengdu, 611731, China.

出版信息

Sci Rep. 2021 May 5;11(1):9614. doi: 10.1038/s41598-021-88799-z.

Abstract

In underwater acoustic target recognition, deep learning methods have been proved to be effective on recognizing original signal waveform. Previous methods often utilize large convolutional kernels to extract features at the beginning of neural networks. It leads to a lack of depth and structural imbalance of networks. The power of nonlinear transformation brought by deep network has not been fully utilized. Deep convolution stack is a kind of network frame with flexible and balanced structure and it has not been explored well in underwater acoustic target recognition, even though such frame has been proven to be effective in other deep learning fields. In this paper, a multiscale residual unit (MSRU) is proposed to construct deep convolution stack network. Based on MSRU, a multiscale residual deep neural network (MSRDN) is presented to classify underwater acoustic target. Dataset acquired in a real-world scenario is used to verify the proposed unit and model. By adding MSRU into Generative Adversarial Networks, the validity of MSRU is proved. Finally, MSRDN achieves the best recognition accuracy of 83.15%, improved by 6.99% from the structure related networks which take the original signal waveform as input and 4.48% from the networks which take the time-frequency representation as input.

摘要

在水下声目标识别中,深度学习方法已被证明在识别原始信号波形方面非常有效。以前的方法通常在神经网络的开始使用大卷积核来提取特征。这导致网络的深度和结构不平衡,深度网络带来的非线性变换能力没有得到充分利用。深度卷积堆叠是一种具有灵活和平衡结构的网络框架,尽管在其他深度学习领域已经证明了其有效性,但在水下声目标识别中尚未得到充分探索。本文提出了一种多尺度残差单元 (MSRU) 来构建深度卷积堆叠网络。基于 MSRU,提出了一种多尺度残差深度神经网络 (MSRDN) 来对水下声目标进行分类。使用真实场景中获取的数据集来验证所提出的单元和模型。通过在生成对抗网络中添加 MSRU,证明了 MSRU 的有效性。最后,MSRDN 实现了最佳识别准确率 83.15%,与以原始信号波形作为输入的相关结构网络相比提高了 6.99%,与以时频表示作为输入的网络相比提高了 4.48%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a8c/8099869/b23b0ee64987/41598_2021_88799_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验