基于声学嵌入记忆单元改进的空间自编码器的自监督声学表示学习用于水下目标识别

Self-supervised acoustic representation learning via acoustic-embedding memory unit modified space autoencoder for underwater target recognition.

作者信息

Wang Xingmei, Meng Jiaxiang, Liu Yangtao, Zhan Ge, Tian Zhaonan

机构信息

College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China.

出版信息

J Acoust Soc Am. 2022 Nov;152(5):2905. doi: 10.1121/10.0015138.

DOI:10.1121/10.0015138

PMID:36456286

Abstract

Since the expensive annotation of high-quality signals obtained from passive sonars and the weak generalization ability of the single feature in the ocean, this paper proposes the self-supervised acoustic representation learning under acoustic-embedding memory unit modified space autoencoder (ASAE) and performs the underwater target recognition task. In the manner of the animal-like acoustic auditory system, the first step is to design a self-supervised representation learning method called space autoencoder (SAE) to merge Mel filter-bank (FBank) with the acoustic discrimination and gammatone filter-bank (GBank) with the anti-noise robustness into SAE spectrogram (SAE Spec). Meanwhile, due to poor high-level semantic information in SAE Spec, an acoustic-embedding memory unit (AEMU) is introduced as the strategy of adversarial enhancement. During the auxiliary task, more negative samples are joined in the improved contrastive loss function to obtain adversarial enhanced features called ASAE spectrogram (ASAE Spec). Ultimately, the comprehensive contrast experiments and ablation experiments on two underwater datasets show that ASAE Spec increases by more than 0.96% in accuracy, convergence rate, and anti-noise robustness of other mainstream acoustic features. The results prove the potential value of ASAE in practical applications.

摘要

由于被动声纳获取的高质量信号标注成本高昂，且海洋中单一特征的泛化能力较弱，本文提出了基于声学嵌入记忆单元改进的空间自编码器（ASAE）的自监督声学表示学习方法，并执行水下目标识别任务。以类动物声学听觉系统的方式，第一步是设计一种名为空间自编码器（SAE）的自监督表示学习方法，将具有声学辨别能力的梅尔滤波器组（FBank）和具有抗噪声鲁棒性的伽马通滤波器组（GBank）合并到SAE频谱图（SAE Spec）中。同时，由于SAE Spec中高级语义信息较差，引入了声学嵌入记忆单元（AEMU）作为对抗增强策略。在辅助任务期间，在改进的对比损失函数中加入更多负样本，以获得称为ASAE频谱图（ASAE Spec）的对抗增强特征。最终，在两个水下数据集上进行的综合对比实验和消融实验表明，ASAE Spec在准确率、收敛速度和抗噪声鲁棒性方面比其他主流声学特征提高了超过0.96%。结果证明了ASAE在实际应用中的潜在价值。