National Key Laboratory of Science and Technology on Communication, University of Electronic Science and Technology of China, Chengdu 610000, China.
Sensors (Basel). 2022 Jul 26;22(15):5598. doi: 10.3390/s22155598.
Underwater acoustic signal separation is a key technique for underwater communications. The existing methods are mostly model-based, and cannot accurately characterize the practical underwater acoustic communication environment. They are only suitable for binary signal separation and cannot handle multivariate signal separation. However, recurrent neural networks (RNNs) show a powerful ability to extract the features of temporal sequences. Inspired by this, in this paper, we present a data-driven approach for underwater acoustic signal separation using deep learning technology. We use a bidirectional long short-term memory (Bi-LSTM) approach to explore the features of a time-frequency (T-F) mask, and propose a T-F-mask-aware Bi-LSTM for signal separation. Taking advantage of the sparseness of the T-F image, the designed Bi-LSTM network is able to extract the discriminative features for separation, which further improves the separation performance. In particular, this method breaks through the limitations of the existing methods and not only achieves good results in multivariate separation but also effectively separates signals when they are mixed with 40 dB Gaussian noise signals. The experimental results show that this method can achieve a 97% guarantee ratio (PSR), and the average similarity coefficient of the multivariate signal separation is stable above 0.8 under high noise conditions. It should be noted that our model can only handle known signals such as test signals for calibration.
水下声信号分离是水下通信的关键技术。现有的方法大多基于模型,无法准确地描述实际的水下声通信环境。它们仅适用于二进制信号分离,无法处理多元信号分离。然而,递归神经网络 (RNN) 在提取时间序列特征方面表现出强大的能力。受此启发,本文提出了一种基于深度学习技术的水下声信号分离的数据驱动方法。我们使用双向长短时记忆 (Bi-LSTM) 方法来探索时频 (T-F) 掩模的特征,并提出了一种 T-F 掩模感知的 Bi-LSTM 用于信号分离。利用 T-F 图像的稀疏性,设计的 Bi-LSTM 网络能够提取出用于分离的判别特征,从而进一步提高分离性能。特别是,该方法突破了现有方法的局限性,不仅在多元分离中取得了良好的效果,而且在与 40dB 高斯噪声信号混合时也能有效地分离信号。实验结果表明,该方法可以达到 97%的保证比 (PSR),并且在高噪声条件下,多元信号分离的平均相似系数稳定在 0.8 以上。需要注意的是,我们的模型只能处理已知信号,例如用于校准的测试信号。