Suppr超能文献

基于注意力机制融合的混合多分支卷积神经网络在噪声环境下的煤矸石声音识别

Coal-gangue sound recognition using hybrid multi-branch CNN based on attention mechanism fusion in noisy environments.

作者信息

Song Qingjun, Hao Wenchao, Song Qinghui, Jiang Haiyan, Li Kai, Sun Shirong

机构信息

College of Intelligent Equipment, Shandong University of Science and Technology, Shandong, 271000, Taian, China.

出版信息

Sci Rep. 2024 Oct 9;14(1):23644. doi: 10.1038/s41598-024-74308-5.

Abstract

The coal-gangue recognition technology plays an important role in the intelligent realization of fully mechanized caving face and the improvement of coal quality. Although great progress has been made for the coal-gangue recognition in recent years, most of them have not taken into account the impact of the complex environment of top coal caving on recognition performance. Herein, a hybrid multi-branch convolutional neural network (HMBCNN) is proposed for coal-gangue recognition, which based on improved Mel Frequency Cepstral Coefficient (MFCC) as well as Mel spectrogram, and attention mechanism. Firstly, the MFCC and its smooth feature matrix are input into each branch of one-dimensional multi-branch convolutional neural network, and the spliced features are extracted adaptively through multi-head attention mechanism. Secondly, the Mel spectrogram and its first-order derivative are input into each branch of the two-dimensional multi-branch convolutional neural network respectively, and the effective time-frequency information is paid attention to through the soft attention mechanism. Finally, at the decision-making level, the two networks are fused to establish a model for feature fusion and classification, obtaining optimal fusion strategies for different features and networks. A database of sound pressure signals under different signal-to-noise ratios and equipment operations is constructed based on a large amount of data collected in the laboratory and on-site. Comparative experiments and discussions are conducted on this database with advanced algorithms and different neural network structures. The results show that the proposed method achieves higher recognition accuracy and better robustness in noisy environments.

摘要

煤矸石识别技术在综采放顶煤工作面智能化实现及煤炭质量提升方面发挥着重要作用。尽管近年来煤矸石识别取得了很大进展,但大多数方法都没有考虑放顶煤复杂环境对识别性能的影响。在此,提出了一种基于改进梅尔频率倒谱系数(MFCC)、梅尔频谱图以及注意力机制的混合多分支卷积神经网络(HMBCNN)用于煤矸石识别。首先,将MFCC及其平滑特征矩阵输入到一维多分支卷积神经网络的各分支中,通过多头注意力机制自适应提取拼接特征。其次,将梅尔频谱图及其一阶导数分别输入到二维多分支卷积神经网络的各分支中,通过软注意力机制关注有效时频信息。最后,在决策层面,将两个网络融合建立特征融合与分类模型,获得不同特征和网络的最优融合策略。基于在实验室和现场收集的大量数据,构建了不同信噪比和设备运行情况下的声压信号数据库。利用先进算法和不同神经网络结构在该数据库上进行了对比实验和讨论。结果表明,所提方法在噪声环境下具有更高的识别准确率和更好的鲁棒性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f25/11464836/a7ca0cfa8d53/41598_2024_74308_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验