基于改进YOLOv4算法的声相云图评估方法研究

Study on the Evaluation Method of Sound Phase Cloud Maps Based on an Improved YOLOv4 Algorithm.

作者信息

Zhu Qinfeng, Zheng Huifeng, Wang Yuebing, Cao Yonggang, Guo Shixu

机构信息

Key Laboratory of Acoustics Research, China Jiliang University, Hangzhou 310018, China.

出版信息

Sensors (Basel). 2020 Aug 2;20(15):4314. doi: 10.3390/s20154314.

DOI:10.3390/s20154314

PMID:32748865

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7436077/

Abstract

Most sound imaging instruments are currently used as measurement tools which can provide quantitative data, however, a uniform method to directly and comprehensively evaluate the results of combining acoustic and optical images is not available. Therefore, in this study, we define a localization error index for sound imaging instruments, and propose an acoustic phase cloud map evaluation method based on an improved YOLOv4 algorithm to directly and objectively evaluate the sound source localization results of a sound imaging instrument. The evaluation method begins with the image augmentation of acoustic phase cloud maps obtained from the different tests of a sound imaging instrument to produce the dataset required for training the convolutional network. Subsequently, we combine DenseNet with existing clustering algorithms to improve the YOLOv4 algorithm to train the neural network for easier feature extraction. The trained neural network is then used to localize the target sound source and its pseudo-color map in the acoustic phase cloud map to obtain a pixel-level localization error. Finally, a standard chessboard grid is used to obtain the proportional relationship between the size of the acoustic phase cloud map and the actual physical space distance; then, the true lateral and longitudinal positioning error of sound imaging instrument can be obtained. Experimental results show that the mean average precision of the improved YOLOv4 algorithm in acoustic phase cloud map detection is 96.3%, the F1-score is 95.2%, and detection speed is up to 34.6 fps. The improved algorithm can rapidly and accurately determine the positioning error of sound imaging instrument, which can be used to analyze and evaluate the positioning performance of sound imaging instrument.

摘要

目前，大多数声学成像仪器都被用作测量工具，能够提供定量数据，然而，尚无一种统一的方法来直接、全面地评估声学图像与光学图像相结合的结果。因此，在本研究中，我们定义了一种声学成像仪器的定位误差指标，并提出了一种基于改进的YOLOv4算法的声相云图评估方法，以直接、客观地评估声学成像仪器的声源定位结果。该评估方法首先对声学成像仪器不同测试获得的声相云图进行图像增强，以生成训练卷积网络所需的数据集。随后，我们将DenseNet与现有的聚类算法相结合来改进YOLOv4算法，以便更轻松地提取特征来训练神经网络。然后，使用训练好的神经网络在声相云图中定位目标声源及其伪彩色图，以获得像素级定位误差。最后，使用标准棋盘格来获取声相云图尺寸与实际物理空间距离之间的比例关系；进而可以得到声学成像仪器真实的横向和纵向定位误差。实验结果表明，改进后的YOLOv4算法在声相云图检测中的平均精度为96.3%，F1分数为95.2%，检测速度高达34.6帧/秒。改进后的算法能够快速、准确地确定声学成像仪器的定位误差，可用于分析和评估声学成像仪器的定位性能。