Escuela Politécnica. Universidad Católica de Murcia (UCAM), Campus de los Jeronimos, 30107, Guadalupe, Spain.
Computer and Systems Department (DISCA). Universitat Politècnica de València (UPV), Camino de Vera, s/n, 46022, Valencia, Spain.
Sensors (Basel). 2020 Feb 7;20(3):903. doi: 10.3390/s20030903.
Wireless acoustic sensor networks are nowadays an essential tool for noise pollution monitoring and managing in cities. The increased computing capacity of the nodes that create the network is allowing the addition of processing algorithms and artificial intelligence that provide more information about the sound sources and environment, e.g., detect sound events or calculate loudness. Several models to predict sound pressure levels in cities are available, mainly road, railway and aerial traffic noise. However, these models are mostly based in auxiliary data, e.g., vehicles flow or street geometry, and predict equivalent levels for a temporal long-term. Therefore, forecasting of temporal short-term sound levels could be a helpful tool for urban planners and managers. In this work, a Long Short-Term Memory (LSTM) deep neural network technique is proposed to model temporal behavior of sound levels at a certain location, both sound pressure level and loudness level, in order to predict near-time future values. The proposed technique can be trained for and integrated in every node of a sensor network to provide novel functionalities, e.g., a method of early warning against noise pollution and of backup in case of node or network malfunction. To validate this approach, one-minute period equivalent sound levels, captured in a two-month measurement campaign by a node of a deployed network of acoustic sensors, have been used to train it and to obtain different forecasting models. Assessments of the developed LSTM models and Auto regressive integrated moving average models were performed to predict sound levels for several time periods, from 1 to 60 min. Comparison of the results show that the LSTM models outperform the statistics-based models. In general, the LSTM models achieve a prediction of values with a mean square error less than 4.3 dB for sound pressure level and less than 2 phons for loudness. Moreover, the goodness of fit of the LSTM models and the behavior pattern of the data in terms of prediction of sound levels are satisfactory.
无线声传感器网络如今是城市噪声污染监测和管理的重要工具。创建网络的节点的计算能力不断提高,使得可以添加处理算法和人工智能,从而提供有关声源和环境的更多信息,例如,检测声音事件或计算响度。有几种预测城市噪声的模型,主要是道路交通、铁路和航空交通噪声。然而,这些模型大多基于辅助数据,例如车辆流量或街道几何形状,并预测时间长期的等效水平。因此,短期声级的预测可能是城市规划者和管理者的有用工具。在这项工作中,提出了一种长短期记忆(LSTM)深度神经网络技术,用于对特定位置的声级(包括声压级和响度级)的时间行为进行建模,以便预测近时间未来的值。该技术可以针对每个传感器网络节点进行训练和集成,以提供新的功能,例如针对噪声污染的预警方法和在节点或网络出现故障时的备份方法。为了验证这种方法,使用在已部署的声传感器网络中的一个节点进行的为期两个月的测量活动中捕获的一分钟等效声级来对其进行训练并获得不同的预测模型。对开发的 LSTM 模型和自回归综合移动平均模型进行了评估,以预测几个时间段的声级,从 1 分钟到 60 分钟。结果表明,LSTM 模型优于基于统计的模型。通常,LSTM 模型可以将声压级的预测值的均方误差控制在 4.3dB 以内,将响度的预测值的均方误差控制在 2 个音分以内。此外,LSTM 模型的拟合优度和数据的预测行为模式都令人满意。