Department of Earth and Environmental Sciences (DISAT), University of Milano-Bicocca, Piazza della Scienza 1, 20126 Milano, Italy.
Department of Physics, University of Milano-Bicocca, Piazza della Scienza 3, 20126 Milano, Italy.
Sensors (Basel). 2023 May 16;23(10):4797. doi: 10.3390/s23104797.
The goal of estimating a soundscape index, aimed at evaluating the contribution of the environmental sound components, is to provide an accurate "acoustic quality" assessment of a complex habitat. Such an index can prove to be a powerful ecological tool associated with both rapid on-site and remote surveys. The soundscape ranking index (SRI), introduced by us recently, can empirically account for the contribution of different sound sources by assigning a positive weight to natural sounds (biophony) and a negative weight to anthropogenic ones. The optimization of such weights was performed by training four machine learning algorithms (decision tree, DT; random forest, RF; adaptive boosting, AdaBoost; support vector machine, SVM) over a relatively small fraction of a labeled sound recording dataset. The sound recordings were taken at 16 sites distributed over an area of approximately 22 hectares at Parco Nord (Northern Park) of the city Milan (Italy). From the audio recordings, we extracted four different spectral features: two based on ecoacoustic indices and the other two based on mel-frequency cepstral coefficients (MFCCs). The labeling was focused on the identification of sounds belonging to biophonies and anthropophonies. This preliminary approach revealed that two classification models, DT and AdaBoost, trained by using 84 extracted features from each recording, are able to provide a set of weights characterized by a rather good classification performance (F1-score = 0.70, 0.71). The present results are in quantitative agreement with a self-consistent estimation of the mean SRI values at each site that was recently obtained by us using a different statistical approach.
估算一个旨在评估环境声音成分贡献的声音景观指数的目的是为一个复杂的栖息地提供一个准确的“声音质量”评估。这样的指数可以证明是一个与现场和远程调查都相关的强大生态工具。我们最近引入的声音景观排名指数(SRI)可以通过为自然声音(生物声音)分配正权重和为人为声音分配负权重,从经验上说明不同声源的贡献。通过在一个标记声音记录数据集的相对较小部分上训练四个机器学习算法(决策树、随机森林、自适应增强、支持向量机),对这些权重进行了优化。声音记录是在米兰市北部公园(Parco Nord)的 16 个地点采集的,该地区大约有 22 公顷。我们从音频记录中提取了四个不同的光谱特征:两个基于生态声学指数,另外两个基于梅尔频率倒谱系数(MFCCs)。标记集中于识别属于生物声音和人为声音的声音。这种初步方法表明,使用每个记录中提取的 84 个特征训练的两个分类模型,决策树和自适应增强,能够提供一组具有相当好分类性能(F1 分数=0.70,0.71)的权重。这些结果与我们最近使用不同的统计方法获得的每个地点的 SRI 均值的一致估计在定量上是一致的。