Baldini Gianmarco, Amerini Irene
European Commission, Joint Research Centre, 21027 Ispra, Italy.
Department of Computer Science, Sapienza University of Rome, 00185 Roma, Italy.
Entropy (Basel). 2020 Oct 30;22(11):1235. doi: 10.3390/e22111235.
Research findings have shown that microphones can be uniquely identified by audio recordings since physical features of the microphone components leave repeatable and distinguishable traces on the audio stream. This property can be exploited in security applications to perform the identification of a mobile phone through the built-in microphone. The problem is to determine an accurate but also efficient representation of the physical characteristics, which is not known a priori. Usually there is a trade-off between the identification accuracy and the time requested to perform the classification. Various approaches have been used in literature to deal with it, ranging from the application of handcrafted statistical features to the recent application of deep learning techniques. This paper evaluates the application of different entropy measures (Shannon Entropy, Permutation Entropy, Dispersion Entropy, Approximate Entropy, Sample Entropy, and Fuzzy Entropy) and their suitability for microphone classification. The analysis is validated against an experimental dataset of built-in microphones of 34 mobile phones, stimulated by three different audio signals. The findings show that selected entropy measures can provide a very high identification accuracy in comparison to other statistical features and that they can be robust against the presence of noise. This paper performs an extensive analysis based on filter features selection methods to identify the most discriminating entropy measures and the related hyper-parameters (e.g., embedding dimension). Results on the trade-off between accuracy and classification time are also presented.
研究结果表明,由于麦克风组件的物理特征会在音频流上留下可重复且可区分的痕迹,因此可以通过音频记录对麦克风进行唯一识别。这一特性可用于安全应用中,通过内置麦克风对手机进行识别。问题在于确定物理特征的准确且高效的表示形式,而这是无法事先得知的。通常在识别精度和执行分类所需的时间之间存在权衡。文献中已使用了各种方法来处理这一问题,从手工制作的统计特征的应用到深度学习技术的最新应用。本文评估了不同熵度量(香农熵、排列熵、离散熵、近似熵、样本熵和模糊熵)的应用及其对麦克风分类的适用性。针对由三种不同音频信号激励的34部手机的内置麦克风实验数据集对该分析进行了验证。研究结果表明,与其他统计特征相比,选定的熵度量可以提供非常高的识别精度,并且它们对噪声的存在具有鲁棒性。本文基于滤波器特征选择方法进行了广泛分析,以识别最具区分性的熵度量和相关超参数(例如嵌入维度)。还给出了精度和分类时间之间权衡的结果。