Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824, USA.
Sensors (Basel). 2022 Oct 12;22(20):7736. doi: 10.3390/s22207736.
In a world dependent on road-based transportation, it is essential to understand automobiles. We propose an acoustic road vehicle characterization system as an integrated approach for using sound captured by mobile devices to enhance transparency and understanding of vehicles and their condition for non-expert users. We develop and implement novel deep learning cascading architectures, which we define as conditional, multi-level networks that process raw audio to extract highly granular insights for vehicle understanding. To showcase the viability of cascading architectures, we build a multi-task convolutional neural network that predicts and cascades vehicle attributes to enhance misfire fault detection. We train and test these models on a synthesized dataset reflecting more than 40 hours of augmented audio. Through cascading fuel type, engine configuration, cylinder count and aspiration type attributes, our cascading CNN achieves 87.0% test set accuracy on misfire fault detection which demonstrates margins of 8.0% and 1.7% over naïve and parallel CNN baselines. We explore experimental studies focused on acoustic features, data augmentation, and data reliability. Finally, we conclude with a discussion of broader implications, future directions, and application areas for this work.
在依赖道路运输的世界中,了解汽车至关重要。我们提出了一种声学道路车辆特征描述系统,作为一种综合方法,利用移动设备捕获的声音来增强非专业用户对车辆及其状况的透明度和理解。我们开发并实现了新颖的深度学习级联架构,我们将其定义为条件、多层次网络,用于处理原始音频以提取车辆理解的高度细粒度见解。为了展示级联架构的可行性,我们构建了一个多任务卷积神经网络,该网络可以预测和级联车辆属性,以增强点火故障检测。我们在一个反映超过 40 小时增强音频的合成数据集上训练和测试这些模型。通过级联燃料类型、发动机配置、气缸数和进气类型属性,我们的级联 CNN 在点火故障检测方面的测试集准确率达到 87.0%,与朴素和并行 CNN 基线相比,分别高出 8.0%和 1.7%。我们探讨了侧重于声学特征、数据增强和数据可靠性的实验研究。最后,我们讨论了这项工作的更广泛影响、未来方向和应用领域。